Test results for amd64, comet, crypto_kem/ntskem1264

[Page version: 20241024 06:31:28]

Measurements for amd64, comet, crypto_kem Test results for amd64, comet, crypto_kem Test results for crypto_kem/ntskem1264

Computer: comet
Microarchitecture: amd64; Comet Lake (806ec)
Architecture: amd64
CPU ID: GenuineIntel-000806ec-bfebfbff
SUPERCOP version: 20241022
Operation: crypto_kem
Primitive: ntskem1264

Time	Object size	Test size	Implementation	Compiler	Benchmark date	SUPERCOP version
369294	98444 6228 16	119203 7132 1728	`T:avx2`	`clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20240711	20240625
408520	92992 6228 16	113867 7132 1728	`T:sse2`	`clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20240711	20240625
416495	163781 6228 16	184075 7132 1824	`T:sse2`	`clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20240711	20240625
432634	113496 6228 16	133758 7076 1792	`T:avx2`	`gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20240711	20240625
445162	66934 6228 16	85334 7076 1792	`T:avx2`	`gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20240711	20240625
461851	62391 6228 16	80953 7124 1824	`T:avx2`	`clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20240711	20240625
464100	65158 6228 16	82990 7076 1792	`T:avx2`	`gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20240711	20240625
481577	84871 6228 16	102731 7132 1728	`T:avx2`	`clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20240711	20240625
482689	58006 6228 16	76569 7124 1824	`T:sse2`	`clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20240711	20240625
556151	60305 6228 16	77141 7068 1760	`T:avx2`	`gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20240711	20240625
557526	80066 6228 16	97971 7132 1728	`T:sse2`	`clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20240711	20240625
562033	141768 6228 16	163635 7132 1824	`T:opt`	`clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20240711	20240625
580691	150971 6228 16	171123 7132 1728	`T:sse2`	`clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20240711	20240625
640295	101245 6228 16	123126 7076 1792	`T:opt`	`gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20240711	20240625
675111	91202 6228 16	113651 7132 1728	`T:opt`	`clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20240711	20240625
696599	56088 6228 16	76158 7076 1792	`T:opt`	`gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20240711	20240625
708539	53374 6228 16	73369 7124 1824	`T:opt`	`clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20240711	20240625
724553	145497 6228 16	167003 7132 1728	`T:opt`	`clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20240711	20240625
744417	53881 6228 16	73358 7076 1792	`T:opt`	`gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20240711	20240625
771715	49696 6228 16	67917 7068 1760	`T:opt`	`gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20240711	20240625
818156	75587 6228 16	95003 7132 1728	`T:opt`	`clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20240711	20240625
5951393	52754 76 16	75555 972 1728	`T:ref`	`clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20240711	20240625
6008668	63869 76 16	85750 900 1792	`T:ref`	`gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20240711	20240625
6027389	38066 76 16	59979 972 1728	`T:ref`	`clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20240711	20240625
6182417	58380 76 16	81251 972 1824	`T:ref`	`clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20240711	20240625
6697502	24737 76 16	44702 900 1792	`T:ref`	`gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20240711	20240625
6814577	29161 76 16	48555 972 1728	`T:ref`	`clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20240711	20240625
7099775	22650 76 16	42865 964 1824	`T:ref`	`clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20240711	20240625
7288524	22358 76 16	41774 900 1792	`T:ref`	`gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20240711	20240625
7539025	19046 76 16	37253 892 1760	`T:ref`	`gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20240711	20240625

Test failure

error 111

Number of similar (implementation,compiler) pairs: 1, namely:

Implementation	Compiler
`T:avx2`	`clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)`

Compiler output

bitslice_fft_256.c: bitslice_fft_256.c:87:25: error: always_inline function '_mm256_set_epi64x' requires target feature 'avx', but would be inlined into function 'bitslice_butterflies12_256' that is compiled without support for 'avx'
bitslice_fft_256.c:             out[i][b] = _mm256_set_epi64x(-((in[0][b] >> reversal[4*i+3]) & 1),
bitslice_fft_256.c:                         ^
bitslice_fft_256.c: bitslice_fft_256.c:87:25: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
bitslice_fft_256.c: bitslice_fft_256.c:99:22: error: '__builtin_ia32_pshufd256' needs target feature avx2
bitslice_fft_256.c:                 vb = _mm256_shuffle_epi32(tmp[b], _MM_SHUFFLE(3, 2, 3, 2));
bitslice_fft_256.c:                      ^
bitslice_fft_256.c: /usr/lib/llvm-14/lib/clang/14.0.6/include/avx2intrin.h:470:13: note: expanded from macro '_mm256_shuffle_epi32'
bitslice_fft_256.c:   ((__m256i)__builtin_ia32_pshufd256((__v8si)(__m256i)(a), (int)(imm)))
bitslice_fft_256.c:             ^
bitslice_fft_256.c: bitslice_fft_256.c:100:22: error: '__builtin_ia32_pslldqi256_byteshift' needs target feature avx2
bitslice_fft_256.c:                 va = _mm256_slli_si256(out[k][b], 8);
bitslice_fft_256.c:                      ^
bitslice_fft_256.c: /usr/lib/llvm-14/lib/clang/14.0.6/include/avx2intrin.h:497:13: note: expanded from macro '_mm256_slli_si256'
bitslice_fft_256.c:   ((__m256i)__builtin_ia32_pslldqi256_byteshift((__v4di)(__m256i)(a), (int)(imm)))
bitslice_fft_256.c:             ^
bitslice_fft_256.c: bitslice_fft_256.c:101:22: error: always_inline function '_mm256_xor_si256' requires target feature 'avx2', but would be inlined into function 'bitslice_butterflies12_256' that is compiled without support for 'avx2'
bitslice_fft_256.c:                 vb = _mm256_xor_si256(va, vb);
bitslice_fft_256.c:                      ^
bitslice_fft_256.c: bitslice_fft_256.c:101:22: error: AVX vector argument of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
bitslice_fft_256.c: bitslice_fft_256.c:102:29: error: always_inline function '_mm256_xor_si256' requires target feature 'avx2', but would be inlined into function 'bitslice_butterflies12_256' that is compiled without support for 'avx2'
bitslice_fft_256.c:                 out[k][b] = _mm256_xor_si256(out[k][b], vb);
bitslice_fft_256.c:                             ^
bitslice_fft_256.c: bitslice_fft_256.c:102:29: error: AVX vector argument of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
bitslice_fft_256.c: bitslice_fft_256.c:112:22: error: always_inline function '_mm256_set_epi64x' requires target feature 'avx', but would be inlined into function 'bitslice_butterflies12_256' that is compiled without support for 'avx'
bitslice_fft_256.c: ...

Number of similar (implementation,compiler) pairs: 1, namely:

Implementation	Compiler
`T:avx2`	`clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6)`

Compiler output

m4r.c: m4r.c: In function 'zero_vector':
m4r.c: m4r.c:85:20: error: incompatible types when assigning to type 'vector' {aka '__m128i'} from type '__m256i'
m4r.c:    85 |         *vec_ptr = _mm256_setzero_si256(); vec_ptr++;
m4r.c:       |                    ^~~~~~~~~~~~~~~~~~~~
m4r.c: m4r.c:86:20: error: incompatible types when assigning to type 'vector' {aka '__m128i'} from type '__m256i'
m4r.c:    86 |         *vec_ptr = _mm256_setzero_si256(); vec_ptr++;
m4r.c:       |                    ^~~~~~~~~~~~~~~~~~~~
m4r.c: m4r.c: In function '_m4ri_make_table_rev':
m4r.c: m4r.c:147:12: error: incompatible types when assigning to type 'vector' {aka '__m128i'} from type '__m256i'
m4r.c:   147 |     mask = _mm256_set_epi64x(v[3], v[2], v[1], v[0]);
m4r.c:       |            ^~~~~~~~~~~~~~~~~
m4r.c: m4r.c:196:46: error: incompatible type for argument 1 of '_mm256_and_si256'
m4r.c:   196 |     S_ptr[nblocks-1] = _mm256_and_si256(S_ptr[nblocks-1], mask);
m4r.c:       |                                         ~~~~~^~~~~~~~~~~
m4r.c:       |                                              |
m4r.c:       |                                              vector {aka __m128i}
m4r.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/12/include/immintrin.h:47,
m4r.c:                  from bits.h:28,
m4r.c:                  from m4r.c:26:
m4r.c: /usr/lib/gcc/x86_64-linux-gnu/12/include/avx2intrin.h:179:27: note: expected '__m256i' but argument is of type 'vector' {aka '__m128i'}
m4r.c:   179 | _mm256_and_si256 (__m256i __A, __m256i __B)
m4r.c:       |                   ~~~~~~~~^~~
m4r.c: m4r.c:196:59: error: incompatible type for argument 2 of '_mm256_and_si256'
m4r.c:   196 |     S_ptr[nblocks-1] = _mm256_and_si256(S_ptr[nblocks-1], mask);
m4r.c:       |                                                           ^~~~
m4r.c: ...

Number of similar (implementation,compiler) pairs: 4, namely:

Implementation	Compiler
`T:sse2`	`gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)`
`T:sse2`	`gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)`
`T:sse2`	`gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)`
`T:sse2`	`gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0)`