Test results for amd64, wolfdale, crypto_kem/ntskem1264

[Page version: 20260412 01:29:25]

Measurements for amd64, wolfdale, crypto_kem Test results for amd64, wolfdale, crypto_kem Test results for crypto_kem/ntskem1264

Computer: wolfdale
Microarchitecture: amd64; Core 2 45nm (1067a)
Architecture: amd64
CPU ID: GenuineIntel-0001067a-bfebfbff
SUPERCOP version: 20260330
Operation: crypto_kem
Primitive: ntskem1264

Time	Object size	Test size	Implementation	Compiler	Benchmark date	SUPERCOP version
1179556	121483 6228 16	149880 7512 2232	`T:opt`	`clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20260331	20260217
1204519	85336 6228 16	114348 7496 2328	`T:opt`	`gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20260331	20260217
1209802	121580 6228 16	149768 7512 2232	`T:opt`	`clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20260331	20260217
1223772	81360 6228 16	108920 7512 2232	`T:opt`	`clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20260331	20260217
1249475	55899 6228 16	84108 7496 2328	`T:opt`	`gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20260331	20260217
1255969	58892 6228 16	84334 7504 2232	`T:opt`	`clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20260331	20260217
1336528	54060 6228 16	81564 7496 2328	`T:opt`	`gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20260331	20260217
1454673	72127 6228 16	98750 7504 2232	`T:opt`	`clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20260331	20260217
1746257	49597 6228 16	75779 7488 2296	`T:opt`	`gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20260331	20260217
8530679	36226 76 16	64024 1352 2232	`T:ref`	`clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20260331	20260217
8958972	42063 76 16	70720 1352 2232	`T:ref`	`clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20260331	20260217
9422186	41366 76 16	69824 1352 2232	`T:ref`	`clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20260331	20260217
9502901	21834 76 16	47326 1344 2232	`T:ref`	`clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20260331	20260217
9899689	45604 76 16	74556 1320 2328	`T:ref`	`gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20260331	20260217
10505708	23807 76 16	50406 1344 2232	`T:ref`	`clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20260331	20260217
10745609	24778 76 16	52884 1320 2328	`T:ref`	`gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20260331	20260217
11021273	22195 76 16	49692 1320 2328	`T:ref`	`gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20260331	20260217
11592695	19173 76 16	45371 1312 2296	`T:ref`	`gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20260331	20260217

Test failure

error 139
Segmentation fault

Number of similar (implementation,compiler) pairs: 9, namely:

Implementation	Compiler
`T:sse2`	`clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)`
`T:sse2`	`clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)`
`T:sse2`	`clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)`
`T:sse2`	`clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)`
`T:sse2`	`clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)`
`T:sse2`	`gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)`
`T:sse2`	`gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)`
`T:sse2`	`gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)`
`T:sse2`	`gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)`

Compiler output

bitslice_fft_256.c: bitslice_fft_256.c:87:25: error: always_inline function '_mm256_set_epi64x' requires target feature 'avx', but would be inlined into function 'bitslice_butterflies12_256' that is compiled without support for 'avx'
bitslice_fft_256.c:             out[i][b] = _mm256_set_epi64x(-((in[0][b] >> reversal[4*i+3]) & 1),
bitslice_fft_256.c:                         ^
bitslice_fft_256.c: bitslice_fft_256.c:87:25: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
bitslice_fft_256.c: bitslice_fft_256.c:99:22: error: '__builtin_ia32_pshufd256' needs target feature avx2
bitslice_fft_256.c:                 vb = _mm256_shuffle_epi32(tmp[b], _MM_SHUFFLE(3, 2, 3, 2));
bitslice_fft_256.c:                      ^
bitslice_fft_256.c: /usr/lib/llvm-11/lib/clang/11.0.1/include/avx2intrin.h:470:12: note: expanded from macro '_mm256_shuffle_epi32'
bitslice_fft_256.c:   (__m256i)__builtin_ia32_pshufd256((__v8si)(__m256i)(a), (int)(imm))
bitslice_fft_256.c:            ^
bitslice_fft_256.c: bitslice_fft_256.c:100:22: error: '__builtin_ia32_pslldqi256_byteshift' needs target feature avx2
bitslice_fft_256.c:                 va = _mm256_slli_si256(out[k][b], 8);
bitslice_fft_256.c:                      ^
bitslice_fft_256.c: /usr/lib/llvm-11/lib/clang/11.0.1/include/avx2intrin.h:497:12: note: expanded from macro '_mm256_slli_si256'
bitslice_fft_256.c:   (__m256i)__builtin_ia32_pslldqi256_byteshift((__v4di)(__m256i)(a), (int)(imm))
bitslice_fft_256.c:            ^
bitslice_fft_256.c: bitslice_fft_256.c:101:22: error: always_inline function '_mm256_xor_si256' requires target feature 'avx2', but would be inlined into function 'bitslice_butterflies12_256' that is compiled without support for 'avx2'
bitslice_fft_256.c:                 vb = _mm256_xor_si256(va, vb);
bitslice_fft_256.c:                      ^
bitslice_fft_256.c: bitslice_fft_256.c:101:22: error: AVX vector argument of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
bitslice_fft_256.c: bitslice_fft_256.c:102:29: error: always_inline function '_mm256_xor_si256' requires target feature 'avx2', but would be inlined into function 'bitslice_butterflies12_256' that is compiled without support for 'avx2'
bitslice_fft_256.c:                 out[k][b] = _mm256_xor_si256(out[k][b], vb);
bitslice_fft_256.c:                             ^
bitslice_fft_256.c: bitslice_fft_256.c:102:29: error: AVX vector argument of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
bitslice_fft_256.c: bitslice_fft_256.c:112:22: error: always_inline function '_mm256_set_epi64x' requires target feature 'avx', but would be inlined into function 'bitslice_butterflies12_256' that is compiled without support for 'avx'
bitslice_fft_256.c: ...

Number of similar (implementation,compiler) pairs: 5, namely:

Implementation	Compiler
`T:avx2`	`clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)`
`T:avx2`	`clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)`
`T:avx2`	`clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)`
`T:avx2`	`clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)`
`T:avx2`	`clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)`

Compiler output

bitslice_fft_256.c: bitslice_fft_256.c: In function 'bitslice_butterflies12_256':
bitslice_fft_256.c: bitslice_fft_256.c:87:23: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
bitslice_fft_256.c:    87 |             out[i][b] = _mm256_set_epi64x(-((in[0][b] >> reversal[4*i+3]) & 1),
bitslice_fft_256.c:       |             ~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
bitslice_fft_256.c:    88 |                                           -((in[0][b] >> reversal[4*i+2]) & 1),
bitslice_fft_256.c:       |                                           ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
bitslice_fft_256.c:    89 |                                           -((in[0][b] >> reversal[4*i+1]) & 1),
bitslice_fft_256.c:       |                                           ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
bitslice_fft_256.c:    90 |                                           -((in[0][b] >> reversal[4*i+0]) & 1));
bitslice_fft_256.c:       |                                           ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
bitslice_fft_256.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/10/include/immintrin.h:51,
bitslice_fft_256.c:                  from bitslice_fft_256.h:21,
bitslice_fft_256.c:                  from bitslice_fft_256.c:18:
bitslice_fft_256.c: /usr/lib/gcc/x86_64-linux-gnu/10/include/avxintrin.h:1296:1: error: inlining failed in call to 'always_inline' '_mm256_set_epi64x': target specific option mismatch
bitslice_fft_256.c:  1296 | _mm256_set_epi64x (long long __A, long long __B, long long __C,
bitslice_fft_256.c:       | ^~~~~~~~~~~~~~~~~
bitslice_fft_256.c: bitslice_fft_256.c:87:25: note: called from here
bitslice_fft_256.c:    87 |             out[i][b] = _mm256_set_epi64x(-((in[0][b] >> reversal[4*i+3]) & 1),
bitslice_fft_256.c:       |                         ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
bitslice_fft_256.c:    88 |                                           -((in[0][b] >> reversal[4*i+2]) & 1),
bitslice_fft_256.c:       |                                           ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
bitslice_fft_256.c:    89 |                                           -((in[0][b] >> reversal[4*i+1]) & 1),
bitslice_fft_256.c:       |                                           ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
bitslice_fft_256.c:    90 |                                           -((in[0][b] >> reversal[4*i+0]) & 1));
bitslice_fft_256.c:       |                                           ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
bitslice_fft_256.c: ...

Number of similar (implementation,compiler) pairs: 4, namely:

Implementation	Compiler
`T:avx2`	`gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)`
`T:avx2`	`gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)`
`T:avx2`	`gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)`
`T:avx2`	`gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)`

Namespace violations

bitslice_bma_64.o bitslice_bma T
bitslice_fft_64.o _twist_factors12_64_64 R
bitslice_fft_64.o a64_consts_64 D
bitslice_fft_64.o bitslice_fft12_64 T
bitslice_fft_64.o twiddle_factors R
bitslice_fft_64.o twist_factors D
ff.o bitslice_mul12_64 T
ff.o ff_add_m T
ff.o ff_create T
ff.o ff_inv_12 T
ff.o ff_mul_12 T
ff.o ff_reduce_12 T
ff.o ff_release T
ff.o ff_sqr_12 T
ff.o vector_ff_inv_12 T
ff.o vector_ff_mul_12 T
ff.o vector_ff_pow16_12 T
ff.o vector_ff_pow4_12 T
ff.o vector_ff_pow8_12 T
ff.o vector_ff_sqr_12 T
ff.o vector_ff_sqr_inv_12 T
ff.o vector_ff_transpose_xor_12 T
keccak.o shake_256 T
m4r.o _gray_codes_lut D
m4r.o _gray_codes_lut2 R
m4r.o _gray_codes_lut3 R
m4r.o _gray_codes_lut4 R
m4r.o _gray_codes_lut5 R
m4r.o _gray_codes_lut6 R
m4r.o _gray_codes_lut7 R
m4r.o _gray_codes_lut8 R
m4r.o _m4ri_gauss_submatrix T
m4r.o _m4ri_make_table_rev T
m4r.o m4r_rref T
matrix_ff2.o alloc_matrix_ff2 T
matrix_ff2.o calloc_matrix_ff2 T
matrix_ff2.o clone_matrix_ff2 T
matrix_ff2.o column_swap_matrix_ff2 T
matrix_ff2.o free_matrix_ff2 T
matrix_ff2.o is_equal_matrix_ff2 T
matrix_ff2.o reduce_row_echelon_matrix_ff2 T
matrix_ff2.o zero_matrix_ff2 T
nts_kem.o compute_syndrome T
nts_kem.o correct_error_and_recover_ke T
nts_kem.o create_matrix_G T
nts_kem.o create_random_goppa_polynomial T
nts_kem.o deserialise_private_key T
nts_kem.o fisher_yates_shuffle T
nts_kem.o is_valid_goppa_polynomial T
nts_kem.o load_input_ciphertext T
nts_kem.o nts_kem_ciphertext_size T
nts_kem.o nts_kem_create T
nts_kem.o nts_kem_decapsulate T
nts_kem.o nts_kem_encapsulate T
nts_kem.o nts_kem_init_from_private_key T
nts_kem.o nts_kem_key_size T
nts_kem.o nts_kem_release T
nts_kem.o pack_buffer T
nts_kem.o random_vector T
nts_kem.o serialise_private_key T
nts_kem.o serialise_public_key T
nts_kem.o unpack_buffer T
polynomial.o clone_poly T
polynomial.o create_random_poly T
polynomial.o evaluate_poly T
polynomial.o formal_derivative_poly T
polynomial.o free_poly T
polynomial.o gcd_poly T
polynomial.o init_poly T
polynomial.o is_equal_poly T
polynomial.o modulo_reduce_poly T
polynomial.o poly_from_roots T
polynomial.o update_poly_degree T
polynomial.o zero_poly T
random.o random_uint16_bounded T
random.o randombit T
vector_utils.o vector_ff_or_64 T
vector_utils.o vector_load_1d_64 T
vector_utils.o vector_load_2d_64 T
vector_utils.o vector_store_1d_64 T
vector_utils.o vector_store_2d_64 T

Number of similar (implementation,compiler) pairs: 9, namely:

Implementation	Compiler
`T:opt`	`clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)`
`T:opt`	`clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)`
`T:opt`	`clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)`
`T:opt`	`clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)`
`T:opt`	`clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)`
`T:opt`	`gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)`
`T:opt`	`gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)`
`T:opt`	`gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)`
`T:opt`	`gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)`

Namespace violations

additive_fft.o _additive_fft_construct_basis T
additive_fft.o _additive_fft_core T
additive_fft.o _additive_fft_ff_value T
additive_fft.o additive_fft T
additive_fft.o ff_poly_taylor_expansion T
berlekamp_massey.o berlekamp_massey T
ff.o ff_add_m T
ff.o ff_create T
ff.o ff_inv_12 T
ff.o ff_mul_12 T
ff.o ff_reduce_12 T
ff.o ff_release T
ff.o ff_sqr_12 T
keccak.o Keccak T
keccak.o KeccakF1600_StatePermute T
keccak.o LFSR86540 T
keccak.o sha3_256 T
keccak.o sha3_384 T
keccak.o sha3_512 T
keccak.o shake_256 T
m4r.o _gray_codes_lut D
m4r.o _gray_codes_lut2 R
m4r.o _gray_codes_lut3 R
m4r.o _gray_codes_lut4 R
m4r.o _gray_codes_lut5 R
m4r.o _gray_codes_lut6 R
m4r.o _gray_codes_lut7 R
m4r.o _gray_codes_lut8 R
m4r.o _m4ri_gauss_submatrix T
m4r.o _m4ri_make_table_rev T
m4r.o m4r_rref T
matrix_ff2.o alloc_matrix_ff2 T
matrix_ff2.o calloc_matrix_ff2 T
matrix_ff2.o clone_matrix_ff2 T
matrix_ff2.o column_swap_matrix_ff2 T
matrix_ff2.o free_matrix_ff2 T
matrix_ff2.o is_equal_matrix_ff2 T
matrix_ff2.o reduce_row_echelon_matrix_ff2 T
matrix_ff2.o zero_matrix_ff2 T
nts_kem.o compute_syndrome T
nts_kem.o correct_error_and_recover_ke T
nts_kem.o create_matrix_G T
nts_kem.o create_random_goppa_polynomial T
nts_kem.o deserialise_private_key T
nts_kem.o fisher_yates_shuffle T
nts_kem.o is_valid_goppa_polynomial T
nts_kem.o nts_kem_ciphertext_size T
nts_kem.o nts_kem_create T
nts_kem.o nts_kem_decapsulate T
nts_kem.o nts_kem_encapsulate T
nts_kem.o nts_kem_init_from_private_key T
nts_kem.o nts_kem_key_size T
nts_kem.o nts_kem_release T
nts_kem.o pack_buffer T
nts_kem.o random_vector T
nts_kem.o serialise_private_key T
nts_kem.o serialise_public_key T
nts_kem.o unpack_buffer T
polynomial.o clone_poly T
polynomial.o create_random_poly T
polynomial.o evaluate_poly T
polynomial.o formal_derivative_poly T
polynomial.o free_poly T
polynomial.o gcd_poly T
polynomial.o init_poly T
polynomial.o is_equal_poly T
polynomial.o modulo_reduce_poly T
polynomial.o poly_from_roots T
polynomial.o update_poly_degree T
polynomial.o zero_poly T
random.o random_uint16_bounded T
random.o randombit T
stack.o stack_create T
stack.o stack_free T
stack.o stack_pop T
stack.o stack_push T
stack.o stack_size T

Number of similar (implementation,compiler) pairs: 9, namely:

Implementation	Compiler
`T:ref`	`clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)`
`T:ref`	`clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)`
`T:ref`	`clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)`
`T:ref`	`clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)`
`T:ref`	`clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)`
`T:ref`	`gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)`
`T:ref`	`gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)`
`T:ref`	`gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)`
`T:ref`	`gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)`