Test results for amd64, gemini, crypto_sign/nccsign5
[Page version: 20251005 00:07:22]
Measurements for amd64, gemini, crypto_sign
Test results for amd64, gemini, crypto_sign
Test results for crypto_sign/nccsign5
Computer: gemini
Microarchitecture: amd64; Goldmont Plus (706a8)
Architecture: amd64
CPU ID: GenuineIntel-000706a8-bfebfbff 
SUPERCOP version: 20250922
Operation: crypto_sign
Primitive: nccsign5
| Time | Object size | Test size | Implementation | Compiler | Benchmark date | SUPERCOP version | 
|---|
| 3971073 | 36835 18428 0 | 45832 19260 1720 | ref | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250504 | 20250415 | 
| 4029450 | 39899 18428 0 | 50216 19260 1720 | ref | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250504 | 20250415 | 
| 4065769 | 42463 18428 0 | 54424 19260 1720 | ref | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250504 | 20250415 | 
| 4127461 | 86338 18428 0 | 83808 19228 1784 | ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250504 | 20250415 | 
| 4130631 | 26496 18428 0 | 37960 19228 1784 | ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250504 | 20250415 | 
| 4394379 | 24195 18428 0 | 35662 19252 1720 | ref | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250504 | 20250415 | 
| 4410898 | 24063 18428 0 | 36008 19228 1784 | ref | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250504 | 20250415 | 
| 5114338 | 22400 18428 0 | 33168 19220 1752 | ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250504 | 20250415 | 
| 5342401 | 26318 18428 0 | 37390 19252 1720 | ref | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250504 | 20250415 | 
Compiler output
aes.c: aes.c:51:24: warning: unused function 'br_swap32' [-Wunused-function]
aes.c: static inline uint32_t br_swap32(uint32_t x)
aes.c:                        ^
aes.c: 1 warning generated.
poly.c: poly.c:904:17: error: always_inline function '_mm256_set_epi32' requires target feature 'avx', but would be inlined into function 'vector_4way_to_3way' that is compiled without support for 'avx'
poly.c:                 out3[i*4+0] = _mm256_set_epi32(0,0,0,temp0[4],0,temp0[2],0,temp0[0]);
poly.c:                               ^
poly.c: poly.c:904:17: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
poly.c: poly.c:905:17: error: always_inline function '_mm256_set_epi32' requires target feature 'avx', but would be inlined into function 'vector_4way_to_3way' that is compiled without support for 'avx'
poly.c:                 out3[i*4+1] = _mm256_set_epi32(0,0,0,temp1[2],0,temp1[0],0,temp0[6]);
poly.c:                               ^
poly.c: poly.c:905:17: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
poly.c: poly.c:906:17: error: always_inline function '_mm256_set_epi32' requires target feature 'avx', but would be inlined into function 'vector_4way_to_3way' that is compiled without support for 'avx'
poly.c:                 out3[i*4+2] = _mm256_set_epi32(0,0,0,temp2[0],0,temp1[6],0,temp1[4]);
poly.c:                               ^
poly.c: poly.c:906:17: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
poly.c: poly.c:907:17: error: always_inline function '_mm256_set_epi32' requires target feature 'avx', but would be inlined into function 'vector_4way_to_3way' that is compiled without support for 'avx'
poly.c:                 out3[i*4+3] = _mm256_set_epi32(0,0,0,temp2[6],0,temp2[4],0,temp2[2]);
poly.c:                               ^
poly.c: poly.c:907:17: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
poly.c: 8 errors generated.
Number of similar (implementation,compiler) pairs: 4, namely:
| Implementation | Compiler | 
| avx2 | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) | 
| avx2 | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) | 
| avx2 | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) | 
| avx2 | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) | 
Compiler output
aes.c: aes.c:660:5: error: '__builtin_ia32_aeskeygenassist128' needs target feature aes
aes.c:     BLOCK1(0x01);
aes.c:     ^
aes.c: aes.c:641:13: note: expanded from macro 'BLOCK1'
aes.c:     temp1 = _mm_aeskeygenassist_si128(temp2, IMM);                        \
aes.c:             ^
aes.c: /usr/lib/llvm-11/lib/clang/11.0.1/include/__wmmintrin_aes.h:136:12: note: expanded from macro '_mm_aeskeygenassist_si128'
aes.c:   (__m128i)__builtin_ia32_aeskeygenassist128((__v2di)(__m128i)(C), (int)(R))
aes.c:            ^
aes.c: aes.c:661:5: error: '__builtin_ia32_aeskeygenassist128' needs target feature aes
aes.c:     BLOCK2(0x01);
aes.c:     ^
aes.c: aes.c:651:13: note: expanded from macro 'BLOCK2'
aes.c:     temp1 = _mm_aeskeygenassist_si128(temp0, IMM);                        \
aes.c:             ^
aes.c: /usr/lib/llvm-11/lib/clang/11.0.1/include/__wmmintrin_aes.h:136:12: note: expanded from macro '_mm_aeskeygenassist_si128'
aes.c:   (__m128i)__builtin_ia32_aeskeygenassist128((__v2di)(__m128i)(C), (int)(R))
aes.c:            ^
aes.c: aes.c:663:5: error: '__builtin_ia32_aeskeygenassist128' needs target feature aes
aes.c:     BLOCK1(0x02);
aes.c:     ^
aes.c: aes.c:641:13: note: expanded from macro 'BLOCK1'
aes.c:     temp1 = _mm_aeskeygenassist_si128(temp2, IMM);                        \
aes.c:             ^
aes.c: /usr/lib/llvm-11/lib/clang/11.0.1/include/__wmmintrin_aes.h:136:12: note: expanded from macro '_mm_aeskeygenassist_si128'
aes.c: ...
Number of similar (implementation,compiler) pairs: 1, namely:
| Implementation | Compiler | 
| avx2 | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) | 
Compiler output
poly.c: poly.c: In function 'vector_4way_to_3way':
poly.c: poly.c:904:15: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
poly.c:   904 |   out3[i*4+0] = _mm256_set_epi32(0,0,0,temp0[4],0,temp0[2],0,temp0[0]);
poly.c:       |   ~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
poly.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/10/include/immintrin.h:51,
poly.c:                  from params.h:5,
poly.c:                  from poly.c:4:
poly.c: /usr/lib/gcc/x86_64-linux-gnu/10/include/avxintrin.h:1258:1: error: inlining failed in call to 'always_inline' '_mm256_set_epi32': target specific option mismatch
poly.c:  1258 | _mm256_set_epi32 (int __A, int __B, int __C, int __D,
poly.c:       | ^~~~~~~~~~~~~~~~
poly.c: poly.c:907:17: note: called from here
poly.c:   907 |   out3[i*4+3] = _mm256_set_epi32(0,0,0,temp2[6],0,temp2[4],0,temp2[2]);
poly.c:       |                 ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
poly.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/10/include/immintrin.h:51,
poly.c:                  from params.h:5,
poly.c:                  from poly.c:4:
poly.c: /usr/lib/gcc/x86_64-linux-gnu/10/include/avxintrin.h:1258:1: error: inlining failed in call to 'always_inline' '_mm256_set_epi32': target specific option mismatch
poly.c:  1258 | _mm256_set_epi32 (int __A, int __B, int __C, int __D,
poly.c:       | ^~~~~~~~~~~~~~~~
poly.c: poly.c:906:17: note: called from here
poly.c:   906 |   out3[i*4+2] = _mm256_set_epi32(0,0,0,temp2[0],0,temp1[6],0,temp1[4]);
poly.c:       |                 ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
poly.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/10/include/immintrin.h:51,
poly.c:                  from params.h:5,
poly.c:                  from poly.c:4:
poly.c: ...
Number of similar (implementation,compiler) pairs: 4, namely:
| Implementation | Compiler | 
| avx2 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) | 
| avx2 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| avx2 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| avx2 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
Namespace violations
aes.o aes128_ctr T
aes.o aes128_ctr_keyexp T
aes.o aes128_ctx_release T
aes.o aes128_ecb T
aes.o aes128_ecb_keyexp T
aes.o aes192_ctr T
aes.o aes192_ctr_keyexp T
aes.o aes192_ctx_release T
aes.o aes192_ecb T
aes.o aes192_ecb_keyexp T
aes.o aes256_ctr T
aes.o aes256_ctr_keyexp T
aes.o aes256_ctx_release T
aes.o aes256_ecb T
aes.o aes256_ecb_keyexp T
aes.o aes256ctr_init T
aes.o aes256ctr_prf T
aes.o aes256ctr_squeezeblocks T
fips202.o sha3_256 T
fips202.o sha3_512 T
fips202.o shake128 T
fips202.o shake128_absorb T
fips202.o shake128_finalize T
fips202.o shake128_init T
fips202.o shake128_squeeze T
fips202.o shake128_squeezeblocks T
fips202.o shake256 T
fips202.o shake256_absorb T
fips202.o shake256_finalize T
fips202.o shake256_init T
fips202.o shake256_squeeze T
fips202.o shake256_squeezeblocks T
packing.o pack_pk T
packing.o pack_sig T
packing.o pack_sk T
packing.o polyeta_pack T
packing.o polyeta_unpack T
packing.o polyt0_pack T
packing.o polyt0_unpack T
packing.o polyt1_pack T
packing.o polyt1_unpack T
packing.o polyw1_pack T
packing.o polyz_pack T
packing.o polyz_unpack T
packing.o unpack_pk T
packing.o unpack_sig T
packing.o unpack_sk T
poly.o base_mul T
poly.o invntt_tomont T
poly.o ntt T
poly.o pointwise_mul T
poly.o poly_add T
poly.o poly_base_mul T
poly.o poly_caddq T
poly.o poly_challenge T
poly.o poly_chknorm T
poly.o poly_decompose T
poly.o poly_make_hint T
poly.o poly_modadd T
poly.o poly_modsub T
poly.o poly_mul_schoolbook T
poly.o poly_power2round T
poly.o poly_reduce T
poly.o poly_shiftl T
poly.o poly_sub T
poly.o poly_uniform T
poly.o poly_uniform_eta T
poly.o poly_uniform_gamma1 T
poly.o poly_use_hint T
poly.o reduce_modQ T
poly.o zetas D
poly.o zetas_inv D
reduce.o caddq T
reduce.o csubq T
reduce.o freeze T
reduce.o mod_add T
reduce.o mod_sub T
reduce.o montgomery_reduce T
reduce.o reduce32 T
rounding.o decompose T
rounding.o make_hint T
rounding.o power2round T
rounding.o use_hint T
sign.o crypto_sign_signature T
sign.o crypto_sign_verify T
symmetric-aes.o NIMS_aes256ctr_init T
symmetric-shake.o NIMS_shake128_stream_init T
symmetric-shake.o NIMS_shake256_stream_init T
Number of similar (implementation,compiler) pairs: 9, namely:
| Implementation | Compiler | 
| ref | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) | 
| ref | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) | 
| ref | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) | 
| ref | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) | 
| ref | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) | 
| ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) | 
| ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| ref | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
Passed TIMECOP
TIMECOP iterations: 10
Number of similar (implementation,compiler) pairs: 9, namely:
| Implementation | Compiler | 
| ref | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) | 
| ref | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) | 
| ref | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) | 
| ref | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) | 
| ref | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) | 
| ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) | 
| ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| ref | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) | 
| ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |