[Page version: 20240921 21:27:00]
Measurements for amd64, prodesk, crypto_kem Test results for amd64, prodesk, crypto_kem Test results for crypto_kem/newhope512ccaTime | Object size | Test size | Implementation | Compiler | Benchmark date | SUPERCOP version |
---|---|---|---|---|---|---|
204624 | 165388 0 0 | 188646 832 1576 | T:avx2 | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240920 | 20240909 |
205992 | 158331 0 0 | 181294 832 1576 | T:avx2 | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240920 | 20240909 |
223876 | 242664 0 0 | 264909 760 1608 | T:avx2 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240920 | 20240909 |
228376 | 140429 0 0 | 160404 824 1640 | T:avx2 | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240920 | 20240909 |
250604 | 142029 0 0 | 162413 760 1608 | T:avx2 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240920 | 20240909 |
267088 | 142932 0 0 | 162222 832 1576 | T:avx2 | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240920 | 20240909 |
303012 | 139362 0 0 | 158109 752 1576 | T:avx2 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240920 | 20240909 |
303296 | 139510 0 0 | 159405 760 1608 | T:avx2 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240920 | 20240909 |
380392 | 34653 0 0 | 58214 832 1576 | T:ref | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240920 | 20240909 |
395232 | 30434 0 0 | 53718 832 1576 | T:ref | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240920 | 20240909 |
430692 | 22965 0 0 | 45014 832 1576 | T:ref | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240920 | 20240909 |
442172 | 52715 0 0 | 74805 760 1608 | T:ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240920 | 20240909 |
453648 | 12437 0 0 | 32868 824 1640 | T:ref | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240920 | 20240909 |
512620 | 12951 0 0 | 32678 832 1576 | T:ref | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240920 | 20240909 |
551624 | 13253 0 0 | 33493 760 1608 | T:ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240920 | 20240909 |
652344 | 12615 0 0 | 32341 760 1608 | T:ref | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240920 | 20240909 |
704544 | 11776 0 0 | 30357 752 1576 | T:ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240920 | 20240909 |
KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:146:9: error: always_inline function '_mm256_loadu_si256' requires target feature 'avx', but would be inlined into function 'KeccakP1600times4_AddLanesAll' that is compiled without support for 'avx' KeccakP-1600-times4-SIMD256.c: Xor_In4( 0 ); KeccakP-1600-times4-SIMD256.c: ^ KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:135:42: note: expanded from macro 'Xor_In4' KeccakP-1600-times4-SIMD256.c: #define Xor_In4( argIndex ) lanes0 = LOAD256u( curData0[argIndex]),\ KeccakP-1600-times4-SIMD256.c: ^ KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:44:37: note: expanded from macro 'LOAD256u' KeccakP-1600-times4-SIMD256.c: #define LOAD256u(a) _mm256_loadu_si256((const V256 *)&(a)) KeccakP-1600-times4-SIMD256.c: ^ KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:146:9: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:135:42: note: expanded from macro 'Xor_In4' KeccakP-1600-times4-SIMD256.c: #define Xor_In4( argIndex ) lanes0 = LOAD256u( curData0[argIndex]),\ KeccakP-1600-times4-SIMD256.c: ^ KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:44:37: note: expanded from macro 'LOAD256u' KeccakP-1600-times4-SIMD256.c: #define LOAD256u(a) _mm256_loadu_si256((const V256 *)&(a)) KeccakP-1600-times4-SIMD256.c: ^ KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:146:9: error: always_inline function '_mm256_loadu_si256' requires target feature 'avx', but would be inlined into function 'KeccakP1600times4_AddLanesAll' that is compiled without support for 'avx' KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:136:42: note: expanded from macro 'Xor_In4' KeccakP-1600-times4-SIMD256.c: lanes1 = LOAD256u( curData1[argIndex]),\ KeccakP-1600-times4-SIMD256.c: ^ KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:44:37: note: expanded from macro 'LOAD256u' KeccakP-1600-times4-SIMD256.c: #define LOAD256u(a) _mm256_loadu_si256((const V256 *)&(a)) KeccakP-1600-times4-SIMD256.c: ^ KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:146:9: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:136:42: note: expanded from macro 'Xor_In4' KeccakP-1600-times4-SIMD256.c: ...
Implementation | Compiler |
T:avx2 | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) |
KeccakP-1600-times4-SIMD256.o KeccakF1600times4_FastLoop_Absorb T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_12rounds_FastLoop_Absorb T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_AddBytes T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_AddLanesAll T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_ExtractAndAddBytes T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_ExtractAndAddLanesAll T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_ExtractBytes T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_ExtractLanesAll T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_InitializeAll T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_OverwriteBytes T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_OverwriteLanesAll T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_OverwriteWithZeroes T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_PermuteAll_12rounds T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_PermuteAll_24rounds T consts.o neg2 R consts.o neg4 R consts.o q_vector R consts.o qinv_vector R cpapke.o cpapke_dec T cpapke.o cpapke_enc T cpapke.o cpapke_keypair T fips202.o KeccakF1600_StatePermute T fips202.o shake128_absorb T fips202.o shake128_squeezeblocks T fips202.o shake256 T fips202x4.o shake128x4 T fips202x4.o shake256x4 T ntt.o bitrev_vector T ntt.o mul_coefficients T ntt.o ntt T ntt_double.o _ntt_double1024 T ntt_double.o _ntt_double512 T ntt_double.o _pwmul_double1024 T ntt_double.o _pwmul_double512 T ntt_double.o ntt_double1024 T ntt_double.o ntt_double512 T ntt_double.o pwmul_double1024 T ntt_double.o pwmul_double512 T poly.o poly_add T poly.o poly_compress T poly.o poly_decompress T poly.o poly_frombytes T poly.o poly_frommsg T poly.o poly_invntt T poly.o poly_ntt T poly.o poly_pointwise T poly.o poly_sample T poly.o poly_sub T poly.o poly_tobytes T poly.o poly_tomsg T poly.o poly_uniform T precomp.o bitrev_lut R precomp.o omegas_double R precomp.o omegas_inv_double R precomp.o psis_bitrev R precomp.o psis_inv R reduce.o barrett_reduce T reduce.o montgomery_reduce T verify.o cmov T verify.o verify T
Implementation | Compiler |
T:avx2 | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) |
T:avx2 | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) |
T:avx2 | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) |
T:avx2 | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) |
T:avx2 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
T:avx2 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
T:avx2 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
T:avx2 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
cpapke.o cpapke_dec T cpapke.o cpapke_enc T cpapke.o cpapke_keypair T fips202.o KeccakF1600_StatePermute T fips202.o shake128_absorb T fips202.o shake128_squeezeblocks T fips202.o shake256 T ntt.o bitrev_vector T ntt.o mul_coefficients T ntt.o ntt T poly.o poly_add T poly.o poly_compress T poly.o poly_decompress T poly.o poly_frombytes T poly.o poly_frommsg T poly.o poly_invntt T poly.o poly_mul_pointwise T poly.o poly_ntt T poly.o poly_sample T poly.o poly_sub T poly.o poly_tobytes T poly.o poly_tomsg T poly.o poly_uniform T precomp.o omegas_bitrev_montgomery R precomp.o omegas_inv_bitrev_montgomery R precomp.o psis_bitrev_montgomery R precomp.o psis_inv_montgomery R reduce.o montgomery_reduce T verify.o cmov T verify.o verify T
Implementation | Compiler |
T:ref | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) |
T:ref | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) |
T:ref | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) |
T:ref | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) |
T:ref | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) |
T:ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
T:ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
T:ref | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
T:ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |