[Page version: 20240911 12:47:05]
Measurements for amd64, like, crypto_kem Test results for amd64, like, crypto_kem Test results for crypto_kem/newhope512ccaTime | Object size | Test size | Implementation | Compiler | Benchmark date | SUPERCOP version |
---|---|---|---|---|---|---|
174375 | 153191 0 0 | 174616 820 1752 | T:avx2 | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240806 | 20240716 |
177118 | 159602 0 0 | 181144 820 1752 | T:avx2 | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240806 | 20240716 |
212161 | 140147 0 0 | 158278 812 1752 | T:avx2 | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240806 | 20240716 |
214473 | 233800 0 0 | 255472 788 1784 | T:avx2 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240806 | 20240716 |
252014 | 144022 0 0 | 163880 788 1784 | T:avx2 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240806 | 20240716 |
262057 | 141011 0 0 | 159606 812 1752 | T:avx2 | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240806 | 20240716 |
279529 | 139481 0 0 | 158808 788 1784 | T:avx2 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240806 | 20240716 |
293491 | 138432 0 0 | 156696 780 1752 | T:avx2 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240806 | 20240716 |
356791 | 24671 0 0 | 46584 820 1752 | T:ref | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240806 | 20240716 |
360411 | 21728 0 0 | 43528 820 1752 | T:ref | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240806 | 20240716 |
397621 | 17484 0 0 | 38744 820 1752 | T:ref | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240806 | 20240716 |
398413 | 42317 0 0 | 63864 788 1784 | T:ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240806 | 20240716 |
433799 | 12394 0 0 | 30990 812 1752 | T:ref | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240806 | 20240716 |
519206 | 13262 0 0 | 32238 812 1752 | T:ref | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240806 | 20240716 |
573275 | 12575 0 0 | 31752 788 1784 | T:ref | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240806 | 20240716 |
604262 | 12958 0 0 | 32648 788 1784 | T:ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240806 | 20240716 |
628111 | 11776 0 0 | 29864 780 1752 | T:ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240806 | 20240716 |
KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:146:9: error: always_inline function '_mm256_loadu_si256' requires target feature 'avx', but would be inlined into function 'KeccakP1600times4_AddLanesAll' that is compiled without support for 'avx' KeccakP-1600-times4-SIMD256.c: Xor_In4( 0 ); KeccakP-1600-times4-SIMD256.c: ^ KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:135:42: note: expanded from macro 'Xor_In4' KeccakP-1600-times4-SIMD256.c: #define Xor_In4( argIndex ) lanes0 = LOAD256u( curData0[argIndex]),\ KeccakP-1600-times4-SIMD256.c: ^ KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:44:37: note: expanded from macro 'LOAD256u' KeccakP-1600-times4-SIMD256.c: #define LOAD256u(a) _mm256_loadu_si256((const V256 *)&(a)) KeccakP-1600-times4-SIMD256.c: ^ KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:146:9: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:135:42: note: expanded from macro 'Xor_In4' KeccakP-1600-times4-SIMD256.c: #define Xor_In4( argIndex ) lanes0 = LOAD256u( curData0[argIndex]),\ KeccakP-1600-times4-SIMD256.c: ^ KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:44:37: note: expanded from macro 'LOAD256u' KeccakP-1600-times4-SIMD256.c: #define LOAD256u(a) _mm256_loadu_si256((const V256 *)&(a)) KeccakP-1600-times4-SIMD256.c: ^ KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:146:9: error: always_inline function '_mm256_loadu_si256' requires target feature 'avx', but would be inlined into function 'KeccakP1600times4_AddLanesAll' that is compiled without support for 'avx' KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:136:42: note: expanded from macro 'Xor_In4' KeccakP-1600-times4-SIMD256.c: lanes1 = LOAD256u( curData1[argIndex]),\ KeccakP-1600-times4-SIMD256.c: ^ KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:44:37: note: expanded from macro 'LOAD256u' KeccakP-1600-times4-SIMD256.c: #define LOAD256u(a) _mm256_loadu_si256((const V256 *)&(a)) KeccakP-1600-times4-SIMD256.c: ^ KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:146:9: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:136:42: note: expanded from macro 'Xor_In4' KeccakP-1600-times4-SIMD256.c: ...
Implementation | Compiler |
T:avx2 | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
KeccakP-1600-times4-SIMD256.o KeccakF1600times4_FastLoop_Absorb T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_12rounds_FastLoop_Absorb T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_AddBytes T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_AddLanesAll T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_ExtractAndAddBytes T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_ExtractAndAddLanesAll T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_ExtractBytes T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_ExtractLanesAll T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_InitializeAll T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_OverwriteBytes T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_OverwriteLanesAll T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_OverwriteWithZeroes T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_PermuteAll_12rounds T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_PermuteAll_24rounds T consts.o neg2 R consts.o neg4 R consts.o q_vector R consts.o qinv_vector R cpapke.o cpapke_dec T cpapke.o cpapke_enc T cpapke.o cpapke_keypair T fips202.o KeccakF1600_StatePermute T fips202.o shake128_absorb T fips202.o shake128_squeezeblocks T fips202.o shake256 T fips202x4.o shake128x4 T fips202x4.o shake256x4 T ntt.o bitrev_vector T ntt.o mul_coefficients T ntt.o ntt T ntt_double.o _ntt_double1024 T ntt_double.o _ntt_double512 T ntt_double.o _pwmul_double1024 T ntt_double.o _pwmul_double512 T ntt_double.o ntt_double1024 T ntt_double.o ntt_double512 T ntt_double.o pwmul_double1024 T ntt_double.o pwmul_double512 T poly.o poly_add T poly.o poly_compress T poly.o poly_decompress T poly.o poly_frombytes T poly.o poly_frommsg T poly.o poly_invntt T poly.o poly_ntt T poly.o poly_pointwise T poly.o poly_sample T poly.o poly_sub T poly.o poly_tobytes T poly.o poly_tomsg T poly.o poly_uniform T precomp.o bitrev_lut R precomp.o omegas_double R precomp.o omegas_inv_double R precomp.o psis_bitrev R precomp.o psis_inv R reduce.o barrett_reduce T reduce.o montgomery_reduce T verify.o cmov T verify.o verify T
Implementation | Compiler |
T:avx2 | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
T:avx2 | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
T:avx2 | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
T:avx2 | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
T:avx2 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |
T:avx2 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |
T:avx2 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |
T:avx2 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |
cpapke.o cpapke_dec T cpapke.o cpapke_enc T cpapke.o cpapke_keypair T fips202.o KeccakF1600_StatePermute T fips202.o shake128_absorb T fips202.o shake128_squeezeblocks T fips202.o shake256 T ntt.o bitrev_vector T ntt.o mul_coefficients T ntt.o ntt T poly.o poly_add T poly.o poly_compress T poly.o poly_decompress T poly.o poly_frombytes T poly.o poly_frommsg T poly.o poly_invntt T poly.o poly_mul_pointwise T poly.o poly_ntt T poly.o poly_sample T poly.o poly_sub T poly.o poly_tobytes T poly.o poly_tomsg T poly.o poly_uniform T precomp.o omegas_bitrev_montgomery R precomp.o omegas_inv_bitrev_montgomery R precomp.o psis_bitrev_montgomery R precomp.o psis_inv_montgomery R reduce.o montgomery_reduce T verify.o cmov T verify.o verify T
Implementation | Compiler |
T:ref | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
T:ref | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
T:ref | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
T:ref | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
T:ref | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
T:ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |
T:ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |
T:ref | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |
T:ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |