[Page version: 20241013 16:36:29]
Measurements for amd64, bolero, crypto_kem Test results for amd64, bolero, crypto_kem Test results for crypto_kem/newhope512ccaTime | Object size | Test size | Implementation | Compiler | Benchmark date | SUPERCOP version |
---|---|---|---|---|---|---|
200136 | 165388 0 0 | 188596 824 1576 | T:avx2 | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240720 | 20240716 |
200840 | 158331 0 0 | 181244 824 1576 | T:avx2 | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240720 | 20240716 |
218108 | 233343 0 0 | 255845 792 1608 | T:avx2 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240720 | 20240716 |
224928 | 140429 0 0 | 160358 816 1640 | T:avx2 | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240720 | 20240716 |
250452 | 143220 0 0 | 163789 792 1608 | T:avx2 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240720 | 20240716 |
267784 | 142932 0 0 | 162172 824 1576 | T:avx2 | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240720 | 20240716 |
293768 | 140949 0 0 | 161093 792 1608 | T:avx2 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240720 | 20240716 |
295480 | 140630 0 0 | 159693 784 1576 | T:avx2 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240720 | 20240716 |
374212 | 34653 0 0 | 58180 824 1576 | T:ref | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240720 | 20240716 |
386900 | 30434 0 0 | 53652 824 1576 | T:ref | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240720 | 20240716 |
416900 | 22965 0 0 | 44964 824 1576 | T:ref | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240720 | 20240716 |
438052 | 47373 0 0 | 69814 784 1608 | T:ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240720 | 20240716 |
462164 | 12437 0 0 | 32822 816 1640 | T:ref | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240720 | 20240716 |
528856 | 12951 0 0 | 32628 824 1576 | T:ref | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240720 | 20240716 |
546572 | 13461 0 0 | 33878 784 1608 | T:ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240720 | 20240716 |
609528 | 13400 0 0 | 33414 784 1608 | T:ref | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240720 | 20240716 |
675512 | 12642 0 0 | 31542 776 1576 | T:ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240720 | 20240716 |
KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:146:9: error: always_inline function '_mm256_loadu_si256' requires target feature 'avx', but would be inlined into function 'KeccakP1600times4_AddLanesAll' that is compiled without support for 'avx' KeccakP-1600-times4-SIMD256.c: Xor_In4( 0 ); KeccakP-1600-times4-SIMD256.c: ^ KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:135:42: note: expanded from macro 'Xor_In4' KeccakP-1600-times4-SIMD256.c: #define Xor_In4( argIndex ) lanes0 = LOAD256u( curData0[argIndex]),\ KeccakP-1600-times4-SIMD256.c: ^ KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:44:37: note: expanded from macro 'LOAD256u' KeccakP-1600-times4-SIMD256.c: #define LOAD256u(a) _mm256_loadu_si256((const V256 *)&(a)) KeccakP-1600-times4-SIMD256.c: ^ KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:146:9: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:135:42: note: expanded from macro 'Xor_In4' KeccakP-1600-times4-SIMD256.c: #define Xor_In4( argIndex ) lanes0 = LOAD256u( curData0[argIndex]),\ KeccakP-1600-times4-SIMD256.c: ^ KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:44:37: note: expanded from macro 'LOAD256u' KeccakP-1600-times4-SIMD256.c: #define LOAD256u(a) _mm256_loadu_si256((const V256 *)&(a)) KeccakP-1600-times4-SIMD256.c: ^ KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:146:9: error: always_inline function '_mm256_loadu_si256' requires target feature 'avx', but would be inlined into function 'KeccakP1600times4_AddLanesAll' that is compiled without support for 'avx' KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:136:42: note: expanded from macro 'Xor_In4' KeccakP-1600-times4-SIMD256.c: lanes1 = LOAD256u( curData1[argIndex]),\ KeccakP-1600-times4-SIMD256.c: ^ KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:44:37: note: expanded from macro 'LOAD256u' KeccakP-1600-times4-SIMD256.c: #define LOAD256u(a) _mm256_loadu_si256((const V256 *)&(a)) KeccakP-1600-times4-SIMD256.c: ^ KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:146:9: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:136:42: note: expanded from macro 'Xor_In4' KeccakP-1600-times4-SIMD256.c: ...
Implementation | Compiler |
T:avx2 | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
KeccakP-1600-times4-SIMD256.o KeccakF1600times4_FastLoop_Absorb T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_12rounds_FastLoop_Absorb T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_AddBytes T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_AddLanesAll T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_ExtractAndAddBytes T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_ExtractAndAddLanesAll T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_ExtractBytes T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_ExtractLanesAll T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_InitializeAll T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_OverwriteBytes T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_OverwriteLanesAll T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_OverwriteWithZeroes T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_PermuteAll_12rounds T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_PermuteAll_24rounds T consts.o neg2 R consts.o neg4 R consts.o q_vector R consts.o qinv_vector R cpapke.o cpapke_dec T cpapke.o cpapke_enc T cpapke.o cpapke_keypair T fips202.o KeccakF1600_StatePermute T fips202.o shake128_absorb T fips202.o shake128_squeezeblocks T fips202.o shake256 T fips202x4.o shake128x4 T fips202x4.o shake256x4 T ntt.o bitrev_vector T ntt.o mul_coefficients T ntt.o ntt T ntt_double.o _ntt_double1024 T ntt_double.o _ntt_double512 T ntt_double.o _pwmul_double1024 T ntt_double.o _pwmul_double512 T ntt_double.o ntt_double1024 T ntt_double.o ntt_double512 T ntt_double.o pwmul_double1024 T ntt_double.o pwmul_double512 T poly.o poly_add T poly.o poly_compress T poly.o poly_decompress T poly.o poly_frombytes T poly.o poly_frommsg T poly.o poly_invntt T poly.o poly_ntt T poly.o poly_pointwise T poly.o poly_sample T poly.o poly_sub T poly.o poly_tobytes T poly.o poly_tomsg T poly.o poly_uniform T precomp.o bitrev_lut R precomp.o omegas_double R precomp.o omegas_inv_double R precomp.o psis_bitrev R precomp.o psis_inv R reduce.o barrett_reduce T reduce.o montgomery_reduce T verify.o cmov T verify.o verify T
Implementation | Compiler |
T:avx2 | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
T:avx2 | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
T:avx2 | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
T:avx2 | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
T:avx2 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0) |
T:avx2 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0) |
T:avx2 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0) |
T:avx2 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0) |
cpapke.o cpapke_dec T cpapke.o cpapke_enc T cpapke.o cpapke_keypair T fips202.o KeccakF1600_StatePermute T fips202.o shake128_absorb T fips202.o shake128_squeezeblocks T fips202.o shake256 T ntt.o bitrev_vector T ntt.o mul_coefficients T ntt.o ntt T poly.o poly_add T poly.o poly_compress T poly.o poly_decompress T poly.o poly_frombytes T poly.o poly_frommsg T poly.o poly_invntt T poly.o poly_mul_pointwise T poly.o poly_ntt T poly.o poly_sample T poly.o poly_sub T poly.o poly_tobytes T poly.o poly_tomsg T poly.o poly_uniform T precomp.o omegas_bitrev_montgomery R precomp.o omegas_inv_bitrev_montgomery R precomp.o psis_bitrev_montgomery R precomp.o psis_inv_montgomery R reduce.o montgomery_reduce T verify.o cmov T verify.o verify T
Implementation | Compiler |
T:ref | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
T:ref | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
T:ref | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
T:ref | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
T:ref | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
T:ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0) |
T:ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0) |
T:ref | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0) |
T:ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0) |