[Page version: 20240911 12:49:53]
Measurements for amd64, know, crypto_kem Test results for amd64, know, crypto_kem Test results for crypto_kem/newhope512ccaTime | Object size | Test size | Implementation | Compiler | Benchmark date | SUPERCOP version |
---|---|---|---|---|---|---|
185110 | 163275 0 0 | 185337 860 1792 | T:avx2 | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240804 | 20240716 |
185150 | 156362 0 0 | 178145 860 1728 | T:avx2 | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240804 | 20240716 |
212828 | 140985 0 0 | 160071 852 1792 | T:avx2 | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240804 | 20240716 |
220181 | 246010 0 0 | 267252 788 1760 | T:avx2 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240804 | 20240716 |
243743 | 144301 0 0 | 163636 788 1760 | T:avx2 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240804 | 20240716 |
245890 | 143491 0 0 | 161737 860 1728 | T:avx2 | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240804 | 20240716 |
276000 | 139362 0 0 | 157084 780 1728 | T:avx2 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240804 | 20240716 |
286176 | 139405 0 0 | 158284 788 1760 | T:avx2 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240804 | 20240716 |
347335 | 43741 0 0 | 66185 860 1792 | T:ref | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240804 | 20240716 |
359734 | 31594 0 0 | 53729 860 1728 | T:ref | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240804 | 20240716 |
395111 | 53146 0 0 | 74228 788 1760 | T:ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240804 | 20240716 |
397228 | 22965 0 0 | 44017 860 1728 | T:ref | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240804 | 20240716 |
443229 | 12437 0 0 | 31975 852 1792 | T:ref | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240804 | 20240716 |
521764 | 13635 0 0 | 32812 788 1760 | T:ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240804 | 20240716 |
524380 | 12967 0 0 | 31665 860 1728 | T:ref | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240804 | 20240716 |
601815 | 12615 0 0 | 31316 788 1760 | T:ref | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240804 | 20240716 |
627354 | 11774 0 0 | 29316 780 1728 | T:ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240804 | 20240716 |
KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:146:9: error: always_inline function '_mm256_loadu_si256' requires target feature 'avx', but would be inlined into function 'KeccakP1600times4_AddLanesAll' that is compiled without support for 'avx' KeccakP-1600-times4-SIMD256.c: Xor_In4( 0 ); KeccakP-1600-times4-SIMD256.c: ^ KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:135:42: note: expanded from macro 'Xor_In4' KeccakP-1600-times4-SIMD256.c: #define Xor_In4( argIndex ) lanes0 = LOAD256u( curData0[argIndex]),\ KeccakP-1600-times4-SIMD256.c: ^ KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:44:37: note: expanded from macro 'LOAD256u' KeccakP-1600-times4-SIMD256.c: #define LOAD256u(a) _mm256_loadu_si256((const V256 *)&(a)) KeccakP-1600-times4-SIMD256.c: ^ KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:146:9: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:135:42: note: expanded from macro 'Xor_In4' KeccakP-1600-times4-SIMD256.c: #define Xor_In4( argIndex ) lanes0 = LOAD256u( curData0[argIndex]),\ KeccakP-1600-times4-SIMD256.c: ^ KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:44:37: note: expanded from macro 'LOAD256u' KeccakP-1600-times4-SIMD256.c: #define LOAD256u(a) _mm256_loadu_si256((const V256 *)&(a)) KeccakP-1600-times4-SIMD256.c: ^ KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:146:9: error: always_inline function '_mm256_loadu_si256' requires target feature 'avx', but would be inlined into function 'KeccakP1600times4_AddLanesAll' that is compiled without support for 'avx' KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:136:42: note: expanded from macro 'Xor_In4' KeccakP-1600-times4-SIMD256.c: lanes1 = LOAD256u( curData1[argIndex]),\ KeccakP-1600-times4-SIMD256.c: ^ KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:44:37: note: expanded from macro 'LOAD256u' KeccakP-1600-times4-SIMD256.c: #define LOAD256u(a) _mm256_loadu_si256((const V256 *)&(a)) KeccakP-1600-times4-SIMD256.c: ^ KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:146:9: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:136:42: note: expanded from macro 'Xor_In4' KeccakP-1600-times4-SIMD256.c: ...
Implementation | Compiler |
T:avx2 | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) |
KeccakP-1600-times4-SIMD256.o KeccakF1600times4_FastLoop_Absorb T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_12rounds_FastLoop_Absorb T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_AddBytes T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_AddLanesAll T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_ExtractAndAddBytes T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_ExtractAndAddLanesAll T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_ExtractBytes T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_ExtractLanesAll T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_InitializeAll T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_OverwriteBytes T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_OverwriteLanesAll T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_OverwriteWithZeroes T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_PermuteAll_12rounds T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_PermuteAll_24rounds T consts.o neg2 R consts.o neg4 R consts.o q_vector R consts.o qinv_vector R cpapke.o cpapke_dec T cpapke.o cpapke_enc T cpapke.o cpapke_keypair T fips202.o KeccakF1600_StatePermute T fips202.o shake128_absorb T fips202.o shake128_squeezeblocks T fips202.o shake256 T fips202x4.o shake128x4 T fips202x4.o shake256x4 T ntt.o bitrev_vector T ntt.o mul_coefficients T ntt.o ntt T ntt_double.o _ntt_double1024 T ntt_double.o _ntt_double512 T ntt_double.o _pwmul_double1024 T ntt_double.o _pwmul_double512 T ntt_double.o ntt_double1024 T ntt_double.o ntt_double512 T ntt_double.o pwmul_double1024 T ntt_double.o pwmul_double512 T poly.o poly_add T poly.o poly_compress T poly.o poly_decompress T poly.o poly_frombytes T poly.o poly_frommsg T poly.o poly_invntt T poly.o poly_ntt T poly.o poly_pointwise T poly.o poly_sample T poly.o poly_sub T poly.o poly_tobytes T poly.o poly_tomsg T poly.o poly_uniform T precomp.o bitrev_lut R precomp.o omegas_double R precomp.o omegas_inv_double R precomp.o psis_bitrev R precomp.o psis_inv R reduce.o barrett_reduce T reduce.o montgomery_reduce T verify.o cmov T verify.o verify T
Implementation | Compiler |
T:avx2 | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) |
T:avx2 | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) |
T:avx2 | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) |
T:avx2 | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) |
T:avx2 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
T:avx2 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
T:avx2 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
T:avx2 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
cpapke.o cpapke_dec T cpapke.o cpapke_enc T cpapke.o cpapke_keypair T fips202.o KeccakF1600_StatePermute T fips202.o shake128_absorb T fips202.o shake128_squeezeblocks T fips202.o shake256 T ntt.o bitrev_vector T ntt.o mul_coefficients T ntt.o ntt T poly.o poly_add T poly.o poly_compress T poly.o poly_decompress T poly.o poly_frombytes T poly.o poly_frommsg T poly.o poly_invntt T poly.o poly_mul_pointwise T poly.o poly_ntt T poly.o poly_sample T poly.o poly_sub T poly.o poly_tobytes T poly.o poly_tomsg T poly.o poly_uniform T precomp.o omegas_bitrev_montgomery R precomp.o omegas_inv_bitrev_montgomery R precomp.o psis_bitrev_montgomery R precomp.o psis_inv_montgomery R reduce.o montgomery_reduce T verify.o cmov T verify.o verify T
Implementation | Compiler |
T:ref | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) |
T:ref | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) |
T:ref | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) |
T:ref | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) |
T:ref | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) |
T:ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
T:ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
T:ref | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
T:ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |