[Page version: 20240911 12:41:04]
Measurements for amd64, cubi10, crypto_kem Test results for amd64, cubi10, crypto_kem Test results for crypto_kem/newhope512ccaTime | Object size | Test size | Implementation | Compiler | Benchmark date | SUPERCOP version |
---|---|---|---|---|---|---|
175112 | 153191 0 0 | 174352 820 1720 | T:avx2 | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240728 | 20240716 |
175240 | 159602 0 0 | 180848 820 1720 | T:avx2 | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240728 | 20240716 |
211534 | 233800 0 0 | 255176 788 1752 | T:avx2 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240728 | 20240716 |
211590 | 140147 0 0 | 158006 812 1720 | T:avx2 | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240728 | 20240716 |
244146 | 144022 0 0 | 163616 788 1752 | T:avx2 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240728 | 20240716 |
265051 | 141011 0 0 | 159342 812 1720 | T:avx2 | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240728 | 20240716 |
269876 | 138432 0 0 | 156400 780 1720 | T:avx2 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240728 | 20240716 |
278756 | 139481 0 0 | 158512 788 1752 | T:avx2 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240728 | 20240716 |
343438 | 24671 0 0 | 46320 820 1720 | T:ref | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240728 | 20240716 |
351725 | 21728 0 0 | 43264 820 1720 | T:ref | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240728 | 20240716 |
395936 | 17484 0 0 | 38480 820 1720 | T:ref | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240728 | 20240716 |
401643 | 42317 0 0 | 63552 788 1752 | T:ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240728 | 20240716 |
422882 | 12394 0 0 | 30734 812 1720 | T:ref | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240728 | 20240716 |
499177 | 13262 0 0 | 31958 812 1720 | T:ref | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240728 | 20240716 |
509014 | 12958 0 0 | 32336 788 1752 | T:ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240728 | 20240716 |
555309 | 12575 0 0 | 31488 788 1752 | T:ref | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240728 | 20240716 |
602100 | 11776 0 0 | 29568 780 1720 | T:ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240728 | 20240716 |
KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:146:9: error: always_inline function '_mm256_loadu_si256' requires target feature 'avx', but would be inlined into function 'KeccakP1600times4_AddLanesAll' that is compiled without support for 'avx' KeccakP-1600-times4-SIMD256.c: Xor_In4( 0 ); KeccakP-1600-times4-SIMD256.c: ^ KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:135:42: note: expanded from macro 'Xor_In4' KeccakP-1600-times4-SIMD256.c: #define Xor_In4( argIndex ) lanes0 = LOAD256u( curData0[argIndex]),\ KeccakP-1600-times4-SIMD256.c: ^ KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:44:37: note: expanded from macro 'LOAD256u' KeccakP-1600-times4-SIMD256.c: #define LOAD256u(a) _mm256_loadu_si256((const V256 *)&(a)) KeccakP-1600-times4-SIMD256.c: ^ KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:146:9: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:135:42: note: expanded from macro 'Xor_In4' KeccakP-1600-times4-SIMD256.c: #define Xor_In4( argIndex ) lanes0 = LOAD256u( curData0[argIndex]),\ KeccakP-1600-times4-SIMD256.c: ^ KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:44:37: note: expanded from macro 'LOAD256u' KeccakP-1600-times4-SIMD256.c: #define LOAD256u(a) _mm256_loadu_si256((const V256 *)&(a)) KeccakP-1600-times4-SIMD256.c: ^ KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:146:9: error: always_inline function '_mm256_loadu_si256' requires target feature 'avx', but would be inlined into function 'KeccakP1600times4_AddLanesAll' that is compiled without support for 'avx' KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:136:42: note: expanded from macro 'Xor_In4' KeccakP-1600-times4-SIMD256.c: lanes1 = LOAD256u( curData1[argIndex]),\ KeccakP-1600-times4-SIMD256.c: ^ KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:44:37: note: expanded from macro 'LOAD256u' KeccakP-1600-times4-SIMD256.c: #define LOAD256u(a) _mm256_loadu_si256((const V256 *)&(a)) KeccakP-1600-times4-SIMD256.c: ^ KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:146:9: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:136:42: note: expanded from macro 'Xor_In4' KeccakP-1600-times4-SIMD256.c: ...
Implementation | Compiler |
T:avx2 | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
KeccakP-1600-times4-SIMD256.o KeccakF1600times4_FastLoop_Absorb T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_12rounds_FastLoop_Absorb T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_AddBytes T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_AddLanesAll T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_ExtractAndAddBytes T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_ExtractAndAddLanesAll T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_ExtractBytes T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_ExtractLanesAll T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_InitializeAll T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_OverwriteBytes T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_OverwriteLanesAll T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_OverwriteWithZeroes T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_PermuteAll_12rounds T KeccakP-1600-times4-SIMD256.o KeccakP1600times4_PermuteAll_24rounds T consts.o neg2 R consts.o neg4 R consts.o q_vector R consts.o qinv_vector R cpapke.o cpapke_dec T cpapke.o cpapke_enc T cpapke.o cpapke_keypair T fips202.o KeccakF1600_StatePermute T fips202.o shake128_absorb T fips202.o shake128_squeezeblocks T fips202.o shake256 T fips202x4.o shake128x4 T fips202x4.o shake256x4 T ntt.o bitrev_vector T ntt.o mul_coefficients T ntt.o ntt T ntt_double.o _ntt_double1024 T ntt_double.o _ntt_double512 T ntt_double.o _pwmul_double1024 T ntt_double.o _pwmul_double512 T ntt_double.o ntt_double1024 T ntt_double.o ntt_double512 T ntt_double.o pwmul_double1024 T ntt_double.o pwmul_double512 T poly.o poly_add T poly.o poly_compress T poly.o poly_decompress T poly.o poly_frombytes T poly.o poly_frommsg T poly.o poly_invntt T poly.o poly_ntt T poly.o poly_pointwise T poly.o poly_sample T poly.o poly_sub T poly.o poly_tobytes T poly.o poly_tomsg T poly.o poly_uniform T precomp.o bitrev_lut R precomp.o omegas_double R precomp.o omegas_inv_double R precomp.o psis_bitrev R precomp.o psis_inv R reduce.o barrett_reduce T reduce.o montgomery_reduce T verify.o cmov T verify.o verify T
Implementation | Compiler |
T:avx2 | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
T:avx2 | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
T:avx2 | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
T:avx2 | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
T:avx2 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |
T:avx2 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |
T:avx2 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |
T:avx2 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |
cpapke.o cpapke_dec T cpapke.o cpapke_enc T cpapke.o cpapke_keypair T fips202.o KeccakF1600_StatePermute T fips202.o shake128_absorb T fips202.o shake128_squeezeblocks T fips202.o shake256 T ntt.o bitrev_vector T ntt.o mul_coefficients T ntt.o ntt T poly.o poly_add T poly.o poly_compress T poly.o poly_decompress T poly.o poly_frombytes T poly.o poly_frommsg T poly.o poly_invntt T poly.o poly_mul_pointwise T poly.o poly_ntt T poly.o poly_sample T poly.o poly_sub T poly.o poly_tobytes T poly.o poly_tomsg T poly.o poly_uniform T precomp.o omegas_bitrev_montgomery R precomp.o omegas_inv_bitrev_montgomery R precomp.o psis_bitrev_montgomery R precomp.o psis_inv_montgomery R reduce.o montgomery_reduce T verify.o cmov T verify.o verify T
Implementation | Compiler |
T:ref | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
T:ref | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
T:ref | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
T:ref | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
T:ref | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
T:ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |
T:ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |
T:ref | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |
T:ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |