Test results for amd64, dali, crypto_kem/kyber512
[Page version: 20251001 11:56:25]
Measurements for amd64, dali, crypto_kem
Test results for amd64, dali, crypto_kem
Test results for crypto_kem/kyber512
Computer: dali
Microarchitecture: amd64; Zen (820f01)
Architecture: amd64
CPU ID: AuthenticAMD-00820f01-178bfbff
SUPERCOP version: 20250922
Operation: crypto_kem
Primitive: kyber512
Time | Object size | Test size | Implementation | Compiler | Benchmark date | SUPERCOP version |
94781 | 153620 0 0 | 174412 788 1760 | avx2 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250928 | 20250922 |
96154 | 135468 0 0 | 156953 860 1728 | avx2 | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250928 | 20250922 |
97029 | 127008 0 0 | 146103 852 1792 | avx2 | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250928 | 20250922 |
97165 | 141484 0 0 | 163329 860 1728 | avx2 | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250928 | 20250922 |
98438 | 129563 0 0 | 149244 788 1760 | avx2 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250928 | 20250922 |
98772 | 131819 0 0 | 150377 860 1728 | avx2 | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250928 | 20250922 |
99635 | 125868 0 0 | 144916 788 1760 | avx2 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250928 | 20250922 |
100468 | 125193 0 0 | 143148 780 1728 | avx2 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250928 | 20250922 |
323531 | 20736 0 0 | 48512 788 1752 | compact | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250424 | 20250415 |
324748 | 31402 0 0 | 52360 820 1720 | ref | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250424 | 20250415 |
330470 | 52795 0 0 | 80488 820 1720 | compact | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250424 | 20250415 |
338366 | 23104 0 0 | 43920 820 1720 | ref | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250424 | 20250415 |
342119 | 33670 0 0 | 54216 788 1752 | ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250424 | 20250415 |
343797 | 28486 0 0 | 49720 820 1720 | ref | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250424 | 20250415 |
346353 | 12220 0 0 | 30550 812 1720 | ref | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250424 | 20250415 |
351321 | 26724 0 0 | 54472 820 1720 | compact | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250424 | 20250415 |
361330 | 29755 0 0 | 57928 820 1720 | compact | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250424 | 20250415 |
374428 | 12824 0 0 | 32504 788 1752 | ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250424 | 20250415 |
390648 | 11747 0 0 | 30912 788 1752 | ref | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250424 | 20250415 |
409576 | 6730 0 0 | 32510 812 1720 | compact | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250424 | 20250415 |
431744 | 11101 0 0 | 29176 780 1720 | ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250424 | 20250415 |
441545 | 6690 0 0 | 33608 788 1752 | compact | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250424 | 20250415 |
449348 | 13436 0 0 | 32414 812 1720 | ref | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250424 | 20250415 |
492695 | 5890 0 0 | 32328 788 1752 | compact | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250424 | 20250415 |
642050 | 5366 0 0 | 30776 780 1720 | compact | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250424 | 20250415 |
720709 | 8434 0 0 | 34814 812 1720 | compact | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250424 | 20250415 |
Compiler output
KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:146:9: error: always_inline function '_mm256_loadu_si256' requires target feature 'avx', but would be inlined into function 'crypto_kem_kyber512_avx2_constbranchindex_KeccakP1600times4_AddLanesAll' that is compiled without support for 'avx'
KeccakP-1600-times4-SIMD256.c: Xor_In4( 0 );
KeccakP-1600-times4-SIMD256.c: ^
KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:135:42: note: expanded from macro 'Xor_In4'
KeccakP-1600-times4-SIMD256.c: #define Xor_In4( argIndex ) lanes0 = LOAD256u( curData0[argIndex]),\
KeccakP-1600-times4-SIMD256.c: ^
KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:44:37: note: expanded from macro 'LOAD256u'
KeccakP-1600-times4-SIMD256.c: #define LOAD256u(a) _mm256_loadu_si256((const V256 *)&(a))
KeccakP-1600-times4-SIMD256.c: ^
KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:146:9: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:135:42: note: expanded from macro 'Xor_In4'
KeccakP-1600-times4-SIMD256.c: #define Xor_In4( argIndex ) lanes0 = LOAD256u( curData0[argIndex]),\
KeccakP-1600-times4-SIMD256.c: ^
KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:44:37: note: expanded from macro 'LOAD256u'
KeccakP-1600-times4-SIMD256.c: #define LOAD256u(a) _mm256_loadu_si256((const V256 *)&(a))
KeccakP-1600-times4-SIMD256.c: ^
KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:146:9: error: always_inline function '_mm256_loadu_si256' requires target feature 'avx', but would be inlined into function 'crypto_kem_kyber512_avx2_constbranchindex_KeccakP1600times4_AddLanesAll' that is compiled without support for 'avx'
KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:136:42: note: expanded from macro 'Xor_In4'
KeccakP-1600-times4-SIMD256.c: lanes1 = LOAD256u( curData1[argIndex]),\
KeccakP-1600-times4-SIMD256.c: ^
KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:44:37: note: expanded from macro 'LOAD256u'
KeccakP-1600-times4-SIMD256.c: #define LOAD256u(a) _mm256_loadu_si256((const V256 *)&(a))
KeccakP-1600-times4-SIMD256.c: ^
KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:146:9: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:136:42: note: expanded from macro 'Xor_In4'
KeccakP-1600-times4-SIMD256.c: ...
Number of similar (implementation,compiler) pairs: 1, namely:
Implementation | Compiler |
avx2 | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) |
Compiler output
kem.c: kem.c:24:32: warning: argument 1 of type 'uint8_t[800]' {aka 'unsigned char[800]'} with mismatched bound [-Warray-parameter=]
kem.c: 24 | int crypto_kem_keypair(uint8_t pk[KYBER_PUBLICKEYBYTES],
kem.c: | ~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~
kem.c: In file included from kem.c:5:
kem.c: kem.h:33:33: note: previously declared as 'uint8_t *' {aka 'unsigned char *'}
kem.c: 33 | int crypto_kem_keypair(uint8_t *pk, uint8_t *sk);
kem.c: | ~~~~~~~~~^~
kem.c: kem.c:25:32: warning: argument 2 of type 'uint8_t[1632]' {aka 'unsigned char[1632]'} with mismatched bound [-Warray-parameter=]
kem.c: 25 | uint8_t sk[KYBER_SECRETKEYBYTES])
kem.c: | ~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~
kem.c: kem.h:33:46: note: previously declared as 'uint8_t *' {aka 'unsigned char *'}
kem.c: 33 | int crypto_kem_keypair(uint8_t *pk, uint8_t *sk);
kem.c: | ~~~~~~~~~^~
kem.c: kem.c:50:28: warning: argument 1 of type 'uint8_t[768]' {aka 'unsigned char[768]'} with mismatched bound [-Warray-parameter=]
kem.c: 50 | int crypto_kem_enc(uint8_t ct[KYBER_CIPHERTEXTBYTES],
kem.c: | ~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~
kem.c: kem.h:36:29: note: previously declared as 'uint8_t *' {aka 'unsigned char *'}
kem.c: 36 | int crypto_kem_enc(uint8_t *ct, uint8_t *ss, const uint8_t *pk);
kem.c: | ~~~~~~~~~^~
kem.c: kem.c:51:28: warning: argument 2 of type 'uint8_t[32]' {aka 'unsigned char[32]'} with mismatched bound [-Warray-parameter=]
kem.c: 51 | uint8_t ss[KYBER_SSBYTES],
kem.c: | ~~~~~~~~^~~~~~~~~~~~~~~~~
kem.c: kem.h:36:42: note: previously declared as 'uint8_t *' {aka 'unsigned char *'}
kem.c: 36 | int crypto_kem_enc(uint8_t *ct, uint8_t *ss, const uint8_t *pk);
kem.c: | ~~~~~~~~~^~
kem.c: ...
poly.c: poly.c:407:42: warning: argument 5 of type 'const uint8_t[32]' {aka 'const unsigned char[32]'} with mismatched bound [-Warray-parameter=]
poly.c: 407 | const uint8_t seed[32],
poly.c: | ~~~~~~~~~~~~~~^~~~~~~~
poly.c: In file included from poly.c:6:
poly.c: poly.h:37:43: note: previously declared as 'const uint8_t *' {aka 'const unsigned char *'}
poly.c: 37 | const uint8_t *seed,
poly.c: | ~~~~~~~~~~~~~~~^~~~
poly.c: poly.c:442:45: warning: argument 5 of type 'const uint8_t[32]' {aka 'const unsigned char[32]'} with mismatched bound [-Warray-parameter=]
poly.c: 442 | const uint8_t seed[32],
poly.c: | ~~~~~~~~~~~~~~^~~~~~~~
poly.c: poly.h:49:46: note: previously declared as 'const uint8_t *' {aka 'const unsigned char *'}
poly.c: 49 | const uint8_t *seed,
poly.c: | ~~~~~~~~~~~~~~~^~~~
Number of similar (implementation,compiler) pairs: 4, namely:
Implementation | Compiler |
avx2 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
avx2 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
avx2 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
avx2 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
TIMECOP error (can be valgrind bug)
error 132
Process terminating with default action of signal 4 (SIGILL)
Illegal opcode at address 0x405906
at 0x...: poly_compress (kem.c:153)
by 0x...: pack_ciphertext (kem.c:344)
by 0x...: indcpa_enc (kem.c:432)
by 0x...: crypto_kem_kyber512_compact_constbranchindex_enc (kem.c:477)
by 0x...: test (try.c:141)
by 0x...: main (try-anything.c:345)
Illegal instruction
Number of similar (implementation,compiler) pairs: 1, namely:
Implementation | Compiler |
compact | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
TIMECOP error (can be valgrind bug)
error 132
Process terminating with default action of signal 4 (SIGILL)
Illegal opcode at address 0x408069
at 0x...: poly_compress (kem.c:153)
by 0x...: pack_ciphertext (kem.c:344)
by 0x...: indcpa_enc (kem.c:432)
by 0x...: crypto_kem_kyber512_compact_constbranchindex_enc (kem.c:477)
by 0x...: test (try.c:141)
by 0x...: main (try-anything.c:345)
Illegal instruction
Number of similar (implementation,compiler) pairs: 1, namely:
Implementation | Compiler |
compact | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) |
TIMECOP error (can be valgrind bug)
error 132
Process terminating with default action of signal 4 (SIGILL)
Illegal opcode at address 0x405AF1
at 0x...: crypto_kem_kyber512_ref_constbranchindex_poly_tomsg (poly.c:199)
by 0x...: crypto_kem_kyber512_ref_constbranchindex_indcpa_dec (indcpa.c:330)
by 0x...: crypto_kem_kyber512_ref_constbranchindex_dec (kem.c:106)
by 0x...: test (try.c:160)
by 0x...: main (try-anything.c:345)
Illegal instruction
Number of similar (implementation,compiler) pairs: 1, namely:
Implementation | Compiler |
ref | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
TIMECOP error (can be valgrind bug)
error 132
Process terminating with default action of signal 4 (SIGILL)
Illegal opcode at address 0x405C8B
at 0x...: crypto_kem_kyber512_ref_constbranchindex_poly_tomsg (poly.c:199)
by 0x...: crypto_kem_kyber512_ref_constbranchindex_indcpa_dec (indcpa.c:330)
by 0x...: crypto_kem_kyber512_ref_constbranchindex_dec (kem.c:106)
by 0x...: test (try.c:160)
by 0x...: main (try-anything.c:345)
Illegal instruction
Number of similar (implementation,compiler) pairs: 1, namely:
Implementation | Compiler |
ref | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) |
Passed TIMECOP
TIMECOP iterations: 1
Number of similar (implementation,compiler) pairs: 22, namely:
Implementation | Compiler |
avx2 | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
avx2 | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) |
avx2 | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) |
avx2 | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) |
avx2 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
avx2 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
avx2 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
avx2 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
compact | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) |
compact | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) |
compact | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) |
compact | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
compact | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
compact | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
compact | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
ref | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) |
ref | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) |
ref | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_14.0.6) |
ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
ref | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |
ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (12.2.0) |