Test results for amd64, h9ivy, crypto_kem/kyber90s512

[Page version: 20260503 03:45:06]

Computer: h9ivy
Microarchitecture: amd64; Ivy Bridge+AES (306a9)
Architecture: amd64
CPU ID: GenuineIntel-000306a9-bfebfbff
SUPERCOP version: 20260330
Operation: crypto_kem
Primitive: kyber90s512
Time | Object size | Test size | Implementation | Compiler | Benchmark date | SUPERCOP version
557321 | 54235 0 0 | 82875 1248 2312 | ref | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20260501 | 20260330
576423 | 57083 0 0 | 85579 1248 2312 | ref | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20260501 | 20260330
595206 | 52976 0 0 | 81955 1248 2312 | ref | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20260501 | 20260330
610853 | 42680 0 0 | 69423 1208 2376 | ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20260501 | 20260330
619723 | 53661 0 0 | 82159 1208 2376 | ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20260501 | 20260330
625226 | 42081 0 0 | 67517 1240 2312 | ref | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20260501 | 20260330
644529 | 42342 0 0 | 68363 1248 2312 | ref | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20260501 | 20260330
712953 | 41956 0 0 | 68463 1208 2376 | ref | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20260501 | 20260330
740462 | 40116 0 0 | 65002 1200 2344 | ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20260501 | 20260330

Compiler output


cbd.c: cbd.c:86:10: error: '__builtin_ia32_permdi256' needs target feature avx2
cbd.c:    86 |     f0 = _mm256_permute4x64_epi64(f0,0x94);
cbd.c:       |          ^
cbd.c: /usr/lib/llvm-19/lib/clang/19/include/avx2intrin.h:3394:13: note: expanded from macro '_mm256_permute4x64_epi64'
cbd.c:  3394 |   ((__m256i)__builtin_ia32_permdi256((__v4di)(__m256i)(V), (int)(M)))
cbd.c:       |             ^
cbd.c: cbd.c:87:10: error: always_inline function '_mm256_shuffle_epi8' requires target feature 'avx2', but would be inlined into function 'cbd3' that is compiled without support for 'avx2'
cbd.c:    87 |     f0 = _mm256_shuffle_epi8(f0,shufbidx);
cbd.c:       |          ^
cbd.c: cbd.c:89:10: error: always_inline function '_mm256_srli_epi32' requires target feature 'avx2', but would be inlined into function 'cbd3' that is compiled without support for 'avx2'
cbd.c:    89 |     f1 = _mm256_srli_epi32(f0,1);
cbd.c:       |          ^
cbd.c: cbd.c:90:10: error: always_inline function '_mm256_srli_epi32' requires target feature 'avx2', but would be inlined into function 'cbd3' that is compiled without support for 'avx2'
cbd.c:    90 |     f2 = _mm256_srli_epi32(f0,2);
cbd.c:       |          ^
cbd.c: cbd.c:91:10: error: always_inline function '_mm256_and_si256' requires target feature 'avx2', but would be inlined into function 'cbd3' that is compiled without support for 'avx2'
cbd.c:    91 |     f0 = _mm256_and_si256(mask249,f0);
cbd.c:       |          ^
cbd.c: cbd.c:92:10: error: always_inline function '_mm256_and_si256' requires target feature 'avx2', but would be inlined into function 'cbd3' that is compiled without support for 'avx2'
cbd.c:    92 |     f1 = _mm256_and_si256(mask249,f1);
cbd.c:       |          ^
cbd.c: cbd.c:93:10: error: always_inline function '_mm256_and_si256' requires target feature 'avx2', but would be inlined into function 'cbd3' that is compiled without support for 'avx2'
cbd.c:    93 |     f2 = _mm256_and_si256(mask249,f2);
cbd.c:       |          ^
cbd.c: cbd.c:94:10: error: always_inline function '_mm256_add_epi32' requires target feature 'avx2', but would be inlined into function 'cbd3' that is compiled without support for 'avx2'
cbd.c: ...

Number of similar (implementation,compiler) pairs: 4, namely:
Implementation | Compiler
avx2 | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1))
avx2 | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1))
avx2 | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1))
avx2 | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1))
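
Note on the errors above: the avx2 implementation calls AVX2 intrinsics, but the compiler is targeting a CPU without AVX2. Ivy Bridge predates AVX2, so -march=native does not enable that feature on this machine. The following is a minimal sketch, not taken from SUPERCOP or the kyber90s512 sources, of how the predefined macros __AVX2__ and __AES__ (set by GCC and Clang when the corresponding target features are enabled) can be inspected. The function names are hypothetical and chosen only for illustration.

```c
#include <stdio.h>

/* Hypothetical helper: returns 1 if this translation unit was compiled
   with AVX2 enabled (for example via -mavx2, or -march=native on a CPU
   that has AVX2), otherwise 0. */
static int compiled_with_avx2(void)
{
#ifdef __AVX2__
    return 1;
#else
    return 0;
#endif
}

/* Hypothetical helper: returns 1 if the AES-NI target feature was
   enabled at compile time (for example via -maes), otherwise 0. */
static int compiled_with_aes(void)
{
#ifdef __AES__
    return 1;
#else
    return 0;
#endif
}

/* Prints which of the two target features the compiler had enabled. */
static void report_target_features(void)
{
    printf("AVX2 enabled at compile time: %s\n",
           compiled_with_avx2() ? "yes" : "no");
    printf("AES enabled at compile time: %s\n",
           compiled_with_aes() ? "yes" : "no");
}
```

Calling report_target_features() from a small test program built with the same flags as a failing row would be one way to confirm whether the feature an intrinsic needs was actually enabled for that build.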

Compiler output


aes256ctr.c: aes256ctr.c:90:3: error: '__builtin_ia32_aeskeygenassist128' needs target feature aes
aes256ctr.c:    90 |   BLOCK1(0x01);
aes256ctr.c:       |   ^
aes256ctr.c: aes256ctr.c:71:11: note: expanded from macro 'BLOCK1'
aes256ctr.c:    71 |   temp1 = _mm_aeskeygenassist_si128(temp2, IMM);                        \
aes256ctr.c:       |           ^
aes256ctr.c: /usr/lib/llvm-19/lib/clang/19/include/__wmmintrin_aes.h:136:13: note: expanded from macro '_mm_aeskeygenassist_si128'
aes256ctr.c:   136 |   ((__m128i)__builtin_ia32_aeskeygenassist128((__v2di)(__m128i)(C), (int)(R)))
aes256ctr.c:       |             ^
aes256ctr.c: aes256ctr.c:91:3: error: '__builtin_ia32_aeskeygenassist128' needs target feature aes
aes256ctr.c:    91 |   BLOCK2(0x01);
aes256ctr.c:       |   ^
aes256ctr.c: aes256ctr.c:81:11: note: expanded from macro 'BLOCK2'
aes256ctr.c:    81 |   temp1 = _mm_aeskeygenassist_si128(temp0, IMM);                        \
aes256ctr.c:       |           ^
aes256ctr.c: /usr/lib/llvm-19/lib/clang/19/include/__wmmintrin_aes.h:136:13: note: expanded from macro '_mm_aeskeygenassist_si128'
aes256ctr.c:   136 |   ((__m128i)__builtin_ia32_aeskeygenassist128((__v2di)(__m128i)(C), (int)(R)))
aes256ctr.c:       |             ^
aes256ctr.c: aes256ctr.c:93:3: error: '__builtin_ia32_aeskeygenassist128' needs target feature aes
aes256ctr.c:    93 |   BLOCK1(0x02);
aes256ctr.c:       |   ^
aes256ctr.c: aes256ctr.c:71:11: note: expanded from macro 'BLOCK1'
aes256ctr.c:    71 |   temp1 = _mm_aeskeygenassist_si128(temp2, IMM);                        \
aes256ctr.c:       |           ^
aes256ctr.c: /usr/lib/llvm-19/lib/clang/19/include/__wmmintrin_aes.h:136:13: note: expanded from macro '_mm_aeskeygenassist_si128'
aes256ctr.c: ...

Number of similar (implementation,compiler) pairs: 1, namely:
Implementation | Compiler
avx2 | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1))

Compiler output


cbd.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/14/include/immintrin.h:53,
cbd.c:                  from cbd.c:2:
cbd.c: /usr/lib/gcc/x86_64-linux-gnu/14/include/avx2intrin.h: In function 'cbd3':
cbd.c: /usr/lib/gcc/x86_64-linux-gnu/14/include/avx2intrin.h:1084:1: error: inlining failed in call to 'always_inline' '_mm256_permute2x128_si256': target specific option mismatch
cbd.c:  1084 | _mm256_permute2x128_si256 (__m256i __X, __m256i __Y, const int __M)
cbd.c:       | ^~~~~~~~~~~~~~~~~~~~~~~~~
cbd.c: cbd.c:117:10: note: called from here
cbd.c:   117 |     f1 = _mm256_permute2x128_si256(f2,f3,0x31);
cbd.c:       |          ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
cbd.c: /usr/lib/gcc/x86_64-linux-gnu/14/include/avx2intrin.h:1084:1: error: inlining failed in call to 'always_inline' '_mm256_permute2x128_si256': target specific option mismatch
cbd.c:  1084 | _mm256_permute2x128_si256 (__m256i __X, __m256i __Y, const int __M)
cbd.c:       | ^~~~~~~~~~~~~~~~~~~~~~~~~
cbd.c: cbd.c:116:10: note: called from here
cbd.c:   116 |     f0 = _mm256_permute2x128_si256(f2,f3,0x20);
cbd.c:       |          ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
cbd.c: /usr/lib/gcc/x86_64-linux-gnu/14/include/avx2intrin.h:871:1: error: inlining failed in call to 'always_inline' '_mm256_unpackhi_epi32': target specific option mismatch
cbd.c:   871 | _mm256_unpackhi_epi32 (__m256i __A, __m256i __B)
cbd.c:       | ^~~~~~~~~~~~~~~~~~~~~
cbd.c: cbd.c:114:10: note: called from here
cbd.c:   114 |     f3 = _mm256_unpackhi_epi32(f0,f1);
cbd.c:       |          ^~~~~~~~~~~~~~~~~~~~~~~~~~~~
cbd.c: /usr/lib/gcc/x86_64-linux-gnu/14/include/avx2intrin.h:899:1: error: inlining failed in call to 'always_inline' '_mm256_unpacklo_epi32': target specific option mismatch
cbd.c:   899 | _mm256_unpacklo_epi32 (__m256i __A, __m256i __B)
cbd.c:       | ^~~~~~~~~~~~~~~~~~~~~
cbd.c: cbd.c:113:10: note: called from here
cbd.c: ...

Number of similar (implementation,compiler) pairs: 4, namely:
Implementation | Compiler
avx2 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0)
avx2 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0)
avx2 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0)
avx2 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0)

Passed TIMECOP


TIMECOP iterations: 1

Number of similar (implementation,compiler) pairs: 9, namely:
Implementation | Compiler
ref | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1))
ref | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1))
ref | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1))
ref | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1))
ref | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1))
ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0)
ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0)
ref | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0)
ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0)