Implementation notes: amd64, firefly, crypto_kem/kyber1024

Computer: firefly
Architecture: amd64
CPU ID: AuthenticAMD-00800f12-178bfbff
SUPERCOP version: 20201130
Operation: crypto_kem
Primitive: kyber1024
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
239729137380 0 0157983 800 1568T:avx2clang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020121320201130
267168120760 0 0140510 824 1600T:avx2gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020121320201130
350175137385 0 0157079 800 1568T:avx2clang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020121320201130
351371159333 0 0180998 824 1600T:avx2gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020121320201130
365930137385 0 0157079 800 1568T:avx2clang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020121320201130
402316119302 0 0137506 816 1568T:avx2gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020121320201130
415081119494 0 0138878 824 1600T:avx2gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020121320201130
428076128383 0 0146453 792 1568T:avx2clang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020121320201130
68949450059 0 071494 824 1600T:refgcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020121320201130
73970316954 0 036840 808 1568T:refclang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020121320201130
75711417272 0 037440 808 1568T:refclang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020121320201130
78234511147 0 029382 800 1568T:refclang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020121320201130
79642111731 0 031014 824 1600T:refgcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020121320201130
88816811207 0 029330 816 1568T:refgcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020121320201130
110303416954 0 036840 808 1568T:refclang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020121320201130
112688517418 0 038192 808 1568T:refclang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020121320201130
121382512556 0 032174 824 1600T:refgcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020121320201130

Compiler output

Implementation: T:avx2
Security model: timingleaks
Compiler: clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
cbd.c: cbd.c:20:26: error: always_inline function '_mm256_set1_epi32' requires target feature 'sse4.2', but would be inlined into function 'crypto_kem_kyber1024_avx2_timingleaks_cbd' that is compiled without support for 'sse4.2'
cbd.c: const __m256i mask55 = _mm256_set1_epi32(0x55555555);
cbd.c: ^
cbd.c: cbd.c:21:26: error: always_inline function '_mm256_set1_epi32' requires target feature 'sse4.2', but would be inlined into function 'crypto_kem_kyber1024_avx2_timingleaks_cbd' that is compiled without support for 'sse4.2'
cbd.c: const __m256i mask33 = _mm256_set1_epi32(0x33333333);
cbd.c: ^
cbd.c: cbd.c:22:26: error: always_inline function '_mm256_set1_epi32' requires target feature 'sse4.2', but would be inlined into function 'crypto_kem_kyber1024_avx2_timingleaks_cbd' that is compiled without support for 'sse4.2'
cbd.c: const __m256i mask03 = _mm256_set1_epi32(0x03030303);
cbd.c: ^
cbd.c: cbd.c:25:12: error: always_inline function '_mm256_loadu_si256' requires target feature 'sse4.2', but would be inlined into function 'crypto_kem_kyber1024_avx2_timingleaks_cbd' that is compiled without support for 'sse4.2'
cbd.c: vec0 = _mm256_loadu_si256((__m256i *)&buf[32*i]);
cbd.c: ^
cbd.c: cbd.c:27:12: error: always_inline function '_mm256_srli_epi32' requires target feature 'sse4.2', but would be inlined into function 'crypto_kem_kyber1024_avx2_timingleaks_cbd' that is compiled without support for 'sse4.2'
cbd.c: vec1 = _mm256_srli_epi32(vec0, 1);
cbd.c: ^
cbd.c: cbd.c:28:12: error: always_inline function '_mm256_and_si256' requires target feature 'sse4.2', but would be inlined into function 'crypto_kem_kyber1024_avx2_timingleaks_cbd' that is compiled without support for 'sse4.2'
cbd.c: vec0 = _mm256_and_si256(mask55, vec0);
cbd.c: ^
cbd.c: cbd.c:29:12: error: always_inline function '_mm256_and_si256' requires target feature 'sse4.2', but would be inlined into function 'crypto_kem_kyber1024_avx2_timingleaks_cbd' that is compiled without support for 'sse4.2'
cbd.c: vec1 = _mm256_and_si256(mask55, vec1);
cbd.c: ^
cbd.c: cbd.c:30:12: error: always_inline function '_mm256_add_epi32' requires target feature 'sse4.2', but would be inlined into function 'crypto_kem_kyber1024_avx2_timingleaks_cbd' that is compiled without support for 'sse4.2'
cbd.c: vec0 = _mm256_add_epi32(vec0, vec1);
cbd.c: ^
cbd.c: cbd.c:32:12: error: always_inline function '_mm256_srli_epi32' requires target feature 'sse4.2', but would be inlined into function 'crypto_kem_kyber1024_avx2_timingleaks_cbd' that is compiled without support for 'sse4.2'
cbd.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:avx2