Implementation notes: amd64, firefly, crypto_kem/kyber512

Computer: firefly
Architecture: amd64
CPU ID: AuthenticAMD-00800f12-178bfbff
SUPERCOP version: 20201130
Operation: crypto_kem
Primitive: kyber512
Time    Object size  Test size        Implementation  Compiler  Benchmark date  SUPERCOP version
88895   164879 0 0   186534 824 1600  T:avx2          gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE  20201213  20201130
92000   135977 0 0   155567 800 1568  T:avx2          clang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE  20201213  20201130
106352  118137 0 0   137534 824 1600  T:avx2          gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE  20201213  20201130
107548  127196 0 0   145237 792 1568  T:avx2          clang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE  20201213  20201130
139403  135977 0 0   155567 800 1568  T:avx2          clang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE  20201213  20201130
141335  135988 0 0   156471 800 1568  T:avx2          clang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE  20201213  20201130
164105  119322 0 0   139078 824 1600  T:avx2          gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE  20201213  20201130
167325  117883 0 0   136154 816 1568  T:avx2          gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE  20201213  20201130
317791  16406 0 0    36247 808 1568   T:ref           clang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE  20201213  20201130
347967  11185 0 0    30494 824 1600   T:ref           gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE  20201213  20201130
351118  10878 0 0    29125 800 1568   T:ref           clang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE  20201213  20201130
408802  10594 0 0    28722 816 1568   T:ref           gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE  20201213  20201130
476169  56097 0 0    77526 824 1600   T:ref           gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE  20201213  20201130
478354  16406 0 0    36247 808 1568   T:ref           clang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE  20201213  20201130
496800  17446 0 0    38239 808 1568   T:ref           clang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE  20201213  20201130
511865  11922 0 0    31558 824 1600   T:ref           gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE  20201213  20201130
520720  16796 0 0    37039 808 1568   T:ref           clang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE  20201213  20201130

Compiler output

Implementation: T:avx2
Security model: timingleaks
Compiler: clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
cbd.c: cbd.c:20:26: error: always_inline function '_mm256_set1_epi32' requires target feature 'sse4.2', but would be inlined into function 'crypto_kem_kyber512_avx2_timingleaks_cbd' that is compiled without support for 'sse4.2'
cbd.c: const __m256i mask55 = _mm256_set1_epi32(0x55555555);
cbd.c: ^
cbd.c: cbd.c:21:26: error: always_inline function '_mm256_set1_epi32' requires target feature 'sse4.2', but would be inlined into function 'crypto_kem_kyber512_avx2_timingleaks_cbd' that is compiled without support for 'sse4.2'
cbd.c: const __m256i mask33 = _mm256_set1_epi32(0x33333333);
cbd.c: ^
cbd.c: cbd.c:22:26: error: always_inline function '_mm256_set1_epi32' requires target feature 'sse4.2', but would be inlined into function 'crypto_kem_kyber512_avx2_timingleaks_cbd' that is compiled without support for 'sse4.2'
cbd.c: const __m256i mask03 = _mm256_set1_epi32(0x03030303);
cbd.c: ^
cbd.c: cbd.c:25:12: error: always_inline function '_mm256_loadu_si256' requires target feature 'sse4.2', but would be inlined into function 'crypto_kem_kyber512_avx2_timingleaks_cbd' that is compiled without support for 'sse4.2'
cbd.c: vec0 = _mm256_loadu_si256((__m256i *)&buf[32*i]);
cbd.c: ^
cbd.c: cbd.c:27:12: error: always_inline function '_mm256_srli_epi32' requires target feature 'sse4.2', but would be inlined into function 'crypto_kem_kyber512_avx2_timingleaks_cbd' that is compiled without support for 'sse4.2'
cbd.c: vec1 = _mm256_srli_epi32(vec0, 1);
cbd.c: ^
cbd.c: cbd.c:28:12: error: always_inline function '_mm256_and_si256' requires target feature 'sse4.2', but would be inlined into function 'crypto_kem_kyber512_avx2_timingleaks_cbd' that is compiled without support for 'sse4.2'
cbd.c: vec0 = _mm256_and_si256(mask55, vec0);
cbd.c: ^
cbd.c: cbd.c:29:12: error: always_inline function '_mm256_and_si256' requires target feature 'sse4.2', but would be inlined into function 'crypto_kem_kyber512_avx2_timingleaks_cbd' that is compiled without support for 'sse4.2'
cbd.c: vec1 = _mm256_and_si256(mask55, vec1);
cbd.c: ^
cbd.c: cbd.c:30:12: error: always_inline function '_mm256_add_epi32' requires target feature 'sse4.2', but would be inlined into function 'crypto_kem_kyber512_avx2_timingleaks_cbd' that is compiled without support for 'sse4.2'
cbd.c: vec0 = _mm256_add_epi32(vec0, vec1);
cbd.c: ^
cbd.c: cbd.c:32:12: error: always_inline function '_mm256_srli_epi32' requires target feature 'sse4.2', but would be inlined into function 'crypto_kem_kyber512_avx2_timingleaks_cbd' that is compiled without support for 'sse4.2'
cbd.c: ...
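
For readers unfamiliar with the arithmetic on the failing lines (cbd.c:27 to cbd.c:30 in the log above), the shift, mask, and add sequence sums each pair of adjacent bits within a 32-bit lane. The following is a minimal scalar sketch of that one step only, assuming the same mask constant 0x55555555 shown in the log; the helper names pair_sums and check_pair_sums are hypothetical and do not come from cbd.c, and the rest of the cbd function is truncated in the log and is not reproduced here.

```c
#include <assert.h>
#include <stdint.h>

/* Scalar analogue of the logged srli/and/and/add sequence (hypothetical name).
 * Each 2-bit field of the result holds the count of set bits (0, 1 or 2)
 * in the corresponding 2-bit field of x. */
static uint32_t pair_sums(uint32_t x)
{
  const uint32_t mask55 = 0x55555555u; /* selects the low bit of every pair */
  uint32_t lo = x & mask55;            /* low bit of each pair */
  uint32_t hi = (x >> 1) & mask55;     /* high bit of each pair, shifted down */
  return lo + hi;                      /* sums are at most 2, so no carry leaves a pair */
}

/* Small self-check against hand-computed values. */
static void check_pair_sums(void)
{
  assert(pair_sums(0x00000000u) == 0x00000000u);
  assert(pair_sums(0x00000003u) == 0x00000002u);
  assert(pair_sums(0xFFFFFFFFu) == 0xAAAAAAAAu);
}
```

The AVX2 lines in the log apply the same operations to eight 32-bit lanes at once, which is why each of them needs the vector target feature to be enabled.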

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler  Implementations
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE  T:avx2
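
The failing compiler line is the only one in this report that uses -mcpu=native rather than -march=native, and it does not appear in the T:avx2 rows of the timing table. One plausible reading, not confirmed by the log itself, is that the vector target features were not enabled for that invocation. The following is a hedged sketch of a compile-time guard, assuming the __AVX2__ macro that GCC and Clang predefine when AVX2 code generation is enabled; the helper names broadcast8 and check_broadcast8 are hypothetical and are not part of the kyber512 sources.

```c
#include <assert.h>
#include <stdint.h>
#if defined(__AVX2__)
#include <immintrin.h>
#endif

/* Fill eight 32-bit words with one value (hypothetical helper).
 * Uses the intrinsic from the first logged error when AVX2 is enabled,
 * and a plain loop otherwise, so the file compiles either way. */
static void broadcast8(uint32_t out[8], uint32_t value)
{
#if defined(__AVX2__)
  const __m256i v = _mm256_set1_epi32((int)value);
  _mm256_storeu_si256((__m256i *)out, v);
#else
  for (int i = 0; i < 8; i++)
    out[i] = value;
#endif
}

/* Self-check: both code paths must produce the same eight words. */
static void check_broadcast8(void)
{
  uint32_t out[8];
  broadcast8(out, 0x55555555u);
  for (int i = 0; i < 8; i++)
    assert(out[i] == 0x55555555u);
}
```

A guard of this kind lets one source file build under every compiler line in the table, at the cost of silently taking the slower path when the feature is not enabled.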