Implementation notes: amd64, hydra7, crypto_kem/bikel1

Computer: hydra7
Microarchitecture: amd64; Sandy Bridge+AES (206a7)
Architecture: amd64
CPU ID: GenuineIntel-000206a7-bfebfbff
SUPERCOP version: 20240425
Operation: crypto_kem
Primitive: bikel1
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
1049343178772 56 497261 876 1764T:aes-ni-and-pclmulgcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2024042720240425
1313391634184 56 451389 876 1764T:aes-ni-and-pclmulgcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2024042720240425
1328522833212 56 450021 876 1764T:aes-ni-and-pclmulgcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2024042720240425
1357923322736 56 438509 868 1732T:aes-ni-and-pclmulgcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2024042720240425
1861913572859 48 492038 940 1764T:portablegcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2024042720240425
1880532876740 56 495277 876 1764T:aes-ni-onlygcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2024042720240425
3175162632121 48 449990 940 1764T:portablegcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2024042720240425
3199622534984 56 452221 876 1764T:aes-ni-onlygcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2024042720240425
3217747922466 56 438157 868 1732T:aes-ni-onlygcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2024042720240425
3227956020113 48 436526 932 1732T:portablegcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2024042720240425
3268300431794 48 449278 940 1764T:portablegcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2024042720240425
3336156434141 56 450917 876 1764T:aes-ni-onlygcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2024042720240425

Compiler output

Implementation: T:avx2
Security model: timingleaks
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE
gf2x_ksqr_avx2.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/immintrin.h:47,
gf2x_ksqr_avx2.c: from x86_64_intrinsic.h:20,
gf2x_ksqr_avx2.c: from defs.h:103,
gf2x_ksqr_avx2.c: from bike_defs.h:10,
gf2x_ksqr_avx2.c: from types.h:13,
gf2x_ksqr_avx2.c: from utilities.h:13,
gf2x_ksqr_avx2.c: from cleanup.h:10,
gf2x_ksqr_avx2.c: from gf2x_ksqr_avx2.c:13:
gf2x_ksqr_avx2.c: gf2x_ksqr_avx2.c: In function 'bytes_to_bin':
gf2x_ksqr_avx2.c: /usr/lib/gcc/x86_64-linux-gnu/11/include/avx2intrin.h:433:1: error: inlining failed in call to 'always_inline' '_mm256_movemask_epi8': target specific option mismatch
gf2x_ksqr_avx2.c: 433 | _mm256_movemask_epi8 (__m256i __A)
gf2x_ksqr_avx2.c: | ^~~~~~~~~~~~~~~~~~~~
gf2x_ksqr_avx2.c: In file included from defs.h:103,
gf2x_ksqr_avx2.c: from bike_defs.h:10,
gf2x_ksqr_avx2.c: from types.h:13,
gf2x_ksqr_avx2.c: from utilities.h:13,
gf2x_ksqr_avx2.c: from cleanup.h:10,
gf2x_ksqr_avx2.c: from gf2x_ksqr_avx2.c:13:
gf2x_ksqr_avx2.c: x86_64_intrinsic.h:79:23: note: called from here
gf2x_ksqr_avx2.c: 79 | # define MOVEMASK(a) _mm256_movemask_epi8(a)
gf2x_ksqr_avx2.c: | ^~~~~~~~~~~~~~~~~~~~~~~
gf2x_ksqr_avx2.c: gf2x_ksqr_avx2.c:85:17: note: in expansion of macro 'MOVEMASK'
gf2x_ksqr_avx2.c: 85 | bin32[i] = MOVEMASK(t);
gf2x_ksqr_avx2.c: | ^~~~~~~~
gf2x_ksqr_avx2.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/immintrin.h:47,
gf2x_ksqr_avx2.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx2
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx2
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx2
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx2

Compiler output

Implementation: T:avx512
Security model: timingleaks
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE
decode.c: In file included from decode.c:39:
decode.c: gf2x.h: In function 'gf2x_mod_add':
decode.c: gf2x.h:22:8: warning: AVX512F vector return without AVX512F enabled changes the ABI [-Wpsabi]
decode.c: 22 | va = LOAD(&a_qwords[i]);
decode.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/immintrin.h:49,
decode.c: from x86_64_intrinsic.h:20,
decode.c: from defs.h:103,
decode.c: from bike_defs.h:10,
decode.c: from types.h:13,
decode.c: from decode.h:10,
decode.c: from decode.c:37:
decode.c: /usr/lib/gcc/x86_64-linux-gnu/11/include/avx512fintrin.h:6445:1: error: inlining failed in call to 'always_inline' '_mm512_storeu_si512': target specific option mismatch
decode.c: 6445 | _mm512_storeu_si512 (void *__P, __m512i __A)
decode.c: | ^~~~~~~~~~~~~~~~~~~
decode.c: In file included from defs.h:103,
decode.c: from bike_defs.h:10,
decode.c: from types.h:13,
decode.c: from decode.h:10,
decode.c: from decode.c:37:
decode.c: x86_64_intrinsic.h:41:27: note: called from here
decode.c: 41 | # define STORE(mem, reg) _mm512_storeu_si512((mem), (reg))
decode.c: | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
decode.c: gf2x.h:25:5: note: in expansion of macro 'STORE'
decode.c: 25 | STORE(&c_qwords[i], va ^ vb);
decode.c: | ^~~~~
decode.c: ...

Number of similar (compiler,implementation) pairs: 8, namely:
CompilerImplementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx512
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx512
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx512
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx512
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx512-vpclmul
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx512-vpclmul
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx512-vpclmul
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx512-vpclmul

Compiler output

Implementation: T:ches2021
Security model: timingleaks
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE
gf2x_ksqr_avx2.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/immintrin.h:47,
gf2x_ksqr_avx2.c: from x86_64_intrinsic.h:20,
gf2x_ksqr_avx2.c: from defs.h:106,
gf2x_ksqr_avx2.c: from bike_defs.h:10,
gf2x_ksqr_avx2.c: from types.h:15,
gf2x_ksqr_avx2.c: from utilities.h:20,
gf2x_ksqr_avx2.c: from cleanup.h:10,
gf2x_ksqr_avx2.c: from gf2x_ksqr_avx2.c:13:
gf2x_ksqr_avx2.c: gf2x_ksqr_avx2.c: In function 'bytes_to_bin':
gf2x_ksqr_avx2.c: /usr/lib/gcc/x86_64-linux-gnu/11/include/avx2intrin.h:433:1: error: inlining failed in call to 'always_inline' '_mm256_movemask_epi8': target specific option mismatch
gf2x_ksqr_avx2.c: 433 | _mm256_movemask_epi8 (__m256i __A)
gf2x_ksqr_avx2.c: | ^~~~~~~~~~~~~~~~~~~~
gf2x_ksqr_avx2.c: In file included from defs.h:106,
gf2x_ksqr_avx2.c: from bike_defs.h:10,
gf2x_ksqr_avx2.c: from types.h:15,
gf2x_ksqr_avx2.c: from utilities.h:20,
gf2x_ksqr_avx2.c: from cleanup.h:10,
gf2x_ksqr_avx2.c: from gf2x_ksqr_avx2.c:13:
gf2x_ksqr_avx2.c: x86_64_intrinsic.h:80:23: note: called from here
gf2x_ksqr_avx2.c: 80 | # define MOVEMASK(a) _mm256_movemask_epi8(a)
gf2x_ksqr_avx2.c: | ^~~~~~~~~~~~~~~~~~~~~~~
gf2x_ksqr_avx2.c: gf2x_ksqr_avx2.c:85:17: note: in expansion of macro 'MOVEMASK'
gf2x_ksqr_avx2.c: 85 | bin32[i] = MOVEMASK(t);
gf2x_ksqr_avx2.c: | ^~~~~~~~
gf2x_ksqr_avx2.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/immintrin.h:47,
gf2x_ksqr_avx2.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:ches2021
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:ches2021
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE T:ches2021
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE T:ches2021