Implementation notes: amd64, saber214, crypto_kem/bikel1

Computer: saber214
Microarchitecture: amd64; Bulldozer (600f20)
Architecture: amd64
CPU ID: AuthenticAMD-00600f20-1789c3f5
SUPERCOP version: 20240625
Operation: crypto_kem
Primitive: bikel1
Time  Object size  Test size  Implementation  Compiler  Benchmark date  SUPERCOP version
1143759379902 56 497942 848 1604  T:aes-ni-and-pclmul  gcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall  20240705  20240625
1591486417828 56 433644 888 1572  T:aes-ni-and-pclmul  clang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall  20240705  20240625
1626313334978 56 451478 848 1604  T:aes-ni-and-pclmul  gcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall  20240705  20240625
1643968833881 56 450030 848 1604  T:aes-ni-and-pclmul  gcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall  20240705  20240625
1658436322797 56 437718 840 1572  T:aes-ni-and-pclmul  gcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall  20240705  20240625
2253728030734 48 449614 936 1572  T:portable  clang_-mcpu=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall  20240705  20240625
2257073925737 56 442828 888 1572  T:aes-ni-and-pclmul  clang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall  20240705  20240625
2272087175894 56 493942 848 1604  T:aes-ni-only  gcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall  20240705  20240625
2280125071934 48 490631 912 1604  T:portable  gcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall  20240705  20240625
2627569744524 56 463988 888 1572  T:aes-ni-and-pclmul  clang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall  20240705  20240625
3307739414795 48 431182 936 1572  T:portable  clang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall  20240705  20240625
3339100232902 48 450087 912 1604  T:portable  gcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall  20240705  20240625
3346710217126 56 432900 888 1572  T:aes-ni-only  clang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall  20240705  20240625
3409722220129 48 435671 904 1572  T:portable  gcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall  20240705  20240625
3462657635938 56 452438 848 1604  T:aes-ni-only  gcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall  20240705  20240625
3468987522530 56 437366 840 1572  T:aes-ni-only  gcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall  20240705  20240625
3480954622533 48 440198 936 1572  T:portable  clang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall  20240705  20240625
3495885225587 56 442684 888 1572  T:aes-ni-only  clang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall  20240705  20240625
3541993332495 48 449303 912 1604  T:portable  gcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall  20240705  20240625
3545748034822 56 450926 848 1604  T:aes-ni-only  gcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall  20240705  20240625
3807340441336 48 461422 936 1572  T:portable  clang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall  20240705  20240625
3845223644374 56 463844 888 1572  T:aes-ni-only  clang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall  20240705  20240625
12464670016229 56 431366 880 1572  T:aes-ni-and-pclmul  clang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall  20240705  20240625
14162843013546 48 429216 928 1572  T:portable  clang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall  20240705  20240625
14207870815702 56 430798 880 1572  T:aes-ni-only  clang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall  20240705  20240625

Compiler output


aes.c: aes.c:9:4: error: "This code requries support for AES_NI and SSSE3"
aes.c: #  error "This code requries support for AES_NI and SSSE3"
aes.c:    ^
aes.c: 1 error generated.

Number of similar (implementation,compiler) pairs: 6, namely:
Implementation  Compiler
T:aes-ni-and-pclmul  clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:aes-ni-only  clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:avx2  clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:avx512  clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:avx512-vpclmul  clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:ches2021  clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
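
Note on the aes.c error: the `#error` at aes.c:9 stops the build when the compiler has not enabled the features the file needs. The condition itself is not shown in the output above. The following is a minimal sketch, assuming a guard that tests the predefined macros `__AES__` and `__SSSE3__` (GCC and clang define these when `-maes` and `-mssse3` are in effect). The helper names `selected_impl` and `built_with_aes_ni` are hypothetical, and the sketch falls back to a portable path rather than failing:

```c
#include <assert.h>
#include <string.h>

/* Hypothetical helper: names the code path chosen at compile time.
   Assumption: the real guard in aes.c tests macros like these. */
const char *selected_impl(void)
{
#if defined(__AES__) && defined(__SSSE3__)
  return "aes-ni";
#else
  return "portable";
#endif
}

/* Returns 1 when the AES-NI path was selected, 0 otherwise. */
int built_with_aes_ni(void)
{
  const char *impl = selected_impl();
  assert(impl != NULL);
  return strcmp(impl, "aes-ni") == 0;
}
```

Built with `-maes -mssse3`, `built_with_aes_ni()` would return 1; built without those flags it would return 0.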

Compiler output


gf2x_ksqr_avx2.c: gf2x_ksqr_avx2.c:58:9: error: always_inline function '_mm256_sub_epi16' requires target feature 'avx2', but would be inlined into function 'generate_map' that is compiled without support for 'avx2'
gf2x_ksqr_avx2.c:   inc = SUB_I16(inc, vr);
gf2x_ksqr_avx2.c:         ^
gf2x_ksqr_avx2.c: ./x86_64_intrinsic.h:64:28: note: expanded from macro 'SUB_I16'
gf2x_ksqr_avx2.c: #  define SUB_I16(a, b)    _mm256_sub_epi16(a, b)
gf2x_ksqr_avx2.c:                            ^
gf2x_ksqr_avx2.c: gf2x_ksqr_avx2.c:67:17: error: always_inline function '_mm256_add_epi16' requires target feature 'avx2', but would be inlined into function 'generate_map' that is compiled without support for 'avx2'
gf2x_ksqr_avx2.c:       vmap[j] = ADD_I16(vmap[j], inc);
gf2x_ksqr_avx2.c:                 ^
gf2x_ksqr_avx2.c: ./x86_64_intrinsic.h:63:28: note: expanded from macro 'ADD_I16'
gf2x_ksqr_avx2.c: #  define ADD_I16(a, b)    _mm256_add_epi16(a, b)
gf2x_ksqr_avx2.c:                            ^
gf2x_ksqr_avx2.c: gf2x_ksqr_avx2.c:68:17: error: always_inline function '_mm256_cmpgt_epi16' requires target feature 'avx2', but would be inlined into function 'generate_map' that is compiled without support for 'avx2'
gf2x_ksqr_avx2.c:       vtmp[j] = CMPGT_I16(zero, vmap[j]);
gf2x_ksqr_avx2.c:                 ^
gf2x_ksqr_avx2.c: ./x86_64_intrinsic.h:70:27: note: expanded from macro 'CMPGT_I16'
gf2x_ksqr_avx2.c: #  define CMPGT_I16(a, b) _mm256_cmpgt_epi16(a, b)
gf2x_ksqr_avx2.c:                           ^
gf2x_ksqr_avx2.c: gf2x_ksqr_avx2.c:69:17: error: always_inline function '_mm256_add_epi16' requires target feature 'avx2', but would be inlined into function 'generate_map' that is compiled without support for 'avx2'
gf2x_ksqr_avx2.c:       vmap[j] = ADD_I16(vmap[j], vtmp[j] & vr);
gf2x_ksqr_avx2.c:                 ^
gf2x_ksqr_avx2.c: ./x86_64_intrinsic.h:63:28: note: expanded from macro 'ADD_I16'
gf2x_ksqr_avx2.c: #  define ADD_I16(a, b)    _mm256_add_epi16(a, b)
gf2x_ksqr_avx2.c:                            ^
gf2x_ksqr_avx2.c: 4 errors generated.

Number of similar (implementation,compiler) pairs: 4, namely:
Implementation  Compiler
T:avx2  clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:avx2  clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:avx2  clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:avx2  clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
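
Note on the 'requires target feature' errors: Bulldozer does not implement AVX2, so `-march=native` on this machine leaves AVX2 disabled and the `always_inline` intrinsics cannot be inlined into ordinary functions. One common way to keep such code buildable on hosts without AVX2 is to enable the feature per function and dispatch at run time. The benchmarked implementation is not known to do this. The following is a sketch only, assuming GCC or clang; all `add_i16*` function names are hypothetical:

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#if defined(__x86_64__) || defined(__i386__)
#include <immintrin.h>

/* AVX2 is enabled for this function only, so the always_inline intrinsics
   can be inlined here even when the file is built without -mavx2. */
__attribute__((target("avx2")))
static void add_i16_avx2(int16_t *dst, const int16_t *a, const int16_t *b,
                         size_t n)
{
  size_t i = 0;
  for (; i + 16 <= n; i += 16) {          /* 16 lanes of 16 bits per vector */
    __m256i va = _mm256_loadu_si256((const __m256i *)(a + i));
    __m256i vb = _mm256_loadu_si256((const __m256i *)(b + i));
    _mm256_storeu_si256((__m256i *)(dst + i), _mm256_add_epi16(va, vb));
  }
  for (; i < n; i++)                      /* scalar tail */
    dst[i] = (int16_t)(a[i] + b[i]);
}
#endif

/* Portable fallback with the same wrap-around result per element. */
static void add_i16_portable(int16_t *dst, const int16_t *a, const int16_t *b,
                             size_t n)
{
  for (size_t i = 0; i < n; i++)
    dst[i] = (int16_t)(a[i] + b[i]);
}

/* Picks the AVX2 path only when the running CPU reports AVX2 support. */
void add_i16(int16_t *dst, const int16_t *a, const int16_t *b, size_t n)
{
  assert(dst != NULL && a != NULL && b != NULL);
#if defined(__x86_64__) || defined(__i386__)
  if (__builtin_cpu_supports("avx2")) {
    add_i16_avx2(dst, a, b, n);
    return;
  }
#endif
  add_i16_portable(dst, a, b, n);
}
```

On a CPU without AVX2, such as the one benchmarked here, `add_i16` would take the portable branch.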

Compiler output


gf2x_ksqr_avx2.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/immintrin.h:47,
gf2x_ksqr_avx2.c:                  from x86_64_intrinsic.h:20,
gf2x_ksqr_avx2.c:                  from defs.h:103,
gf2x_ksqr_avx2.c:                  from bike_defs.h:10,
gf2x_ksqr_avx2.c:                  from types.h:13,
gf2x_ksqr_avx2.c:                  from utilities.h:13,
gf2x_ksqr_avx2.c:                  from cleanup.h:10,
gf2x_ksqr_avx2.c:                  from gf2x_ksqr_avx2.c:13:
gf2x_ksqr_avx2.c: gf2x_ksqr_avx2.c: In function 'bytes_to_bin':
gf2x_ksqr_avx2.c: /usr/lib/gcc/x86_64-linux-gnu/11/include/avx2intrin.h:433:1: error: inlining failed in call to 'always_inline' '_mm256_movemask_epi8': target specific option mismatch
gf2x_ksqr_avx2.c:   433 | _mm256_movemask_epi8 (__m256i __A)
gf2x_ksqr_avx2.c:       | ^~~~~~~~~~~~~~~~~~~~
gf2x_ksqr_avx2.c: In file included from defs.h:103,
gf2x_ksqr_avx2.c:                  from bike_defs.h:10,
gf2x_ksqr_avx2.c:                  from types.h:13,
gf2x_ksqr_avx2.c:                  from utilities.h:13,
gf2x_ksqr_avx2.c:                  from cleanup.h:10,
gf2x_ksqr_avx2.c:                  from gf2x_ksqr_avx2.c:13:
gf2x_ksqr_avx2.c: x86_64_intrinsic.h:79:23: note: called from here
gf2x_ksqr_avx2.c:    79 | #  define MOVEMASK(a) _mm256_movemask_epi8(a)
gf2x_ksqr_avx2.c:       |                       ^~~~~~~~~~~~~~~~~~~~~~~
gf2x_ksqr_avx2.c: gf2x_ksqr_avx2.c:85:17: note: in expansion of macro 'MOVEMASK'
gf2x_ksqr_avx2.c:    85 |     bin32[i]  = MOVEMASK(t);
gf2x_ksqr_avx2.c:       |                 ^~~~~~~~
gf2x_ksqr_avx2.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/immintrin.h:47,
gf2x_ksqr_avx2.c: ...

Number of similar (implementation,compiler) pairs: 4, namely:
Implementation  Compiler
T:avx2  gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:avx2  gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:avx2  gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:avx2  gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)

Compiler output


decode.c: In file included from decode.c:39:
decode.c: ./gf2x.h:22:10: error: always_inline function '_mm512_loadu_si512' requires target feature 'avx512f', but would be inlined into function 'gf2x_mod_add' that is compiled without support for 'avx512f'
decode.c:     va = LOAD(&a_qwords[i]);
decode.c:          ^
decode.c: ./x86_64_intrinsic.h:40:27: note: expanded from macro 'LOAD'
decode.c: #  define LOAD(mem)       _mm512_loadu_si512((mem))
decode.c:                           ^
decode.c: In file included from decode.c:39:
decode.c: ./gf2x.h:22:10: error: AVX vector return of type '__m512i' (vector of 8 'long long' values) without 'avx512f' enabled changes the ABI
decode.c: ./x86_64_intrinsic.h:40:27: note: expanded from macro 'LOAD'
decode.c: #  define LOAD(mem)       _mm512_loadu_si512((mem))
decode.c:                           ^
decode.c: In file included from decode.c:39:
decode.c: ./gf2x.h:23:10: error: always_inline function '_mm512_loadu_si512' requires target feature 'avx512f', but would be inlined into function 'gf2x_mod_add' that is compiled without support for 'avx512f'
decode.c:     vb = LOAD(&b_qwords[i]);
decode.c:          ^
decode.c: ./x86_64_intrinsic.h:40:27: note: expanded from macro 'LOAD'
decode.c: #  define LOAD(mem)       _mm512_loadu_si512((mem))
decode.c:                           ^
decode.c: In file included from decode.c:39:
decode.c: ./gf2x.h:23:10: error: AVX vector return of type '__m512i' (vector of 8 'long long' values) without 'avx512f' enabled changes the ABI
decode.c: ./x86_64_intrinsic.h:40:27: note: expanded from macro 'LOAD'
decode.c: #  define LOAD(mem)       _mm512_loadu_si512((mem))
decode.c:                           ^
decode.c: In file included from decode.c:39:
decode.c: ...

Number of similar (implementation,compiler) pairs: 8, namely:
Implementation  Compiler
T:avx512  clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:avx512  clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:avx512  clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:avx512  clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:avx512-vpclmul  clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:avx512-vpclmul  clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:avx512-vpclmul  clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:avx512-vpclmul  clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)

Compiler output


decode.c: In file included from decode.c:39:
decode.c: gf2x.h: In function 'gf2x_mod_add':
decode.c: gf2x.h:22:8: warning: AVX512F vector return without AVX512F enabled changes the ABI [-Wpsabi]
decode.c:    22 |     va = LOAD(&a_qwords[i]);
decode.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/immintrin.h:49,
decode.c:                  from x86_64_intrinsic.h:20,
decode.c:                  from defs.h:103,
decode.c:                  from bike_defs.h:10,
decode.c:                  from types.h:13,
decode.c:                  from decode.h:10,
decode.c:                  from decode.c:37:
decode.c: /usr/lib/gcc/x86_64-linux-gnu/11/include/avx512fintrin.h:6481:1: error: inlining failed in call to 'always_inline' '_mm512_storeu_si512': target specific option mismatch
decode.c:  6481 | _mm512_storeu_si512 (void *__P, __m512i __A)
decode.c:       | ^~~~~~~~~~~~~~~~~~~
decode.c: In file included from defs.h:103,
decode.c:                  from bike_defs.h:10,
decode.c:                  from types.h:13,
decode.c:                  from decode.h:10,
decode.c:                  from decode.c:37:
decode.c: x86_64_intrinsic.h:41:27: note: called from here
decode.c:    41 | #  define STORE(mem, reg) _mm512_storeu_si512((mem), (reg))
decode.c:       |                           ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
decode.c: gf2x.h:25:5: note: in expansion of macro 'STORE'
decode.c:    25 |     STORE(&c_qwords[i], va ^ vb);
decode.c:       |     ^~~~~
decode.c: ...

Number of similar (implementation,compiler) pairs: 8, namely:
Implementation  Compiler
T:avx512  gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:avx512  gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:avx512  gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:avx512  gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:avx512-vpclmul  gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:avx512-vpclmul  gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:avx512-vpclmul  gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:avx512-vpclmul  gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)

Compiler output


decode.c: decode.c:211:15: warning: unused function 'adder_size_53' [-Wunused-function]
decode.c: _INLINE_ void adder_size_53(OUT upc_t *upc,
decode.c:               ^
decode.c: decode.c:246:15: warning: unused function 'bit_sliced_adder_test' [-Wunused-function]
decode.c: _INLINE_ void bit_sliced_adder_test(OUT upc_t *upc,
decode.c:               ^
decode.c: 2 warnings generated.
gf2x_ksqr_avx2.c: gf2x_ksqr_avx2.c:58:9: error: always_inline function '_mm256_sub_epi16' requires target feature 'avx2', but would be inlined into function 'generate_map' that is compiled without support for 'avx2'
gf2x_ksqr_avx2.c:   inc = SUB_I16(inc, vr);
gf2x_ksqr_avx2.c:         ^
gf2x_ksqr_avx2.c: ./x86_64_intrinsic.h:64:28: note: expanded from macro 'SUB_I16'
gf2x_ksqr_avx2.c: #  define SUB_I16(a, b)    _mm256_sub_epi16(a, b)
gf2x_ksqr_avx2.c:                            ^
gf2x_ksqr_avx2.c: gf2x_ksqr_avx2.c:67:17: error: always_inline function '_mm256_add_epi16' requires target feature 'avx2', but would be inlined into function 'generate_map' that is compiled without support for 'avx2'
gf2x_ksqr_avx2.c:       vmap[j] = ADD_I16(vmap[j], inc);
gf2x_ksqr_avx2.c:                 ^
gf2x_ksqr_avx2.c: ./x86_64_intrinsic.h:63:28: note: expanded from macro 'ADD_I16'
gf2x_ksqr_avx2.c: #  define ADD_I16(a, b)    _mm256_add_epi16(a, b)
gf2x_ksqr_avx2.c:                            ^
gf2x_ksqr_avx2.c: gf2x_ksqr_avx2.c:68:17: error: always_inline function '_mm256_cmpgt_epi16' requires target feature 'avx2', but would be inlined into function 'generate_map' that is compiled without support for 'avx2'
gf2x_ksqr_avx2.c:       vtmp[j] = CMPGT_I16(zero, vmap[j]);
gf2x_ksqr_avx2.c:                 ^
gf2x_ksqr_avx2.c: ./x86_64_intrinsic.h:70:27: note: expanded from macro 'CMPGT_I16'
gf2x_ksqr_avx2.c: #  define CMPGT_I16(a, b) _mm256_cmpgt_epi16(a, b)
gf2x_ksqr_avx2.c:                           ^
gf2x_ksqr_avx2.c: gf2x_ksqr_avx2.c:69:17: error: always_inline function '_mm256_add_epi16' requires target feature 'avx2', but would be inlined into function 'generate_map' that is compiled without support for 'avx2'
gf2x_ksqr_avx2.c:       vmap[j] = ADD_I16(vmap[j], vtmp[j] & vr);
gf2x_ksqr_avx2.c:                 ^
gf2x_ksqr_avx2.c: ./x86_64_intrinsic.h:63:28: note: expanded from macro 'ADD_I16'
gf2x_ksqr_avx2.c: #  define ADD_I16(a, b)    _mm256_add_epi16(a, b)
gf2x_ksqr_avx2.c:                            ^
gf2x_ksqr_avx2.c: 4 errors generated.

Number of similar (implementation,compiler) pairs: 4, namely:
Implementation  Compiler
T:ches2021  clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:ches2021  clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:ches2021  clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:ches2021  clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)

Compiler output


gf2x_ksqr_avx2.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/immintrin.h:47,
gf2x_ksqr_avx2.c:                  from x86_64_intrinsic.h:20,
gf2x_ksqr_avx2.c:                  from defs.h:106,
gf2x_ksqr_avx2.c:                  from bike_defs.h:10,
gf2x_ksqr_avx2.c:                  from types.h:15,
gf2x_ksqr_avx2.c:                  from utilities.h:20,
gf2x_ksqr_avx2.c:                  from cleanup.h:10,
gf2x_ksqr_avx2.c:                  from gf2x_ksqr_avx2.c:13:
gf2x_ksqr_avx2.c: gf2x_ksqr_avx2.c: In function 'bytes_to_bin':
gf2x_ksqr_avx2.c: /usr/lib/gcc/x86_64-linux-gnu/11/include/avx2intrin.h:433:1: error: inlining failed in call to 'always_inline' '_mm256_movemask_epi8': target specific option mismatch
gf2x_ksqr_avx2.c:   433 | _mm256_movemask_epi8 (__m256i __A)
gf2x_ksqr_avx2.c:       | ^~~~~~~~~~~~~~~~~~~~
gf2x_ksqr_avx2.c: In file included from defs.h:106,
gf2x_ksqr_avx2.c:                  from bike_defs.h:10,
gf2x_ksqr_avx2.c:                  from types.h:15,
gf2x_ksqr_avx2.c:                  from utilities.h:20,
gf2x_ksqr_avx2.c:                  from cleanup.h:10,
gf2x_ksqr_avx2.c:                  from gf2x_ksqr_avx2.c:13:
gf2x_ksqr_avx2.c: x86_64_intrinsic.h:80:23: note: called from here
gf2x_ksqr_avx2.c:    80 | #  define MOVEMASK(a) _mm256_movemask_epi8(a)
gf2x_ksqr_avx2.c:       |                       ^~~~~~~~~~~~~~~~~~~~~~~
gf2x_ksqr_avx2.c: gf2x_ksqr_avx2.c:85:17: note: in expansion of macro 'MOVEMASK'
gf2x_ksqr_avx2.c:    85 |     bin32[i]  = MOVEMASK(t);
gf2x_ksqr_avx2.c:       |                 ^~~~~~~~
gf2x_ksqr_avx2.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/immintrin.h:47,
gf2x_ksqr_avx2.c: ...

Number of similar (implementation,compiler) pairs: 4, namely:
Implementation  Compiler
T:ches2021  gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:ches2021  gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:ches2021  gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:ches2021  gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)