Implementation notes: amd64, margaux, crypto_kem/mceliece348864f

Computer: margaux
Architecture: amd64
CPU ID: GenuineIntel-000006fb-bfebfbff
SUPERCOP version: 20201130
Operation: crypto_kem
Primitive: mceliece348864f
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
132846372550 0 0165928 864 1568T:vecclang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020110420201018
133138872274 0 0164704 864 1568T:vecclang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020110420201018
133237872274 0 0164704 864 1568T:vecclang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020110420201018
141153342406 0 0133726 856 1568T:vecclang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020110420201018
145093549541 0 0142443 824 1600T:vecgcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020110420201018
147537040664 0 0132139 816 1568T:vecgcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020110420201018
149247972303 0 0165704 864 1568T:vecclang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020110420201018
169143349722 0 0142251 824 1600T:vecgcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020110420201018
8936442027492 0 0119976 864 1568T:refclang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020110420201018
8936525727492 0 0119976 864 1568T:refclang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020110420201018
8937699327978 0 0121456 864 1568T:refclang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020110420201018
8969478327476 0 0120880 864 1568T:refclang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020110420201018
10557956771012 0 0165845 832 1600T:refgcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020110420201018
13565049312900 0 0104286 856 1568T:refclang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020110420201018
14329246515005 0 0107507 824 1600T:refgcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020110420201018
14635501211809 0 0103235 816 1568T:refgcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020110420201018
15827874315068 0 0107971 824 1600T:refgcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020110420201018

Test failure

Implementation: T:vec
Security model: timingleaks
Compiler: gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE
error 111

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:vec

Compiler output

Implementation: T:avx
Security model: timingleaks
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
decrypt.c: In file included from decrypt.c:8:
decrypt.c: In file included from ./fft_tr.h:10:
decrypt.c: ./vec256.h:37:10: error: always_inline function '_mm256_set_epi64x' requires target feature 'xsave', but would be inlined into function 'vec256_set4x' that is compiled without support for 'xsave'
decrypt.c: return _mm256_set_epi64x(a3, a2, a1, a0);
decrypt.c: ^
decrypt.c: ./vec256.h:50:9: error: always_inline function '_mm256_and_si256' requires target feature 'xsave', but would be inlined into function 'vec256_and' that is compiled without support for 'xsave'
decrypt.c: return _mm256_and_si256(a, b);
decrypt.c: ^
decrypt.c: ./vec256.h:22:9: error: always_inline function '_mm256_set1_epi16' requires target feature 'xsave', but would be inlined into function 'vec256_set1_16b' that is compiled without support for 'xsave'
decrypt.c: return _mm256_set1_epi16(a);
decrypt.c: ^
decrypt.c: ./vec256.h:60:9: error: always_inline function '_mm256_or_si256' requires target feature 'xsave', but would be inlined into function 'vec256_or' that is compiled without support for 'xsave'
decrypt.c: return _mm256_or_si256(a, b);
decrypt.c: ^
decrypt.c: ./vec256.h:55:9: error: always_inline function '_mm256_xor_si256' requires target feature 'xsave', but would be inlined into function 'vec256_xor' that is compiled without support for 'xsave'
decrypt.c: return _mm256_xor_si256(a, b);
decrypt.c: ^
decrypt.c: In file included from decrypt.c:8:
decrypt.c: In file included from ./fft_tr.h:10:
decrypt.c: In file included from ./vec256.h:14:
decrypt.c: ./vec128.h:32:9: error: always_inline function '_mm_testz_si128' requires target feature 'sse4.1', but would be inlined into function 'vec128_testz' that is compiled without support for 'sse4.1'
decrypt.c: return _mm_testz_si128(a, a);
decrypt.c: ^
decrypt.c: decrypt.c:157:10: error: always_inline function '_mm256_extract_epi64' requires target feature 'xsave', but would be inlined into function 'reformat_256to128' that is compiled without support for 'xsave'
decrypt.c: v[0] = vec256_extract(in[i], 0);
decrypt.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:avx
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:avx
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:avx
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:avx

Compiler output

Implementation: T:avx
Security model: timingleaks
Compiler: clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
decrypt.c: In file included from decrypt.c:8:
decrypt.c: In file included from ./fft_tr.h:10:
decrypt.c: ./vec256.h:37:10: error: always_inline function '_mm256_set_epi64x' requires target feature 'sse4.2', but would be inlined into function 'vec256_set4x' that is compiled without support for 'sse4.2'
decrypt.c: return _mm256_set_epi64x(a3, a2, a1, a0);
decrypt.c: ^
decrypt.c: ./vec256.h:50:9: error: always_inline function '_mm256_and_si256' requires target feature 'sse4.2', but would be inlined into function 'vec256_and' that is compiled without support for 'sse4.2'
decrypt.c: return _mm256_and_si256(a, b);
decrypt.c: ^
decrypt.c: ./vec256.h:22:9: error: always_inline function '_mm256_set1_epi16' requires target feature 'sse4.2', but would be inlined into function 'vec256_set1_16b' that is compiled without support for 'sse4.2'
decrypt.c: return _mm256_set1_epi16(a);
decrypt.c: ^
decrypt.c: ./vec256.h:60:9: error: always_inline function '_mm256_or_si256' requires target feature 'sse4.2', but would be inlined into function 'vec256_or' that is compiled without support for 'sse4.2'
decrypt.c: return _mm256_or_si256(a, b);
decrypt.c: ^
decrypt.c: ./vec256.h:55:9: error: always_inline function '_mm256_xor_si256' requires target feature 'sse4.2', but would be inlined into function 'vec256_xor' that is compiled without support for 'sse4.2'
decrypt.c: return _mm256_xor_si256(a, b);
decrypt.c: ^
decrypt.c: In file included from decrypt.c:8:
decrypt.c: In file included from ./fft_tr.h:10:
decrypt.c: In file included from ./vec256.h:14:
decrypt.c: ./vec128.h:32:9: error: always_inline function '_mm_testz_si128' requires target feature 'ssse3', but would be inlined into function 'vec128_testz' that is compiled without support for 'ssse3'
decrypt.c: return _mm_testz_si128(a, a);
decrypt.c: ^
decrypt.c: decrypt.c:157:10: error: always_inline function '_mm256_extract_epi64' requires target feature 'sse4.2', but would be inlined into function 'reformat_256to128' that is compiled without support for 'sse4.2'
decrypt.c: v[0] = vec256_extract(in[i], 0);
decrypt.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:avx

Compiler output

Implementation: T:avx
Security model: timingleaks
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE
benes.c: In file included from transpose.h:11:0,
benes.c: from benes.c:6:
benes.c: vec256.h: In function 'vec256_set1_16b':
benes.c: vec256.h:21:1: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
benes.c: {
benes.c: ^
bm.c: In file included from vec128.h:14:0,
bm.c: from bm.h:10,
bm.c: from bm.c:10:
bm.c: bm.c: In function 'get_coefs':
bm.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/smmintrin.h:454:1: error: inlining failed in call to always_inline '_mm_extract_epi64': target specific option mismatch
bm.c: _mm_extract_epi64 (__m128i __X, const int __N)
bm.c: ^
bm.c: In file included from bm.h:10:0,
bm.c: from bm.c:10:
bm.c: vec128.h:28:42: error: called from here
bm.c: #define vec128_extract(a, i) ((uint64_t) _mm_extract_epi64((vec128) (a), (i)))
bm.c: ^
bm.c: bm.c:130:30: note: in expansion of macro 'vec128_extract'
bm.c: out[ (4*1 + k)*16 + i ] = (vec128_extract(buf[i], 1) >> (k*16)) & GFMASK;
bm.c: ^
bm.c: In file included from vec128.h:14:0,
bm.c: from bm.h:10,
bm.c: from bm.c:10:
bm.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/smmintrin.h:454:1: error: inlining failed in call to always_inline '_mm_extract_epi64': target specific option mismatch
bm.c: _mm_extract_epi64 (__m128i __X, const int __N)
bm.c: ^
bm.c: In file included from bm.h:10:0,
bm.c: from bm.c:10:
bm.c: vec128.h:28:42: error: called from here
bm.c: #define vec128_extract(a, i) ((uint64_t) _mm_extract_epi64((vec128) (a), (i)))
bm.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx

Compiler output

Implementation: T:sse
Security model: timingleaks
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
decrypt.c: In file included from decrypt.c:8:
decrypt.c: In file included from ./fft_tr.h:10:
decrypt.c: ./vec128.h:39:9: error: always_inline function '_mm_testz_si128' requires target feature 'sse4.1', but would be inlined into function 'vec128_testz' that is compiled without support for 'sse4.1'
decrypt.c: return _mm_testz_si128(a, a);
decrypt.c: ^
decrypt.c: fatal error: error in backend: Cannot select: 0x236b5c0: i32 = X86ISD::PTEST 0x238ce80, 0x238ce80
decrypt.c: 0x238ce80: v2i64,ch = CopyFromReg 0x2280880, Register:v2i64 %vreg93
decrypt.c: 0x23608f0: v2i64 = Register %vreg93
decrypt.c: 0x238ce80: v2i64,ch = CopyFromReg 0x2280880, Register:v2i64 %vreg93
decrypt.c: 0x23608f0: v2i64 = Register %vreg93
decrypt.c: In function: crypto_kem_mceliece348864f_sse_timingleaks_decrypt
decrypt.c: clang: error: clang frontend command failed with exit code 70 (use -v to see invocation)
decrypt.c: clang version 3.8.0-2ubuntu4 (tags/RELEASE_380/final)
decrypt.c: Target: x86_64-pc-linux-gnu
decrypt.c: Thread model: posix
decrypt.c: InstalledDir: /usr/bin
decrypt.c: clang: note: diagnostic msg: PLEASE submit a bug report to http://llvm.org/bugs/ and include the crash backtrace, preprocessed source, and associated run script.
decrypt.c: clang: note: diagnostic msg:
decrypt.c: ********************
decrypt.c:
decrypt.c: PLEASE ATTACH THE FOLLOWING FILES TO THE BUG REPORT:
decrypt.c: Preprocessed source(s) and associated run script(s) are located at:
decrypt.c: clang: note: diagnostic msg: /tmp/decrypt-480ee9.c
decrypt.c: clang: note: diagnostic msg: /tmp/decrypt-480ee9.sh
decrypt.c: clang: note: diagnostic msg:
decrypt.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:sse

Compiler output

Implementation: T:sse
Security model: timingleaks
Compiler: clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
decrypt.c: In file included from decrypt.c:8:
decrypt.c: In file included from ./fft_tr.h:10:
decrypt.c: ./vec128.h:39:9: error: always_inline function '_mm_testz_si128' requires target feature 'sse4.1', but would be inlined into function 'vec128_testz' that is compiled without support for 'sse4.1'
decrypt.c: return _mm_testz_si128(a, a);
decrypt.c: ^
decrypt.c: fatal error: error in backend: Cannot select: 0x37b3fa0: i32 = X86ISD::PTEST 0x379e940, 0x379e940
decrypt.c: 0x379e940: v2i64,ch = CopyFromReg 0x3690970, Register:v2i64 %vreg93
decrypt.c: 0x3789a30: v2i64 = Register %vreg93
decrypt.c: 0x379e940: v2i64,ch = CopyFromReg 0x3690970, Register:v2i64 %vreg93
decrypt.c: 0x3789a30: v2i64 = Register %vreg93
decrypt.c: In function: crypto_kem_mceliece348864f_sse_timingleaks_decrypt
decrypt.c: clang: error: clang frontend command failed with exit code 70 (use -v to see invocation)
decrypt.c: clang version 3.8.0-2ubuntu4 (tags/RELEASE_380/final)
decrypt.c: Target: x86_64-pc-linux-gnu
decrypt.c: Thread model: posix
decrypt.c: InstalledDir: /usr/bin
decrypt.c: clang: note: diagnostic msg: PLEASE submit a bug report to http://llvm.org/bugs/ and include the crash backtrace, preprocessed source, and associated run script.
decrypt.c: clang: note: diagnostic msg:
decrypt.c: ********************
decrypt.c:
decrypt.c: PLEASE ATTACH THE FOLLOWING FILES TO THE BUG REPORT:
decrypt.c: Preprocessed source(s) and associated run script(s) are located at:
decrypt.c: clang: note: diagnostic msg: /tmp/decrypt-371783.c
decrypt.c: clang: note: diagnostic msg: /tmp/decrypt-371783.sh
decrypt.c: clang: note: diagnostic msg:
decrypt.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:sse

Compiler output

Implementation: T:sse
Security model: timingleaks
Compiler: clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
decrypt.c: In file included from decrypt.c:8:
decrypt.c: In file included from ./fft_tr.h:10:
decrypt.c: ./vec128.h:39:9: error: always_inline function '_mm_testz_si128' requires target feature 'sse4.1', but would be inlined into function 'vec128_testz' that is compiled without support for 'sse4.1'
decrypt.c: return _mm_testz_si128(a, a);
decrypt.c: ^
decrypt.c: fatal error: error in backend: Cannot select: 0x3562f50: i32 = X86ISD::PTEST 0x358c650, 0x358c650
decrypt.c: 0x358c650: v2i64,ch = CopyFromReg 0x347f860, Register:v2i64 %vreg93
decrypt.c: 0x3515ec0: v2i64 = Register %vreg93
decrypt.c: 0x358c650: v2i64,ch = CopyFromReg 0x347f860, Register:v2i64 %vreg93
decrypt.c: 0x3515ec0: v2i64 = Register %vreg93
decrypt.c: In function: crypto_kem_mceliece348864f_sse_timingleaks_decrypt
decrypt.c: clang: error: clang frontend command failed with exit code 70 (use -v to see invocation)
decrypt.c: clang version 3.8.0-2ubuntu4 (tags/RELEASE_380/final)
decrypt.c: Target: x86_64-pc-linux-gnu
decrypt.c: Thread model: posix
decrypt.c: InstalledDir: /usr/bin
decrypt.c: clang: note: diagnostic msg: PLEASE submit a bug report to http://llvm.org/bugs/ and include the crash backtrace, preprocessed source, and associated run script.
decrypt.c: clang: note: diagnostic msg:
decrypt.c: ********************
decrypt.c:
decrypt.c: PLEASE ATTACH THE FOLLOWING FILES TO THE BUG REPORT:
decrypt.c: Preprocessed source(s) and associated run script(s) are located at:
decrypt.c: clang: note: diagnostic msg: /tmp/decrypt-1d6cf2.c
decrypt.c: clang: note: diagnostic msg: /tmp/decrypt-1d6cf2.sh
decrypt.c: clang: note: diagnostic msg:
decrypt.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:sse

Compiler output

Implementation: T:sse
Security model: timingleaks
Compiler: clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
decrypt.c: In file included from decrypt.c:8:
decrypt.c: In file included from ./fft_tr.h:10:
decrypt.c: ./vec128.h:39:9: error: always_inline function '_mm_testz_si128' requires target feature 'sse4.1', but would be inlined into function 'vec128_testz' that is compiled without support for 'sse4.1'
decrypt.c: return _mm_testz_si128(a, a);
decrypt.c: ^
decrypt.c: fatal error: error in backend: Cannot select: 0x350a6e0: i32 = X86ISD::PTEST 0x352e600, 0x352e600
decrypt.c: 0x352e600: v2i64,ch = CopyFromReg 0x343e390, Register:v2i64 %vreg86
decrypt.c: 0x350b230: v2i64 = Register %vreg86
decrypt.c: 0x352e600: v2i64,ch = CopyFromReg 0x343e390, Register:v2i64 %vreg86
decrypt.c: 0x350b230: v2i64 = Register %vreg86
decrypt.c: In function: crypto_kem_mceliece348864f_sse_timingleaks_decrypt
decrypt.c: clang: error: clang frontend command failed with exit code 70 (use -v to see invocation)
decrypt.c: clang version 3.8.0-2ubuntu4 (tags/RELEASE_380/final)
decrypt.c: Target: x86_64-pc-linux-gnu
decrypt.c: Thread model: posix
decrypt.c: InstalledDir: /usr/bin
decrypt.c: clang: note: diagnostic msg: PLEASE submit a bug report to http://llvm.org/bugs/ and include the crash backtrace, preprocessed source, and associated run script.
decrypt.c: clang: note: diagnostic msg:
decrypt.c: ********************
decrypt.c:
decrypt.c: PLEASE ATTACH THE FOLLOWING FILES TO THE BUG REPORT:
decrypt.c: Preprocessed source(s) and associated run script(s) are located at:
decrypt.c: clang: note: diagnostic msg: /tmp/decrypt-98aca0.c
decrypt.c: clang: note: diagnostic msg: /tmp/decrypt-98aca0.sh
decrypt.c: clang: note: diagnostic msg:
decrypt.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:sse

Compiler output

Implementation: T:sse
Security model: timingleaks
Compiler: clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
decrypt.c: In file included from decrypt.c:8:
decrypt.c: In file included from ./fft_tr.h:10:
decrypt.c: ./vec128.h:39:9: error: always_inline function '_mm_testz_si128' requires target feature 'ssse3', but would be inlined into function 'vec128_testz' that is compiled without support for 'ssse3'
decrypt.c: return _mm_testz_si128(a, a);
decrypt.c: ^
decrypt.c: fatal error: error in backend: Cannot select: 0x33de7b0: i32 = X86ISD::PTEST 0x33d04e0, 0x33d04e0
decrypt.c: 0x33d04e0: v2i64,ch = CopyFromReg 0x32d36a0, Register:v2i64 %vreg93
decrypt.c: 0x336b260: v2i64 = Register %vreg93
decrypt.c: 0x33d04e0: v2i64,ch = CopyFromReg 0x32d36a0, Register:v2i64 %vreg93
decrypt.c: 0x336b260: v2i64 = Register %vreg93
decrypt.c: In function: crypto_kem_mceliece348864f_sse_timingleaks_decrypt
decrypt.c: clang: error: clang frontend command failed with exit code 70 (use -v to see invocation)
decrypt.c: clang version 3.8.0-2ubuntu4 (tags/RELEASE_380/final)
decrypt.c: Target: x86_64-pc-linux-gnu
decrypt.c: Thread model: posix
decrypt.c: InstalledDir: /usr/bin
decrypt.c: clang: note: diagnostic msg: PLEASE submit a bug report to http://llvm.org/bugs/ and include the crash backtrace, preprocessed source, and associated run script.
decrypt.c: clang: note: diagnostic msg:
decrypt.c: ********************
decrypt.c:
decrypt.c: PLEASE ATTACH THE FOLLOWING FILES TO THE BUG REPORT:
decrypt.c: Preprocessed source(s) and associated run script(s) are located at:
decrypt.c: clang: note: diagnostic msg: /tmp/decrypt-19b9e0.c
decrypt.c: clang: note: diagnostic msg: /tmp/decrypt-19b9e0.sh
decrypt.c: clang: note: diagnostic msg:
decrypt.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:sse

Compiler output

Implementation: T:sse
Security model: timingleaks
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE
bm.c: In file included from vec128.h:16:0,
bm.c: from bm.h:13,
bm.c: from bm.c:10:
bm.c: bm.c: In function 'get_coefs':
bm.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/smmintrin.h:454:1: error: inlining failed in call to always_inline '_mm_extract_epi64': target specific option mismatch
bm.c: _mm_extract_epi64 (__m128i __X, const int __N)
bm.c: ^
bm.c: In file included from bm.h:13:0,
bm.c: from bm.c:10:
bm.c: vec128.h:35:42: error: called from here
bm.c: #define vec128_extract(a, i) ((uint64_t) _mm_extract_epi64((vec128) (a), (i)))
bm.c: ^
bm.c: bm.c:156:30: note: in expansion of macro 'vec128_extract'
bm.c: out[ (4*1 + k)*16 + i ] = (vec128_extract(buf[i], 1) >> (k*16)) & GFMASK;
bm.c: ^
bm.c: In file included from vec128.h:16:0,
bm.c: from bm.h:13,
bm.c: from bm.c:10:
bm.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/smmintrin.h:454:1: error: inlining failed in call to always_inline '_mm_extract_epi64': target specific option mismatch
bm.c: _mm_extract_epi64 (__m128i __X, const int __N)
bm.c: ^
bm.c: In file included from bm.h:13:0,
bm.c: from bm.c:10:
bm.c: vec128.h:35:42: error: called from here
bm.c: #define vec128_extract(a, i) ((uint64_t) _mm_extract_epi64((vec128) (a), (i)))
bm.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:sse
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:sse
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE T:sse
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE T:sse