Implementation notes: amd64, latour, crypto_kem/mceliece348864

Computer: latour
Architecture: amd64
CPU ID: GenuineIntel-000006fb-bfebfbff
SUPERCOP version: 20191221
Operation: crypto_kem
Primitive: mceliece348864
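Note on the CPU ID field: the sketch below assumes the string has the form vendor, CPUID leaf 1 EAX, CPUID leaf 1 EDX. That layout is an assumption about the report format and is not stated on this page. The helper names decode_signature and has_edx_bit are hypothetical and exist only for illustration.

```c
#include <stdint.h>

/* Display family/model/stepping derived from a CPUID leaf 1 EAX value. */
struct cpu_sig {
    unsigned family;
    unsigned model;
    unsigned stepping;
};

/* Decode the processor signature following the usual x86 convention:
 * the extended family is added only when the base family is 0xf, and
 * the extended model is used only when the base family is 6 or 0xf. */
static struct cpu_sig decode_signature(uint32_t eax)
{
    struct cpu_sig s;
    unsigned base_family = (eax >> 8) & 0xf;
    unsigned base_model  = (eax >> 4) & 0xf;
    unsigned ext_family  = (eax >> 20) & 0xff;
    unsigned ext_model   = (eax >> 16) & 0xf;

    s.stepping = eax & 0xf;
    s.family = (base_family == 0xf) ? base_family + ext_family : base_family;
    s.model = (base_family == 0x6 || base_family == 0xf)
                  ? ((ext_model << 4) | base_model)
                  : base_model;
    return s;
}

/* Test one feature bit of a CPUID leaf 1 EDX value (bit 26 is SSE2). */
static int has_edx_bit(uint32_t edx, unsigned bit)
{
    return (int)((edx >> bit) & 1u);
}
```

Under that assumption, 0x000006fb decodes to family 6, model 15, stepping 11, and bit 26 (SSE2) of 0xbfebfbff is set. Family 6, model 15 is the signature of Intel's 65 nm Core 2 processors, which do not implement SSE4.1 or AVX; that would be consistent with the sse and avx compile errors further down. SSE4.1 and AVX themselves are reported in ECX of leaf 1, which the ID string does not appear to carry.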
     Time  Object size  Test size        Implementation  Compiler                                                                             Benchmark date  SUPERCOP version
  1331208  71620 0 0    164120 864 1576  vec             clang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE  20200106        20191221
  1331919  71671 0 0    163248 864 1576  vec             clang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE  20200106        20191221
  1423143  41723 0 0    132302 856 1576  vec             clang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE  20200106        20191221
  1440900  40047 0 0    130691 816 1576  vec             gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE         20200106        20191221
  1442466  48646 0 0    140499 824 1608  vec             gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE         20200106        20191221
  1470978  48840 0 0    140483 824 1608  vec             gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE          20200106        20191221
  1498482  71671 0 0    163248 864 1576  vec             clang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE   20200106        20191221
  1505025  71883 0 0    164376 864 1576  vec             clang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE   20200106        20191221
 89434323  26145 0 0    118720 864 1576  ref             clang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE  20200106        20191221
 89730477  26257 0 0    117880 864 1576  ref             clang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE   20200106        20191221
 89737506  26257 0 0    117880 864 1576  ref             clang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE  20200106        20191221
 89746065  26567 0 0    119104 864 1576  ref             clang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE   20200106        20191221
105580962  70138 0 0    163973 832 1608  ref             gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE         20200106        20191221
136950525  11730 0 0    102350 856 1576  ref             clang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE  20200106        20191221
154328706  13698 0 0    105291 824 1608  ref             gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE          20200106        20191221
160146837  13784 0 0    105619 824 1608  ref             gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE         20200106        20191221
187189380  10859 0 0    101411 816 1576  ref             gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE         20200106        20191221

Test failure

Implementation: crypto_kem/mceliece348864/vec
Compiler: gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE
error 111

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler Implementations
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE vec

Compiler output

Implementation: crypto_kem/mceliece348864/sse
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
decrypt.c: In file included from decrypt.c:8:
decrypt.c: In file included from ./fft_tr.h:9:
decrypt.c: ./vec128.h:30:9: error: always_inline function '_mm_testz_si128' requires target feature 'sse4.1', but would be inlined into function 'vec128_testz' that is compiled without support for 'sse4.1'
decrypt.c: return _mm_testz_si128(a, a);
decrypt.c: ^
decrypt.c: fatal error: error in backend: Cannot select: 0x309bb00: i32 = X86ISD::PTEST 0x3088520, 0x3088520
decrypt.c: 0x3088520: v2i64,ch = CopyFromReg 0x2fa00f0, Register:v2i64 %vreg93
decrypt.c: 0x30c3300: v2i64 = Register %vreg93
decrypt.c: 0x3088520: v2i64,ch = CopyFromReg 0x2fa00f0, Register:v2i64 %vreg93
decrypt.c: 0x30c3300: v2i64 = Register %vreg93
decrypt.c: In function: decrypt
decrypt.c: clang: error: clang frontend command failed with exit code 70 (use -v to see invocation)
decrypt.c: clang version 3.8.0-2ubuntu4 (tags/RELEASE_380/final)
decrypt.c: Target: x86_64-pc-linux-gnu
decrypt.c: Thread model: posix
decrypt.c: InstalledDir: /usr/bin
decrypt.c: clang: note: diagnostic msg: PLEASE submit a bug report to http://llvm.org/bugs/ and include the crash backtrace, preprocessed source, and associated run script.
decrypt.c: clang: note: diagnostic msg:
decrypt.c: ********************
decrypt.c:
decrypt.c: PLEASE ATTACH THE FOLLOWING FILES TO THE BUG REPORT:
decrypt.c: Preprocessed source(s) and associated run script(s) are located at:
decrypt.c: clang: note: diagnostic msg: /tmp/decrypt-3996d0.c
decrypt.c: clang: note: diagnostic msg: /tmp/decrypt-3996d0.sh
decrypt.c: clang: note: diagnostic msg:
decrypt.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler Implementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE sse
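The error above reports that vec128_testz calls _mm_testz_si128 (the PTEST instruction, which belongs to SSE4.1) in a build where -march=native did not enable SSE4.1. One possible way to keep such a helper compilable on an SSE2-only target is a compile-time guard with an SSE2 fallback. This is only a sketch: vec128_testz_sketch is a hypothetical name, it is not the code in vec128.h, and nothing here describes how the mceliece348864 implementations actually handle this case.

```c
#include <stdint.h>
#include <emmintrin.h>      /* SSE2, part of the amd64 baseline */
#if defined(__SSE4_1__)
#include <smmintrin.h>      /* SSE4.1, only when the compiler targets it */
#endif

typedef __m128i vec128;

/* Returns 1 if all 128 bits of a are zero, and 0 otherwise. */
static inline int vec128_testz_sketch(vec128 a)
{
#if defined(__SSE4_1__)
    /* PTEST sets ZF when (a AND a) == 0; the intrinsic returns ZF. */
    return _mm_testz_si128(a, a);
#else
    /* SSE2 fallback: compare every byte with zero, then gather the
     * 16 per-byte results into a 16-bit mask. All bytes are zero
     * exactly when the mask is 0xFFFF. */
    vec128 eq = _mm_cmpeq_epi8(a, _mm_setzero_si128());
    return _mm_movemask_epi8(eq) == 0xFFFF;
#endif
}
```

Both branches are branch-free and return the same value for every input, so callers would not need to know which one was compiled in.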

Compiler output

Implementation: crypto_kem/mceliece348864/avx
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
decrypt.c: In file included from decrypt.c:8:
decrypt.c: In file included from ./fft_tr.h:9:
decrypt.c: ./vec256.h:27:10: error: always_inline function '_mm256_set_epi64x' requires target feature 'xsave', but would be inlined into function 'vec256_set4x' that is compiled without support for 'xsave'
decrypt.c: return _mm256_set_epi64x(a3, a2, a1, a0);
decrypt.c: ^
decrypt.c: ./vec256.h:40:9: error: always_inline function '_mm256_and_si256' requires target feature 'xsave', but would be inlined into function 'vec256_and' that is compiled without support for 'xsave'
decrypt.c: return _mm256_and_si256(a, b);
decrypt.c: ^
decrypt.c: ./vec256.h:17:9: error: always_inline function '_mm256_set1_epi16' requires target feature 'xsave', but would be inlined into function 'vec256_set1_16b' that is compiled without support for 'xsave'
decrypt.c: return _mm256_set1_epi16(a);
decrypt.c: ^
decrypt.c: ./vec256.h:50:9: error: always_inline function '_mm256_or_si256' requires target feature 'xsave', but would be inlined into function 'vec256_or' that is compiled without support for 'xsave'
decrypt.c: return _mm256_or_si256(a, b);
decrypt.c: ^
decrypt.c: ./vec256.h:45:9: error: always_inline function '_mm256_xor_si256' requires target feature 'xsave', but would be inlined into function 'vec256_xor' that is compiled without support for 'xsave'
decrypt.c: return _mm256_xor_si256(a, b);
decrypt.c: ^
decrypt.c: In file included from decrypt.c:8:
decrypt.c: In file included from ./fft_tr.h:9:
decrypt.c: In file included from ./vec256.h:9:
decrypt.c: ./vec128.h:30:9: error: always_inline function '_mm_testz_si128' requires target feature 'sse4.1', but would be inlined into function 'vec128_testz' that is compiled without support for 'sse4.1'
decrypt.c: return _mm_testz_si128(a, a);
decrypt.c: ^
decrypt.c: decrypt.c:157:10: error: always_inline function '_mm256_extract_epi64' requires target feature 'xsave', but would be inlined into function 'reformat_256to128' that is compiled without support for 'xsave'
decrypt.c: v[0] = vec256_extract(in[i], 0);
decrypt.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
Compiler Implementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE avx
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE avx
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE avx
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE avx

Compiler output

Implementation: crypto_kem/mceliece348864/sse
Compiler: clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
decrypt.c: In file included from decrypt.c:8:
decrypt.c: In file included from ./fft_tr.h:9:
decrypt.c: ./vec128.h:30:9: error: always_inline function '_mm_testz_si128' requires target feature 'sse4.1', but would be inlined into function 'vec128_testz' that is compiled without support for 'sse4.1'
decrypt.c: return _mm_testz_si128(a, a);
decrypt.c: ^
decrypt.c: fatal error: error in backend: Cannot select: 0x2bc9060: i32 = X86ISD::PTEST 0x2b6c540, 0x2b6c540
decrypt.c: 0x2b6c540: v2i64,ch = CopyFromReg 0x2ad51a0, Register:v2i64 %vreg93
decrypt.c: 0x2bbf890: v2i64 = Register %vreg93
decrypt.c: 0x2b6c540: v2i64,ch = CopyFromReg 0x2ad51a0, Register:v2i64 %vreg93
decrypt.c: 0x2bbf890: v2i64 = Register %vreg93
decrypt.c: In function: decrypt
decrypt.c: clang: error: clang frontend command failed with exit code 70 (use -v to see invocation)
decrypt.c: clang version 3.8.0-2ubuntu4 (tags/RELEASE_380/final)
decrypt.c: Target: x86_64-pc-linux-gnu
decrypt.c: Thread model: posix
decrypt.c: InstalledDir: /usr/bin
decrypt.c: clang: note: diagnostic msg: PLEASE submit a bug report to http://llvm.org/bugs/ and include the crash backtrace, preprocessed source, and associated run script.
decrypt.c: clang: note: diagnostic msg:
decrypt.c: ********************
decrypt.c:
decrypt.c: PLEASE ATTACH THE FOLLOWING FILES TO THE BUG REPORT:
decrypt.c: Preprocessed source(s) and associated run script(s) are located at:
decrypt.c: clang: note: diagnostic msg: /tmp/decrypt-6a2267.c
decrypt.c: clang: note: diagnostic msg: /tmp/decrypt-6a2267.sh
decrypt.c: clang: note: diagnostic msg:
decrypt.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler Implementations
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE sse

Compiler output

Implementation: crypto_kem/mceliece348864/sse
Compiler: clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
decrypt.c: In file included from decrypt.c:8:
decrypt.c: In file included from ./fft_tr.h:9:
decrypt.c: ./vec128.h:30:9: error: always_inline function '_mm_testz_si128' requires target feature 'sse4.1', but would be inlined into function 'vec128_testz' that is compiled without support for 'sse4.1'
decrypt.c: return _mm_testz_si128(a, a);
decrypt.c: ^
decrypt.c: fatal error: error in backend: Cannot select: 0x31a3310: i32 = X86ISD::PTEST 0x31b9210, 0x31b9210
decrypt.c: 0x31b9210: v2i64,ch = CopyFromReg 0x30bd0b0, Register:v2i64 %vreg93
decrypt.c: 0x3153d40: v2i64 = Register %vreg93
decrypt.c: 0x31b9210: v2i64,ch = CopyFromReg 0x30bd0b0, Register:v2i64 %vreg93
decrypt.c: 0x3153d40: v2i64 = Register %vreg93
decrypt.c: In function: decrypt
decrypt.c: clang: error: clang frontend command failed with exit code 70 (use -v to see invocation)
decrypt.c: clang version 3.8.0-2ubuntu4 (tags/RELEASE_380/final)
decrypt.c: Target: x86_64-pc-linux-gnu
decrypt.c: Thread model: posix
decrypt.c: InstalledDir: /usr/bin
decrypt.c: clang: note: diagnostic msg: PLEASE submit a bug report to http://llvm.org/bugs/ and include the crash backtrace, preprocessed source, and associated run script.
decrypt.c: clang: note: diagnostic msg:
decrypt.c: ********************
decrypt.c:
decrypt.c: PLEASE ATTACH THE FOLLOWING FILES TO THE BUG REPORT:
decrypt.c: Preprocessed source(s) and associated run script(s) are located at:
decrypt.c: clang: note: diagnostic msg: /tmp/decrypt-e9ccc8.c
decrypt.c: clang: note: diagnostic msg: /tmp/decrypt-e9ccc8.sh
decrypt.c: clang: note: diagnostic msg:
decrypt.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler Implementations
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE sse

Compiler output

Implementation: crypto_kem/mceliece348864/sse
Compiler: clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
decrypt.c: In file included from decrypt.c:8:
decrypt.c: In file included from ./fft_tr.h:9:
decrypt.c: ./vec128.h:30:9: error: always_inline function '_mm_testz_si128' requires target feature 'sse4.1', but would be inlined into function 'vec128_testz' that is compiled without support for 'sse4.1'
decrypt.c: return _mm_testz_si128(a, a);
decrypt.c: ^
decrypt.c: fatal error: error in backend: Cannot select: 0x2c796d0: i32 = X86ISD::PTEST 0x2caaf50, 0x2caaf50
decrypt.c: 0x2caaf50: v2i64,ch = CopyFromReg 0x2bbbbe0, Register:v2i64 %vreg86
decrypt.c: 0x2c7a220: v2i64 = Register %vreg86
decrypt.c: 0x2caaf50: v2i64,ch = CopyFromReg 0x2bbbbe0, Register:v2i64 %vreg86
decrypt.c: 0x2c7a220: v2i64 = Register %vreg86
decrypt.c: In function: decrypt
decrypt.c: clang: error: clang frontend command failed with exit code 70 (use -v to see invocation)
decrypt.c: clang version 3.8.0-2ubuntu4 (tags/RELEASE_380/final)
decrypt.c: Target: x86_64-pc-linux-gnu
decrypt.c: Thread model: posix
decrypt.c: InstalledDir: /usr/bin
decrypt.c: clang: note: diagnostic msg: PLEASE submit a bug report to http://llvm.org/bugs/ and include the crash backtrace, preprocessed source, and associated run script.
decrypt.c: clang: note: diagnostic msg:
decrypt.c: ********************
decrypt.c:
decrypt.c: PLEASE ATTACH THE FOLLOWING FILES TO THE BUG REPORT:
decrypt.c: Preprocessed source(s) and associated run script(s) are located at:
decrypt.c: clang: note: diagnostic msg: /tmp/decrypt-56390e.c
decrypt.c: clang: note: diagnostic msg: /tmp/decrypt-56390e.sh
decrypt.c: clang: note: diagnostic msg:
decrypt.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler Implementations
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE sse

Compiler output

Implementation: crypto_kem/mceliece348864/sse
Compiler: clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
decrypt.c: In file included from decrypt.c:8:
decrypt.c: In file included from ./fft_tr.h:9:
decrypt.c: ./vec128.h:30:9: error: always_inline function '_mm_testz_si128' requires target feature 'ssse3', but would be inlined into function 'vec128_testz' that is compiled without support for 'ssse3'
decrypt.c: return _mm_testz_si128(a, a);
decrypt.c: ^
decrypt.c: fatal error: error in backend: Cannot select: 0x39a6800: i32 = X86ISD::PTEST 0x39937e0, 0x39937e0
decrypt.c: 0x39937e0: v2i64,ch = CopyFromReg 0x38b2f80, Register:v2i64 %vreg93
decrypt.c: 0x399cee0: v2i64 = Register %vreg93
decrypt.c: 0x39937e0: v2i64,ch = CopyFromReg 0x38b2f80, Register:v2i64 %vreg93
decrypt.c: 0x399cee0: v2i64 = Register %vreg93
decrypt.c: In function: decrypt
decrypt.c: clang: error: clang frontend command failed with exit code 70 (use -v to see invocation)
decrypt.c: clang version 3.8.0-2ubuntu4 (tags/RELEASE_380/final)
decrypt.c: Target: x86_64-pc-linux-gnu
decrypt.c: Thread model: posix
decrypt.c: InstalledDir: /usr/bin
decrypt.c: clang: note: diagnostic msg: PLEASE submit a bug report to http://llvm.org/bugs/ and include the crash backtrace, preprocessed source, and associated run script.
decrypt.c: clang: note: diagnostic msg:
decrypt.c: ********************
decrypt.c:
decrypt.c: PLEASE ATTACH THE FOLLOWING FILES TO THE BUG REPORT:
decrypt.c: Preprocessed source(s) and associated run script(s) are located at:
decrypt.c: clang: note: diagnostic msg: /tmp/decrypt-180b37.c
decrypt.c: clang: note: diagnostic msg: /tmp/decrypt-180b37.sh
decrypt.c: clang: note: diagnostic msg:
decrypt.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler Implementations
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE sse

Compiler output

Implementation: crypto_kem/mceliece348864/avx
Compiler: clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
decrypt.c: In file included from decrypt.c:8:
decrypt.c: In file included from ./fft_tr.h:9:
decrypt.c: ./vec256.h:27:10: error: always_inline function '_mm256_set_epi64x' requires target feature 'sse4.2', but would be inlined into function 'vec256_set4x' that is compiled without support for 'sse4.2'
decrypt.c: return _mm256_set_epi64x(a3, a2, a1, a0);
decrypt.c: ^
decrypt.c: ./vec256.h:40:9: error: always_inline function '_mm256_and_si256' requires target feature 'sse4.2', but would be inlined into function 'vec256_and' that is compiled without support for 'sse4.2'
decrypt.c: return _mm256_and_si256(a, b);
decrypt.c: ^
decrypt.c: ./vec256.h:17:9: error: always_inline function '_mm256_set1_epi16' requires target feature 'sse4.2', but would be inlined into function 'vec256_set1_16b' that is compiled without support for 'sse4.2'
decrypt.c: return _mm256_set1_epi16(a);
decrypt.c: ^
decrypt.c: ./vec256.h:50:9: error: always_inline function '_mm256_or_si256' requires target feature 'sse4.2', but would be inlined into function 'vec256_or' that is compiled without support for 'sse4.2'
decrypt.c: return _mm256_or_si256(a, b);
decrypt.c: ^
decrypt.c: ./vec256.h:45:9: error: always_inline function '_mm256_xor_si256' requires target feature 'sse4.2', but would be inlined into function 'vec256_xor' that is compiled without support for 'sse4.2'
decrypt.c: return _mm256_xor_si256(a, b);
decrypt.c: ^
decrypt.c: In file included from decrypt.c:8:
decrypt.c: In file included from ./fft_tr.h:9:
decrypt.c: In file included from ./vec256.h:9:
decrypt.c: ./vec128.h:30:9: error: always_inline function '_mm_testz_si128' requires target feature 'ssse3', but would be inlined into function 'vec128_testz' that is compiled without support for 'ssse3'
decrypt.c: return _mm_testz_si128(a, a);
decrypt.c: ^
decrypt.c: decrypt.c:157:10: error: always_inline function '_mm256_extract_epi64' requires target feature 'sse4.2', but would be inlined into function 'reformat_256to128' that is compiled without support for 'sse4.2'
decrypt.c: v[0] = vec256_extract(in[i], 0);
decrypt.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler Implementations
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE avx

Compiler output

Implementation: crypto_kem/mceliece348864/avx
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE
benes.c: In file included from transpose.h:8:0,
benes.c: from benes.c:6:
benes.c: vec256.h: In function 'vec256_set1_16b':
benes.c: vec256.h:16:1: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
benes.c: {
benes.c: ^
bm.c: In file included from vec128.h:12:0,
bm.c: from bm.h:9,
bm.c: from bm.c:6:
bm.c: bm.c: In function 'get_coefs':
bm.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/smmintrin.h:454:1: error: inlining failed in call to always_inline '_mm_extract_epi64': target specific option mismatch
bm.c: _mm_extract_epi64 (__m128i __X, const int __N)
bm.c: ^
bm.c: In file included from bm.h:9:0,
bm.c: from bm.c:6:
bm.c: vec128.h:26:42: error: called from here
bm.c: #define vec128_extract(a, i) ((uint64_t) _mm_extract_epi64((vec128) (a), (i)))
bm.c: ^
bm.c: bm.c:126:30: note: in expansion of macro 'vec128_extract'
bm.c: out[ (4*j + k)*16 + i ] = (vec128_extract(buf[i], j) >> (k*16)) & GFMASK;
bm.c: ^
bm.c: In file included from vec128.h:12:0,
bm.c: from bm.h:9,
bm.c: from bm.c:6:
bm.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/smmintrin.h:454:1: error: inlining failed in call to always_inline '_mm_extract_epi64': target specific option mismatch
bm.c: _mm_extract_epi64 (__m128i __X, const int __N)
bm.c: ^
bm.c: In file included from bm.h:9:0,
bm.c: from bm.c:6:
bm.c: vec128.h:26:42: error: called from here
bm.c: #define vec128_extract(a, i) ((uint64_t) _mm_extract_epi64((vec128) (a), (i)))
bm.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
Compiler Implementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE avx
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE avx
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE avx
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE avx

Compiler output

Implementation: crypto_kem/mceliece348864/sse
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE
bm.c: In file included from vec128.h:12:0,
bm.c: from bm.h:12,
bm.c: from bm.c:6:
bm.c: bm.c: In function 'get_coefs':
bm.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/smmintrin.h:454:1: error: inlining failed in call to always_inline '_mm_extract_epi64': target specific option mismatch
bm.c: _mm_extract_epi64 (__m128i __X, const int __N)
bm.c: ^
bm.c: In file included from bm.h:12:0,
bm.c: from bm.c:6:
bm.c: vec128.h:26:42: error: called from here
bm.c: #define vec128_extract(a, i) ((uint64_t) _mm_extract_epi64((vec128) (a), (i)))
bm.c: ^
bm.c: bm.c:152:30: note: in expansion of macro 'vec128_extract'
bm.c: out[ (4*j + k)*16 + i ] = (vec128_extract(buf[i], j) >> (k*16)) & GFMASK;
bm.c: ^
bm.c: In file included from vec128.h:12:0,
bm.c: from bm.h:12,
bm.c: from bm.c:6:
bm.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/smmintrin.h:454:1: error: inlining failed in call to always_inline '_mm_extract_epi64': target specific option mismatch
bm.c: _mm_extract_epi64 (__m128i __X, const int __N)
bm.c: ^
bm.c: In file included from bm.h:12:0,
bm.c: from bm.c:6:
bm.c: vec128.h:26:42: error: called from here
bm.c: #define vec128_extract(a, i) ((uint64_t) _mm_extract_epi64((vec128) (a), (i)))
bm.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
Compiler Implementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE sse
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE sse
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE sse
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE sse
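
GCC's "target specific option mismatch" in the logs above means that _mm_extract_epi64, an SSE4.1 intrinsic, is being inlined into code compiled without that feature. The sketch below shows one general technique for this situation, assuming GCC or clang on amd64: enable SSE4.1 for a single function with the target attribute, and choose between that function and an SSE2 fallback at run time with __builtin_cpu_supports. All function names are hypothetical, and this is not taken from the mceliece348864 sources.

```c
#include <stdint.h>
#include <smmintrin.h>   /* SSE4.1 intrinsics; also pulls in the SSE2 ones */

/* SSE4.1 path: the target attribute enables PEXTRQ for this function
 * only, so the rest of the file can stay at the SSE2 baseline. */
__attribute__((target("sse4.1")))
static uint64_t extract_hi_sse41(__m128i a)
{
    return (uint64_t) _mm_extract_epi64(a, 1);
}

/* SSE2 path: copy the high 64-bit lane into the low lane, then move
 * the low lane to a general-purpose register. */
static uint64_t extract_hi_sse2(__m128i a)
{
    return (uint64_t) _mm_cvtsi128_si64(_mm_unpackhi_epi64(a, a));
}

/* Run-time dispatch: call the SSE4.1 version only if the CPU that is
 * executing the program reports the feature. */
static uint64_t extract_hi(__m128i a)
{
    if (__builtin_cpu_supports("sse4.1"))
        return extract_hi_sse41(a);
    return extract_hi_sse2(a);
}
```

A benchmarking framework that selects implementations per machine may prefer to skip the SSE4.1 build entirely on such hardware; the dispatch shown here is just one option.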