Implementation notes: amd64, margaux, crypto_kem/mceliece348864

Computer: margaux
Architecture: amd64
CPU ID: GenuineIntel-000006fb-bfebfbff
SUPERCOP version: 20191221
Operation: crypto_kem
Primitive: mceliece348864
Time       Object size  Test size        Implementation  Compiler                                                                             Benchmark date  SUPERCOP version
1331370    71883 0 0    164376 864 1576  vec             clang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE   20200106        20191221
1332648    71671 0 0    163248 864 1576  vec             clang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE  20200106        20191221
1333602    71671 0 0    163248 864 1576  vec             clang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE   20200106        20191221
1334088    71620 0 0    164152 864 1576  vec             clang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE  20200106        20191221
1450593    40047 0 0    130691 816 1576  vec             gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE         20200106        20191221
1452006    48646 0 0    140531 824 1608  vec             gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE         20200106        20191221
1459818    48840 0 0    140515 824 1608  vec             gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE          20200106        20191221
1595322    41723 0 0    132302 856 1576  vec             clang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE  20200106        20191221
89410023   26145 0 0    118720 864 1576  ref             clang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE  20200106        20191221
89755326   26257 0 0    117912 864 1576  ref             clang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE  20200106        20191221
89758242   26567 0 0    119136 864 1576  ref             clang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE   20200106        20191221
89984826   26257 0 0    117912 864 1576  ref             clang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE   20200106        20191221
105582348  70138 0 0    163973 832 1608  ref             gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE         20200106        20191221
136964331  11730 0 0    102350 856 1576  ref             clang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE  20200106        20191221
154327032  13698 0 0    105291 824 1608  ref             gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE          20200106        20191221
160142094  13784 0 0    105619 824 1608  ref             gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE         20200106        20191221
187180164  10859 0 0    101443 816 1576  ref             gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE         20200106        20191221

Test failure

Implementation: crypto_kem/mceliece348864/vec
Compiler: gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE
error 111

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler Implementations
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE vec

Compiler output

Implementation: crypto_kem/mceliece348864/sse
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
decrypt.c: In file included from decrypt.c:8:
decrypt.c: In file included from ./fft_tr.h:9:
decrypt.c: ./vec128.h:30:9: error: always_inline function '_mm_testz_si128' requires target feature 'sse4.1', but would be inlined into function 'vec128_testz' that is compiled without support for 'sse4.1'
decrypt.c: return _mm_testz_si128(a, a);
decrypt.c: ^
decrypt.c: fatal error: error in backend: Cannot select: 0x248c5a0: i32 = X86ISD::PTEST 0x24b34f0, 0x24b34f0
decrypt.c: 0x24b34f0: v2i64,ch = CopyFromReg 0x23a7080, Register:v2i64 %vreg93
decrypt.c: 0x243e2f0: v2i64 = Register %vreg93
decrypt.c: 0x24b34f0: v2i64,ch = CopyFromReg 0x23a7080, Register:v2i64 %vreg93
decrypt.c: 0x243e2f0: v2i64 = Register %vreg93
decrypt.c: In function: decrypt
decrypt.c: clang: error: clang frontend command failed with exit code 70 (use -v to see invocation)
decrypt.c: clang version 3.8.0-2ubuntu4 (tags/RELEASE_380/final)
decrypt.c: Target: x86_64-pc-linux-gnu
decrypt.c: Thread model: posix
decrypt.c: InstalledDir: /usr/bin
decrypt.c: clang: note: diagnostic msg: PLEASE submit a bug report to http://llvm.org/bugs/ and include the crash backtrace, preprocessed source, and associated run script.
decrypt.c: clang: note: diagnostic msg:
decrypt.c: ********************
decrypt.c:
decrypt.c: PLEASE ATTACH THE FOLLOWING FILES TO THE BUG REPORT:
decrypt.c: Preprocessed source(s) and associated run script(s) are located at:
decrypt.c: clang: note: diagnostic msg: /tmp/decrypt-5d7cd4.c
decrypt.c: clang: note: diagnostic msg: /tmp/decrypt-5d7cd4.sh
decrypt.c: clang: note: diagnostic msg:
decrypt.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler Implementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE sse
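
Note on the diagnostic above: `_mm_testz_si128` is an SSE4.1 intrinsic, and clang reports that `vec128_testz` is being compiled without that target feature. The following is a minimal sketch, not taken from the mceliece348864 sources, of how such a call could be guarded with the compiler's predefined `__SSE4_1__` macro. The helper name `testz128` and its two-word argument layout are hypothetical, chosen only for illustration.

```c
#include <stdint.h>

#if defined(__SSE4_1__)
#include <smmintrin.h>   /* SSE4.1 intrinsics, including _mm_testz_si128 */
#endif

/* Hypothetical helper: returns 1 if the 128-bit value (hi:lo) is all zero,
   0 otherwise. This mirrors what _mm_testz_si128(a, a) computes. */
static int testz128(uint64_t lo, uint64_t hi)
{
#if defined(__SSE4_1__)
    /* SSE4.1 is enabled for this target: use the PTEST-based intrinsic. */
    __m128i a = _mm_set_epi64x((long long)hi, (long long)lo);
    return _mm_testz_si128(a, a);
#else
    /* No SSE4.1: plain 64-bit integer fallback with the same result. */
    return (lo | hi) == 0;
#endif
}
```

With both halves zero the helper returns 1; any set bit yields 0, on either code path.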

Compiler output

Implementation: crypto_kem/mceliece348864/avx
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
decrypt.c: In file included from decrypt.c:8:
decrypt.c: In file included from ./fft_tr.h:9:
decrypt.c: ./vec256.h:27:10: error: always_inline function '_mm256_set_epi64x' requires target feature 'xsave', but would be inlined into function 'vec256_set4x' that is compiled without support for 'xsave'
decrypt.c: return _mm256_set_epi64x(a3, a2, a1, a0);
decrypt.c: ^
decrypt.c: ./vec256.h:40:9: error: always_inline function '_mm256_and_si256' requires target feature 'xsave', but would be inlined into function 'vec256_and' that is compiled without support for 'xsave'
decrypt.c: return _mm256_and_si256(a, b);
decrypt.c: ^
decrypt.c: ./vec256.h:17:9: error: always_inline function '_mm256_set1_epi16' requires target feature 'xsave', but would be inlined into function 'vec256_set1_16b' that is compiled without support for 'xsave'
decrypt.c: return _mm256_set1_epi16(a);
decrypt.c: ^
decrypt.c: ./vec256.h:50:9: error: always_inline function '_mm256_or_si256' requires target feature 'xsave', but would be inlined into function 'vec256_or' that is compiled without support for 'xsave'
decrypt.c: return _mm256_or_si256(a, b);
decrypt.c: ^
decrypt.c: ./vec256.h:45:9: error: always_inline function '_mm256_xor_si256' requires target feature 'xsave', but would be inlined into function 'vec256_xor' that is compiled without support for 'xsave'
decrypt.c: return _mm256_xor_si256(a, b);
decrypt.c: ^
decrypt.c: In file included from decrypt.c:8:
decrypt.c: In file included from ./fft_tr.h:9:
decrypt.c: In file included from ./vec256.h:9:
decrypt.c: ./vec128.h:30:9: error: always_inline function '_mm_testz_si128' requires target feature 'sse4.1', but would be inlined into function 'vec128_testz' that is compiled without support for 'sse4.1'
decrypt.c: return _mm_testz_si128(a, a);
decrypt.c: ^
decrypt.c: decrypt.c:157:10: error: always_inline function '_mm256_extract_epi64' requires target feature 'xsave', but would be inlined into function 'reformat_256to128' that is compiled without support for 'xsave'
decrypt.c: v[0] = vec256_extract(in[i], 0);
decrypt.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
Compiler Implementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE avx
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE avx
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE avx
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE avx
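
Note on the diagnostics above: the 256-bit intrinsics they name belong to AVX (`_mm256_set_epi64x`, `_mm256_set1_epi16`, `_mm256_extract_epi64`) and AVX2 (`_mm256_and_si256`, `_mm256_or_si256`, `_mm256_xor_si256`), while `_mm_testz_si128` belongs to SSE4.1. The sketch below is not how SUPERCOP selects implementations; it only illustrates, with the compiler's predefined feature macros, which implementation directory could be expected to compile for a given target. The function name `usable_impl` is hypothetical, and the mapping rests solely on the intrinsics visible in this log, with `vec` assumed to need no such extension.

```c
/* Hypothetical helper: names the most specialised implementation whose
   intrinsics (as seen in the log) the current compile target supports. */
static const char *usable_impl(void)
{
#if defined(__AVX2__)
    return "avx";   /* 256-bit integer intrinsics are available */
#elif defined(__SSE4_1__)
    return "sse";   /* _mm_testz_si128 and _mm_extract_epi64 are available */
#else
    return "vec";   /* assumed to need neither SSE4.1 nor AVX2 */
#endif
}
```

The result depends on the flags in effect when the file is compiled, so under `-march=native` it reflects the build host's CPU.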

Compiler output

Implementation: crypto_kem/mceliece348864/sse
Compiler: clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
decrypt.c: In file included from decrypt.c:8:
decrypt.c: In file included from ./fft_tr.h:9:
decrypt.c: ./vec128.h:30:9: error: always_inline function '_mm_testz_si128' requires target feature 'sse4.1', but would be inlined into function 'vec128_testz' that is compiled without support for 'sse4.1'
decrypt.c: return _mm_testz_si128(a, a);
decrypt.c: ^
decrypt.c: fatal error: error in backend: Cannot select: 0x2354530: i32 = X86ISD::PTEST 0x23b5ee0, 0x23b5ee0
decrypt.c: 0x23b5ee0: v2i64,ch = CopyFromReg 0x22be1a0, Register:v2i64 %vreg93
decrypt.c: 0x23ab370: v2i64 = Register %vreg93
decrypt.c: 0x23b5ee0: v2i64,ch = CopyFromReg 0x22be1a0, Register:v2i64 %vreg93
decrypt.c: 0x23ab370: v2i64 = Register %vreg93
decrypt.c: In function: decrypt
decrypt.c: clang: error: clang frontend command failed with exit code 70 (use -v to see invocation)
decrypt.c: clang version 3.8.0-2ubuntu4 (tags/RELEASE_380/final)
decrypt.c: Target: x86_64-pc-linux-gnu
decrypt.c: Thread model: posix
decrypt.c: InstalledDir: /usr/bin
decrypt.c: clang: note: diagnostic msg: PLEASE submit a bug report to http://llvm.org/bugs/ and include the crash backtrace, preprocessed source, and associated run script.
decrypt.c: clang: note: diagnostic msg:
decrypt.c: ********************
decrypt.c:
decrypt.c: PLEASE ATTACH THE FOLLOWING FILES TO THE BUG REPORT:
decrypt.c: Preprocessed source(s) and associated run script(s) are located at:
decrypt.c: clang: note: diagnostic msg: /tmp/decrypt-0bacb8.c
decrypt.c: clang: note: diagnostic msg: /tmp/decrypt-0bacb8.sh
decrypt.c: clang: note: diagnostic msg:
decrypt.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler Implementations
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE sse

Compiler output

Implementation: crypto_kem/mceliece348864/sse
Compiler: clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
decrypt.c: In file included from decrypt.c:8:
decrypt.c: In file included from ./fft_tr.h:9:
decrypt.c: ./vec128.h:30:9: error: always_inline function '_mm_testz_si128' requires target feature 'sse4.1', but would be inlined into function 'vec128_testz' that is compiled without support for 'sse4.1'
decrypt.c: return _mm_testz_si128(a, a);
decrypt.c: ^
decrypt.c: fatal error: error in backend: Cannot select: 0x379f1e0: i32 = X86ISD::PTEST 0x3753440, 0x3753440
decrypt.c: 0x3753440: v2i64,ch = CopyFromReg 0x36bb0b0, Register:v2i64 %vreg93
decrypt.c: 0x37a5370: v2i64 = Register %vreg93
decrypt.c: 0x3753440: v2i64,ch = CopyFromReg 0x36bb0b0, Register:v2i64 %vreg93
decrypt.c: 0x37a5370: v2i64 = Register %vreg93
decrypt.c: In function: decrypt
decrypt.c: clang: error: clang frontend command failed with exit code 70 (use -v to see invocation)
decrypt.c: clang version 3.8.0-2ubuntu4 (tags/RELEASE_380/final)
decrypt.c: Target: x86_64-pc-linux-gnu
decrypt.c: Thread model: posix
decrypt.c: InstalledDir: /usr/bin
decrypt.c: clang: note: diagnostic msg: PLEASE submit a bug report to http://llvm.org/bugs/ and include the crash backtrace, preprocessed source, and associated run script.
decrypt.c: clang: note: diagnostic msg:
decrypt.c: ********************
decrypt.c:
decrypt.c: PLEASE ATTACH THE FOLLOWING FILES TO THE BUG REPORT:
decrypt.c: Preprocessed source(s) and associated run script(s) are located at:
decrypt.c: clang: note: diagnostic msg: /tmp/decrypt-5ccfca.c
decrypt.c: clang: note: diagnostic msg: /tmp/decrypt-5ccfca.sh
decrypt.c: clang: note: diagnostic msg:
decrypt.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler Implementations
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE sse

Compiler output

Implementation: crypto_kem/mceliece348864/sse
Compiler: clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
decrypt.c: In file included from decrypt.c:8:
decrypt.c: In file included from ./fft_tr.h:9:
decrypt.c: ./vec128.h:30:9: error: always_inline function '_mm_testz_si128' requires target feature 'sse4.1', but would be inlined into function 'vec128_testz' that is compiled without support for 'sse4.1'
decrypt.c: return _mm_testz_si128(a, a);
decrypt.c: ^
decrypt.c: fatal error: error in backend: Cannot select: 0x2c42580: i32 = X86ISD::PTEST 0x2c7c480, 0x2c7c480
decrypt.c: 0x2c7c480: v2i64,ch = CopyFromReg 0x2b8eb90, Register:v2i64 %vreg86
decrypt.c: 0x2c430d0: v2i64 = Register %vreg86
decrypt.c: 0x2c7c480: v2i64,ch = CopyFromReg 0x2b8eb90, Register:v2i64 %vreg86
decrypt.c: 0x2c430d0: v2i64 = Register %vreg86
decrypt.c: In function: decrypt
decrypt.c: clang: error: clang frontend command failed with exit code 70 (use -v to see invocation)
decrypt.c: clang version 3.8.0-2ubuntu4 (tags/RELEASE_380/final)
decrypt.c: Target: x86_64-pc-linux-gnu
decrypt.c: Thread model: posix
decrypt.c: InstalledDir: /usr/bin
decrypt.c: clang: note: diagnostic msg: PLEASE submit a bug report to http://llvm.org/bugs/ and include the crash backtrace, preprocessed source, and associated run script.
decrypt.c: clang: note: diagnostic msg:
decrypt.c: ********************
decrypt.c:
decrypt.c: PLEASE ATTACH THE FOLLOWING FILES TO THE BUG REPORT:
decrypt.c: Preprocessed source(s) and associated run script(s) are located at:
decrypt.c: clang: note: diagnostic msg: /tmp/decrypt-fba63f.c
decrypt.c: clang: note: diagnostic msg: /tmp/decrypt-fba63f.sh
decrypt.c: clang: note: diagnostic msg:
decrypt.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler Implementations
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE sse

Compiler output

Implementation: crypto_kem/mceliece348864/sse
Compiler: clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
decrypt.c: In file included from decrypt.c:8:
decrypt.c: In file included from ./fft_tr.h:9:
decrypt.c: ./vec128.h:30:9: error: always_inline function '_mm_testz_si128' requires target feature 'ssse3', but would be inlined into function 'vec128_testz' that is compiled without support for 'ssse3'
decrypt.c: return _mm_testz_si128(a, a);
decrypt.c: ^
decrypt.c: fatal error: error in backend: Cannot select: 0x2ec8f00: i32 = X86ISD::PTEST 0x2ec79a0, 0x2ec79a0
decrypt.c: 0x2ec79a0: v2i64,ch = CopyFromReg 0x2dd8f80, Register:v2i64 %vreg93
decrypt.c: 0x2efbe30: v2i64 = Register %vreg93
decrypt.c: 0x2ec79a0: v2i64,ch = CopyFromReg 0x2dd8f80, Register:v2i64 %vreg93
decrypt.c: 0x2efbe30: v2i64 = Register %vreg93
decrypt.c: In function: decrypt
decrypt.c: clang: error: clang frontend command failed with exit code 70 (use -v to see invocation)
decrypt.c: clang version 3.8.0-2ubuntu4 (tags/RELEASE_380/final)
decrypt.c: Target: x86_64-pc-linux-gnu
decrypt.c: Thread model: posix
decrypt.c: InstalledDir: /usr/bin
decrypt.c: clang: note: diagnostic msg: PLEASE submit a bug report to http://llvm.org/bugs/ and include the crash backtrace, preprocessed source, and associated run script.
decrypt.c: clang: note: diagnostic msg:
decrypt.c: ********************
decrypt.c:
decrypt.c: PLEASE ATTACH THE FOLLOWING FILES TO THE BUG REPORT:
decrypt.c: Preprocessed source(s) and associated run script(s) are located at:
decrypt.c: clang: note: diagnostic msg: /tmp/decrypt-bdf8a3.c
decrypt.c: clang: note: diagnostic msg: /tmp/decrypt-bdf8a3.sh
decrypt.c: clang: note: diagnostic msg:
decrypt.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler Implementations
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE sse

Compiler output

Implementation: crypto_kem/mceliece348864/avx
Compiler: clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
decrypt.c: In file included from decrypt.c:8:
decrypt.c: In file included from ./fft_tr.h:9:
decrypt.c: ./vec256.h:27:10: error: always_inline function '_mm256_set_epi64x' requires target feature 'sse4.2', but would be inlined into function 'vec256_set4x' that is compiled without support for 'sse4.2'
decrypt.c: return _mm256_set_epi64x(a3, a2, a1, a0);
decrypt.c: ^
decrypt.c: ./vec256.h:40:9: error: always_inline function '_mm256_and_si256' requires target feature 'sse4.2', but would be inlined into function 'vec256_and' that is compiled without support for 'sse4.2'
decrypt.c: return _mm256_and_si256(a, b);
decrypt.c: ^
decrypt.c: ./vec256.h:17:9: error: always_inline function '_mm256_set1_epi16' requires target feature 'sse4.2', but would be inlined into function 'vec256_set1_16b' that is compiled without support for 'sse4.2'
decrypt.c: return _mm256_set1_epi16(a);
decrypt.c: ^
decrypt.c: ./vec256.h:50:9: error: always_inline function '_mm256_or_si256' requires target feature 'sse4.2', but would be inlined into function 'vec256_or' that is compiled without support for 'sse4.2'
decrypt.c: return _mm256_or_si256(a, b);
decrypt.c: ^
decrypt.c: ./vec256.h:45:9: error: always_inline function '_mm256_xor_si256' requires target feature 'sse4.2', but would be inlined into function 'vec256_xor' that is compiled without support for 'sse4.2'
decrypt.c: return _mm256_xor_si256(a, b);
decrypt.c: ^
decrypt.c: In file included from decrypt.c:8:
decrypt.c: In file included from ./fft_tr.h:9:
decrypt.c: In file included from ./vec256.h:9:
decrypt.c: ./vec128.h:30:9: error: always_inline function '_mm_testz_si128' requires target feature 'ssse3', but would be inlined into function 'vec128_testz' that is compiled without support for 'ssse3'
decrypt.c: return _mm_testz_si128(a, a);
decrypt.c: ^
decrypt.c: decrypt.c:157:10: error: always_inline function '_mm256_extract_epi64' requires target feature 'sse4.2', but would be inlined into function 'reformat_256to128' that is compiled without support for 'sse4.2'
decrypt.c: v[0] = vec256_extract(in[i], 0);
decrypt.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler Implementations
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE avx

Compiler output

Implementation: crypto_kem/mceliece348864/avx
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE
benes.c: In file included from transpose.h:8:0,
benes.c: from benes.c:6:
benes.c: vec256.h: In function 'vec256_set1_16b':
benes.c: vec256.h:16:1: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
benes.c: {
benes.c: ^
bm.c: In file included from vec128.h:12:0,
bm.c: from bm.h:9,
bm.c: from bm.c:6:
bm.c: bm.c: In function 'get_coefs':
bm.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/smmintrin.h:454:1: error: inlining failed in call to always_inline '_mm_extract_epi64': target specific option mismatch
bm.c: _mm_extract_epi64 (__m128i __X, const int __N)
bm.c: ^
bm.c: In file included from bm.h:9:0,
bm.c: from bm.c:6:
bm.c: vec128.h:26:42: error: called from here
bm.c: #define vec128_extract(a, i) ((uint64_t) _mm_extract_epi64((vec128) (a), (i)))
bm.c: ^
bm.c: bm.c:126:30: note: in expansion of macro 'vec128_extract'
bm.c: out[ (4*j + k)*16 + i ] = (vec128_extract(buf[i], j) >> (k*16)) & GFMASK;
bm.c: ^
bm.c: In file included from vec128.h:12:0,
bm.c: from bm.h:9,
bm.c: from bm.c:6:
bm.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/smmintrin.h:454:1: error: inlining failed in call to always_inline '_mm_extract_epi64': target specific option mismatch
bm.c: _mm_extract_epi64 (__m128i __X, const int __N)
bm.c: ^
bm.c: In file included from bm.h:9:0,
bm.c: from bm.c:6:
bm.c: vec128.h:26:42: error: called from here
bm.c: #define vec128_extract(a, i) ((uint64_t) _mm_extract_epi64((vec128) (a), (i)))
bm.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
Compiler Implementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE avx
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE avx
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE avx
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE avx

Compiler output

Implementation: crypto_kem/mceliece348864/sse
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE
bm.c: In file included from vec128.h:12:0,
bm.c: from bm.h:12,
bm.c: from bm.c:6:
bm.c: bm.c: In function 'get_coefs':
bm.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/smmintrin.h:454:1: error: inlining failed in call to always_inline '_mm_extract_epi64': target specific option mismatch
bm.c: _mm_extract_epi64 (__m128i __X, const int __N)
bm.c: ^
bm.c: In file included from bm.h:12:0,
bm.c: from bm.c:6:
bm.c: vec128.h:26:42: error: called from here
bm.c: #define vec128_extract(a, i) ((uint64_t) _mm_extract_epi64((vec128) (a), (i)))
bm.c: ^
bm.c: bm.c:152:30: note: in expansion of macro 'vec128_extract'
bm.c: out[ (4*j + k)*16 + i ] = (vec128_extract(buf[i], j) >> (k*16)) & GFMASK;
bm.c: ^
bm.c: In file included from vec128.h:12:0,
bm.c: from bm.h:12,
bm.c: from bm.c:6:
bm.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/smmintrin.h:454:1: error: inlining failed in call to always_inline '_mm_extract_epi64': target specific option mismatch
bm.c: _mm_extract_epi64 (__m128i __X, const int __N)
bm.c: ^
bm.c: In file included from bm.h:12:0,
bm.c: from bm.c:6:
bm.c: vec128.h:26:42: error: called from here
bm.c: #define vec128_extract(a, i) ((uint64_t) _mm_extract_epi64((vec128) (a), (i)))
bm.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
Compiler Implementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE sse
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE sse
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE sse
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE sse
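
Note on the diagnostics above: GCC declares `_mm_extract_epi64` in smmintrin.h, the SSE4.1 header, and reports a "target specific option mismatch" because the function is being inlined into code built without that feature. The following is a minimal sketch, not taken from the mceliece348864 sources, of reading a 64-bit lane without any SSE4.1 intrinsic. The names `v128`, `v128_set2x` and `v128_extract` are hypothetical; the lane order assumes a little-endian target such as amd64.

```c
#include <stdint.h>
#include <string.h>

#if defined(__SSE2__)
#include <emmintrin.h>   /* SSE2 only: no SSE4.1 required */
typedef __m128i v128;

/* Build a 128-bit vector from its low and high 64-bit lanes. */
static v128 v128_set2x(uint64_t lo, uint64_t hi)
{
    return _mm_set_epi64x((long long)hi, (long long)lo);
}
#else
/* Plain C stand-in for targets without SSE2. */
typedef struct { uint64_t w[2]; } v128;

static v128 v128_set2x(uint64_t lo, uint64_t hi)
{
    v128 r;
    r.w[0] = lo;
    r.w[1] = hi;
    return r;
}
#endif

/* Return lane i (0 = low, 1 = high) by copying the vector to memory.
   Assumes little-endian lane layout when v128 is __m128i. */
static uint64_t v128_extract(v128 a, int i)
{
    uint64_t w[2];
    memcpy(w, &a, sizeof w);
    return w[i & 1];
}
```

A compiler is free to turn the `memcpy` into register moves, so the sketch trades the single-instruction extract for portability rather than necessarily for a store and reload.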