Implementation notes: amd64, hydra1, crypto_aead/twine80n6t4clocv3

Computer: hydra1
Architecture: amd64
CPU ID: AuthenticAMD-00100fa0-178bfbff
SUPERCOP version: 20161220
Operation: crypto_aead
Primitive: twine80n6t4clocv3
TimeImplementationCompilerBenchmark dateSUPERCOP version
1766629refgcc -funroll-loops -O2 -fomit-frame-pointer2016122820161220
1777197refgcc -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer2016122820161220
1782448refgcc -funroll-loops -m64 -O2 -fomit-frame-pointer2016122820161220
1784664refgcc -march=k8 -O2 -fomit-frame-pointer2016122820161220
1785719refgcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv2016122820161220
1787824refgcc -m64 -march=k8 -O2 -fomit-frame-pointer2016122820161220
1802203refgcc -funroll-loops -march=barcelona -O2 -fomit-frame-pointer2016122820161220
1810509refgcc -m64 -march=barcelona -O2 -fomit-frame-pointer2016122820161220
1821237refgcc -funroll-loops -m64 -march=nocona -O2 -fomit-frame-pointer2016122820161220
1821243refgcc -funroll-loops -march=nocona -O2 -fomit-frame-pointer2016122820161220
1824253refgcc -march=barcelona -O2 -fomit-frame-pointer2016122820161220
1825704refgcc -funroll-loops -march=k8 -O2 -fomit-frame-pointer2016122820161220
1835472refgcc -funroll-loops -m64 -march=k8 -O2 -fomit-frame-pointer2016122820161220
1838909refgcc -m64 -march=native -mtune=native -O2 -fomit-frame-pointer2016122820161220
1841256refgcc -funroll-loops -m64 -march=barcelona -O2 -fomit-frame-pointer2016122820161220
1877474refgcc -march=nocona -O2 -fomit-frame-pointer2016122820161220
1905354refgcc -m64 -march=core2 -msse4.1 -O2 -fomit-frame-pointer2016122820161220
1905973refgcc -m64 -march=core2 -msse4 -O2 -fomit-frame-pointer2016122820161220
1906258refgcc -m64 -march=core2 -O2 -fomit-frame-pointer2016122820161220
1910408refgcc -m64 -march=nocona -O2 -fomit-frame-pointer2016122820161220
1939541refgcc -O2 -fomit-frame-pointer2016122820161220
1947682refgcc -m64 -O2 -fomit-frame-pointer2016122820161220
1952935refgcc -fno-schedule-insns -O2 -fomit-frame-pointer2016122820161220
1965708refgcc -m64 -march=corei7 -O2 -fomit-frame-pointer2016122820161220
1994120refgcc -funroll-loops -m64 -march=barcelona -O3 -fomit-frame-pointer2016122820161220
1996947refgcc -funroll-loops -march=k8 -O3 -fomit-frame-pointer2016122820161220
2013643refgcc -funroll-loops -march=barcelona -O3 -fomit-frame-pointer2016122820161220
2020314refgcc -funroll-loops -m64 -march=k8 -O3 -fomit-frame-pointer2016122820161220
2041907refgcc -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer2016122820161220
2063125refgcc -funroll-loops -O3 -fomit-frame-pointer2016122820161220
2115002refgcc -funroll-loops -m64 -O3 -fomit-frame-pointer2016122820161220
2196077refgcc -m64 -march=k8 -O3 -fomit-frame-pointer2016122820161220
2203159refgcc -m64 -march=barcelona -O3 -fomit-frame-pointer2016122820161220
2204463refgcc -funroll-loops -march=nocona -O3 -fomit-frame-pointer2016122820161220
2214247refgcc -fno-schedule-insns -O3 -fomit-frame-pointer2016122820161220
2216104refgcc -march=barcelona -O3 -fomit-frame-pointer2016122820161220
2218432refgcc -O3 -fomit-frame-pointer2016122820161220
2220929refgcc -m64 -march=native -mtune=native -O3 -fomit-frame-pointer2016122820161220
2229019refgcc -m64 -O3 -fomit-frame-pointer2016122820161220
2281386refgcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv2016122820161220
2284563refgcc -funroll-loops -m64 -march=nocona -O3 -fomit-frame-pointer2016122820161220
2303488refgcc -march=k8 -O3 -fomit-frame-pointer2016122820161220
2326104refgcc -m64 -march=nocona -O3 -fomit-frame-pointer2016122820161220
2417807refgcc -march=nocona -O3 -fomit-frame-pointer2016122820161220
2432650refgcc -march=barcelona -Os -fomit-frame-pointer2016122820161220
2434800refgcc -m64 -Os -fomit-frame-pointer2016122820161220
2434832refgcc -m64 -march=native -mtune=native -Os -fomit-frame-pointer2016122820161220
2438021refgcc -march=k8 -Os -fomit-frame-pointer2016122820161220
2442991refgcc -Os -fomit-frame-pointer2016122820161220
2445323refgcc -fno-schedule-insns -Os -fomit-frame-pointer2016122820161220
2445333refgcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv2016122820161220
2450133refgcc -m64 -march=k8 -Os -fomit-frame-pointer2016122820161220
2465191refgcc -m64 -march=barcelona -Os -fomit-frame-pointer2016122820161220
2545182refgcc -m64 -march=core2 -msse4 -Os -fomit-frame-pointer2016122820161220
2556703refgcc -m64 -march=corei7 -Os -fomit-frame-pointer2016122820161220
2560968refgcc -m64 -march=core2 -msse4.1 -Os -fomit-frame-pointer2016122820161220
2583898refgcc -m64 -march=core2 -Os -fomit-frame-pointer2016122820161220
2610568refclang -mcpu=native -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments2016122820161220
2610575refclang -mcpu=cortex-a9 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments2016122820161220
2618747refclang -O3 -fomit-frame-pointer -Qunused-arguments2016122820161220
2628064refclang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments2016122820161220
2628464refclang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments2016122820161220
2722765refclang -mcpu=cortex-a8 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments2016122820161220
2763902refgcc -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer2016122820161220
2781595refgcc -funroll-loops -m64 -march=k8 -Os -fomit-frame-pointer2016122820161220
2781619refgcc -funroll-loops -march=k8 -Os -fomit-frame-pointer2016122820161220
2784293refgcc -funroll-loops -march=barcelona -Os -fomit-frame-pointer2016122820161220
2821275refgcc -funroll-loops -m64 -Os -fomit-frame-pointer2016122820161220
2823181refgcc -funroll-loops -m64 -march=barcelona -Os -fomit-frame-pointer2016122820161220
2864239refgcc -funroll-loops -Os -fomit-frame-pointer2016122820161220
3077934refgcc -m64 -march=nocona -Os -fomit-frame-pointer2016122820161220
3142011refgcc -march=nocona -Os -fomit-frame-pointer2016122820161220
3184643refgcc -funroll-loops -m64 -march=nocona -Os -fomit-frame-pointer2016122820161220
3184655refgcc -funroll-loops -march=nocona -Os -fomit-frame-pointer2016122820161220
3214776refgcc -funroll-loops -m64 -O -fomit-frame-pointer2016122820161220
3214850refgcc -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer2016122820161220
3216891refgcc -funroll-loops -m64 -march=barcelona -O -fomit-frame-pointer2016122820161220
3222579refgcc -march=k8 -O -fomit-frame-pointer2016122820161220
3225806refgcc -m64 -march=core2 -O -fomit-frame-pointer2016122820161220
3242395refgcc -O -fomit-frame-pointer2016122820161220
3243242refgcc -m64 -march=corei7 -O -fomit-frame-pointer2016122820161220
3243805refgcc -m64 -march=core2 -msse4 -O -fomit-frame-pointer2016122820161220
3248087refgcc -march=nocona -O -fomit-frame-pointer2016122820161220
3248293refgcc -m64 -O -fomit-frame-pointer2016122820161220
3250380refgcc -m64 -march=core2 -msse4.1 -O -fomit-frame-pointer2016122820161220
3284352refgcc -m64 -march=k8 -O -fomit-frame-pointer2016122820161220
3294490refgcc -funroll-loops -O -fomit-frame-pointer2016122820161220
3303455refgcc -fno-schedule-insns -O -fomit-frame-pointer2016122820161220
3304228refgcc -funroll-loops -m64 -march=nocona -O -fomit-frame-pointer2016122820161220
3352905refgcc -funroll-loops -march=nocona -O -fomit-frame-pointer2016122820161220
3354718refgcc -m64 -march=nocona -O -fomit-frame-pointer2016122820161220
3372565refgcc -march=barcelona -O -fomit-frame-pointer2016122820161220
3372971refgcc -funroll-loops -march=barcelona -O -fomit-frame-pointer2016122820161220
3377455refgcc -m64 -march=barcelona -O -fomit-frame-pointer2016122820161220
3421552refgcc -funroll-loops -m64 -march=k8 -O -fomit-frame-pointer2016122820161220
3424978refgcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv2016122820161220
3425352refgcc -m64 -march=native -mtune=native -O -fomit-frame-pointer2016122820161220
3453864refgcc -funroll-loops -march=k8 -O -fomit-frame-pointer2016122820161220
4719864refgcc -funroll-loops2016122820161220
4746187refcc2016122820161220
4746238refgcc2016122820161220

Test failure

Implementation: crypto_aead/twine80n6t4clocv3/vperm
Compiler: gcc -m64 -march=core2 -O2 -fomit-frame-pointer
error 111

Number of similar (compiler,implementation) pairs: 14, namely:
CompilerImplementations
gcc -m64 -march=core2 -O2 -fomit-frame-pointer vperm
gcc -m64 -march=core2 -O3 -fomit-frame-pointer ref vperm
gcc -m64 -march=core2 -O -fomit-frame-pointer vperm
gcc -m64 -march=core2 -Os -fomit-frame-pointer vperm
gcc -m64 -march=core2 -msse4.1 -O2 -fomit-frame-pointer vperm
gcc -m64 -march=core2 -msse4.1 -O -fomit-frame-pointer vperm
gcc -m64 -march=core2 -msse4.1 -Os -fomit-frame-pointer vperm
gcc -m64 -march=core2 -msse4 -O2 -fomit-frame-pointer vperm
gcc -m64 -march=core2 -msse4 -O -fomit-frame-pointer vperm
gcc -m64 -march=core2 -msse4 -Os -fomit-frame-pointer vperm
gcc -m64 -march=corei7 -O2 -fomit-frame-pointer vperm
gcc -m64 -march=corei7 -O -fomit-frame-pointer vperm
gcc -m64 -march=corei7 -Os -fomit-frame-pointer vperm

Compiler output

Implementation: crypto_aead/twine80n6t4clocv3/vperm
Compiler: cc
cloc.c: In file included from common.h:7:0,
cloc.c: from cloc.h:2,
cloc.c: from cloc.c:5:
cloc.c: twine.h: In function 'TWINE80_enc':
cloc.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/tmmintrin.h:136:1: error: inlining failed in call to always_inline '_mm_shuffle_epi8': target specific option mismatch
cloc.c: _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
cloc.c: ^
cloc.c: In file included from cloc.c:7:0:
cloc.c: twine.h:69:9: error: called from here
cloc.c: right = PSHUFB(right, tmp);
cloc.c: ^
cloc.c: In file included from common.h:7:0,
cloc.c: from cloc.h:2,
cloc.c: from cloc.c:5:
cloc.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/tmmintrin.h:136:1: error: inlining failed in call to always_inline '_mm_shuffle_epi8': target specific option mismatch
cloc.c: _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
cloc.c: ^
cloc.c: In file included from cloc.c:7:0:
cloc.c: twine.h:66:7: error: called from here
cloc.c: tmp = PSHUFB(lsbox, tmp);
cloc.c: ^
cloc.c: In file included from common.h:7:0,
cloc.c: from cloc.h:2,
cloc.c: from cloc.c:5:
cloc.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/tmmintrin.h:136:1: error: inlining failed in call to always_inline '_mm_shuffle_epi8': target specific option mismatch
cloc.c: ...

Number of similar (compiler,implementation) pairs: 79, namely:
CompilerImplementations
cc vperm
gcc vperm
gcc -O2 -fomit-frame-pointer vperm
gcc -O3 -fomit-frame-pointer vperm
gcc -O -fomit-frame-pointer vperm
gcc -Os -fomit-frame-pointer vperm
gcc -fno-schedule-insns -O2 -fomit-frame-pointer vperm
gcc -fno-schedule-insns -O3 -fomit-frame-pointer vperm
gcc -fno-schedule-insns -O -fomit-frame-pointer vperm
gcc -fno-schedule-insns -Os -fomit-frame-pointer vperm
gcc -funroll-loops vperm
gcc -funroll-loops -O2 -fomit-frame-pointer vperm
gcc -funroll-loops -O3 -fomit-frame-pointer vperm
gcc -funroll-loops -O -fomit-frame-pointer vperm
gcc -funroll-loops -Os -fomit-frame-pointer vperm
gcc -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer vperm
gcc -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer vperm
gcc -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer vperm
gcc -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -O2 -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -O3 -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -O -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -Os -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -march=barcelona -O2 -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -march=barcelona -O3 -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -march=barcelona -O -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -march=barcelona -Os -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -march=k8 -O2 -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -march=k8 -O3 -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -march=k8 -O -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -march=k8 -Os -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -march=nocona -O2 -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -march=nocona -O3 -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -march=nocona -O -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -march=nocona -Os -fomit-frame-pointer vperm
gcc -funroll-loops -march=barcelona -O2 -fomit-frame-pointer vperm
gcc -funroll-loops -march=barcelona -O3 -fomit-frame-pointer vperm
gcc -funroll-loops -march=barcelona -O -fomit-frame-pointer vperm
gcc -funroll-loops -march=barcelona -Os -fomit-frame-pointer vperm
gcc -funroll-loops -march=k8 -O2 -fomit-frame-pointer vperm
gcc -funroll-loops -march=k8 -O3 -fomit-frame-pointer vperm
gcc -funroll-loops -march=k8 -O -fomit-frame-pointer vperm
gcc -funroll-loops -march=k8 -Os -fomit-frame-pointer vperm
gcc -funroll-loops -march=nocona -O2 -fomit-frame-pointer vperm
gcc -funroll-loops -march=nocona -O3 -fomit-frame-pointer vperm
gcc -funroll-loops -march=nocona -O -fomit-frame-pointer vperm
gcc -funroll-loops -march=nocona -Os -fomit-frame-pointer vperm
gcc -m64 -O2 -fomit-frame-pointer vperm
gcc -m64 -O3 -fomit-frame-pointer vperm
gcc -m64 -O -fomit-frame-pointer vperm
gcc -m64 -Os -fomit-frame-pointer vperm
gcc -m64 -march=k8 -O2 -fomit-frame-pointer vperm
gcc -m64 -march=k8 -O3 -fomit-frame-pointer vperm
gcc -m64 -march=k8 -O -fomit-frame-pointer vperm
gcc -m64 -march=k8 -Os -fomit-frame-pointer vperm
gcc -m64 -march=native -mtune=native -O2 -fomit-frame-pointer vperm
gcc -m64 -march=native -mtune=native -O3 -fomit-frame-pointer vperm
gcc -m64 -march=native -mtune=native -O -fomit-frame-pointer vperm
gcc -m64 -march=native -mtune=native -Os -fomit-frame-pointer vperm
gcc -m64 -march=nocona -O2 -fomit-frame-pointer vperm
gcc -m64 -march=nocona -O3 -fomit-frame-pointer vperm
gcc -m64 -march=nocona -O -fomit-frame-pointer vperm
gcc -m64 -march=nocona -Os -fomit-frame-pointer vperm
gcc -march=barcelona -O2 -fomit-frame-pointer vperm
gcc -march=barcelona -O3 -fomit-frame-pointer vperm
gcc -march=barcelona -O -fomit-frame-pointer vperm
gcc -march=barcelona -Os -fomit-frame-pointer vperm
gcc -march=k8 -O2 -fomit-frame-pointer vperm
gcc -march=k8 -O3 -fomit-frame-pointer vperm
gcc -march=k8 -O -fomit-frame-pointer vperm
gcc -march=k8 -Os -fomit-frame-pointer vperm
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv vperm
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv vperm
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv vperm
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv vperm
gcc -march=nocona -O2 -fomit-frame-pointer vperm
gcc -march=nocona -O3 -fomit-frame-pointer vperm
gcc -march=nocona -O -fomit-frame-pointer vperm
gcc -march=nocona -Os -fomit-frame-pointer vperm

Compiler output

Implementation: crypto_aead/twine80n6t4clocv3/vperm
Compiler: clang -O3 -fomit-frame-pointer -Qunused-arguments
cloc.c: In file included from cloc.c:7:
cloc.c: ./twine.h:48:9: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'TWINE80_enc' that is compiled without support for 'ssse3'
cloc.c: tmp = PSHUFB(lsbox, tmp);
cloc.c: ^
cloc.c: ./common.h:42:22: note: expanded from macro 'PSHUFB'
cloc.c: #define PSHUFB(s, x) _mm_shuffle_epi8((s), (x)) /*return s(x)*/
cloc.c: ^
cloc.c: In file included from cloc.c:7:
cloc.c: ./twine.h:51:10: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'TWINE80_enc' that is compiled without support for 'ssse3'
cloc.c: left = PSHUFB(left, tmp);
cloc.c: ^
cloc.c: ./common.h:42:22: note: expanded from macro 'PSHUFB'
cloc.c: #define PSHUFB(s, x) _mm_shuffle_epi8((s), (x)) /*return s(x)*/
cloc.c: ^
cloc.c: In file included from cloc.c:7:
cloc.c: ./twine.h:54:9: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'TWINE80_enc' that is compiled without support for 'ssse3'
cloc.c: tmp = PSHUFB(lsbox, tmp);
cloc.c: ^
cloc.c: ./common.h:42:22: note: expanded from macro 'PSHUFB'
cloc.c: #define PSHUFB(s, x) _mm_shuffle_epi8((s), (x)) /*return s(x)*/
cloc.c: ^
cloc.c: In file included from cloc.c:7:
cloc.c: ./twine.h:57:11: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'TWINE80_enc' that is compiled without support for 'ssse3'
cloc.c: right = PSHUFB(right, tmp);
cloc.c: ^
cloc.c: ...

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
clang -O3 -fomit-frame-pointer -Qunused-arguments vperm
clang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments vperm
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments vperm
clang -mcpu=cortex-a8 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments vperm
clang -mcpu=cortex-a9 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments vperm
clang -mcpu=native -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments vperm

Compiler output

Implementation: crypto_aead/twine80n6t4clocv3/vperm
Compiler: gcc -m64 -march=barcelona -O2 -fomit-frame-pointer
cloc.c: In file included from common.h:7:0,
cloc.c: from cloc.h:2,
cloc.c: from cloc.c:5:
cloc.c: twine.h: In function 'TWINE80_enc':
cloc.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/tmmintrin.h:136:1: error: inlining failed in call to always_inline '_mm_shuffle_epi8': target specific option mismatch
cloc.c: _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
cloc.c: ^
cloc.c: In file included from cloc.c:7:0:
cloc.c: twine.h:69:9: error: called from here
cloc.c: right = PSHUFB(right, tmp);
cloc.c: ^
cloc.c: In file included from common.h:7:0,
cloc.c: from cloc.h:2,
cloc.c: from cloc.c:5:
cloc.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/tmmintrin.h:136:1: error: inlining failed in call to always_inline '_mm_shuffle_epi8': target specific option mismatch
cloc.c: _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
cloc.c: ^
cloc.c: In file included from cloc.c:7:0:
cloc.c: twine.h:66:7: error: called from here
cloc.c: tmp = PSHUFB(lsbox, tmp);
cloc.c: ^
cloc.c: In file included from common.h:7:0,
cloc.c: from cloc.h:2,
cloc.c: from cloc.c:5:
cloc.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/tmmintrin.h:136:1: error: inlining failed in call to always_inline '_mm_shuffle_epi8': target specific option mismatch
cloc.c: ...
cloc.c: In file included from common.h:7:0,
cloc.c: from cloc.h:2,
cloc.c: from cloc.c:5:
cloc.c: twine.h: In function 'TWINE80_enc':
cloc.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/tmmintrin.h:136:1: error: inlining failed in call to always_inline '_mm_shuffle_epi8': target specific option mismatch
cloc.c: _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
cloc.c: ^
cloc.c: In file included from cloc.c:7:0:
cloc.c: twine.h:69:9: error: called from here
cloc.c: right = PSHUFB(right, tmp);
cloc.c: ^
cloc.c: In file included from common.h:7:0,
cloc.c: from cloc.h:2,
cloc.c: from cloc.c:5:
cloc.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/tmmintrin.h:136:1: error: inlining failed in call to always_inline '_mm_shuffle_epi8': target specific option mismatch
cloc.c: _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
cloc.c: ^
cloc.c: In file included from cloc.c:7:0:
cloc.c: twine.h:66:7: error: called from here
cloc.c: tmp = PSHUFB(lsbox, tmp);
cloc.c: ^
cloc.c: In file included from common.h:7:0,
cloc.c: from cloc.h:2,
cloc.c: from cloc.c:5:
cloc.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/tmmintrin.h:136:1: error: inlining failed in call to always_inline '_mm_shuffle_epi8': target specific option mismatch
cloc.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -m64 -march=barcelona -O2 -fomit-frame-pointer vperm
gcc -m64 -march=barcelona -O3 -fomit-frame-pointer vperm
gcc -m64 -march=barcelona -O -fomit-frame-pointer vperm
gcc -m64 -march=barcelona -Os -fomit-frame-pointer vperm