Implementation notes: amd64, hydra1, crypto_aead/led80n6t4silcv3

Computer: hydra1
Architecture: amd64
CPU ID: AuthenticAMD-00100fa0-178bfbff
SUPERCOP version: 20161220
Operation: crypto_aead
Primitive: led80n6t4silcv3
Time  Implementation  Compiler  Benchmark date  SUPERCOP version
55993136  ref  gcc -funroll-loops -m64 -O2 -fomit-frame-pointer  20161227  20161220
55995122  ref  gcc -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer  20161227  20161220
56049935  ref  gcc -funroll-loops -march=k8 -O2 -fomit-frame-pointer  20161227  20161220
56098702  ref  gcc -funroll-loops -m64 -march=barcelona -O2 -fomit-frame-pointer  20161227  20161220
56099994  ref  gcc -funroll-loops -march=barcelona -O2 -fomit-frame-pointer  20161227  20161220
56139536  ref  gcc -funroll-loops -O2 -fomit-frame-pointer  20161227  20161220
56176427  ref  gcc -funroll-loops -m64 -march=k8 -O2 -fomit-frame-pointer  20161227  20161220
63537376  ref  gcc -funroll-loops -m64 -march=nocona -O2 -fomit-frame-pointer  20161227  20161220
63537684  ref  gcc -funroll-loops -march=nocona -O2 -fomit-frame-pointer  20161227  20161220
64014299  ref  gcc -funroll-loops -m64 -march=k8 -O3 -fomit-frame-pointer  20161227  20161220
64119512  ref  gcc -funroll-loops -O3 -fomit-frame-pointer  20161227  20161220
64167198  ref  gcc -funroll-loops -march=k8 -O3 -fomit-frame-pointer  20161227  20161220
64174359  ref  gcc -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer  20161227  20161220
64234380  ref  gcc -funroll-loops -m64 -O3 -fomit-frame-pointer  20161227  20161220
64588851  ref  gcc -march=k8 -O3 -fomit-frame-pointer  20161227  20161220
64600587  ref  gcc -funroll-loops -march=barcelona -O3 -fomit-frame-pointer  20161227  20161220
64603699  ref  gcc -m64 -march=k8 -O3 -fomit-frame-pointer  20161227  20161220
64612778  ref  gcc -funroll-loops -m64 -march=barcelona -O3 -fomit-frame-pointer  20161227  20161220
64623453  ref  gcc -m64 -O3 -fomit-frame-pointer  20161227  20161220
64703186  ref  gcc -m64 -march=native -mtune=native -O3 -fomit-frame-pointer  20161227  20161220
64713406  ref  gcc -m64 -march=barcelona -O3 -fomit-frame-pointer  20161227  20161220
64731852  ref  gcc -O3 -fomit-frame-pointer  20161227  20161220
64778462  ref  gcc -fno-schedule-insns -O3 -fomit-frame-pointer  20161227  20161220
65047936  ref  gcc -march=barcelona -O3 -fomit-frame-pointer  20161227  20161220
65127568  ref  gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv  20161227  20161220
65899922  ref  gcc -funroll-loops -march=nocona -O3 -fomit-frame-pointer  20161227  20161220
66340868  ref  gcc -funroll-loops -m64 -march=nocona -O3 -fomit-frame-pointer  20161227  20161220
67849181  ref  gcc -m64 -march=nocona -O3 -fomit-frame-pointer  20161227  20161220
67959316  ref  gcc -march=nocona -O3 -fomit-frame-pointer  20161227  20161220
74327267  ref  gcc -funroll-loops -O -fomit-frame-pointer  20161227  20161220
74382129  ref  gcc -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer  20161227  20161220
75082643  ref  gcc -funroll-loops -m64 -O -fomit-frame-pointer  20161227  20161220
77149014  ref  gcc -funroll-loops -march=nocona -O -fomit-frame-pointer  20161227  20161220
77205751  ref  gcc -funroll-loops -m64 -march=nocona -O -fomit-frame-pointer  20161227  20161220
77970077  ref  gcc -funroll-loops -march=barcelona -O -fomit-frame-pointer  20161227  20161220
78661318  ref  gcc -funroll-loops -m64 -march=barcelona -O -fomit-frame-pointer  20161227  20161220
79212014  ref  gcc -funroll-loops -m64 -march=k8 -O -fomit-frame-pointer  20161227  20161220
79236920  ref  gcc -funroll-loops -march=k8 -O -fomit-frame-pointer  20161227  20161220
122805583  ref  gcc -m64 -march=nocona -O2 -fomit-frame-pointer  20161227  20161220
123935984  ref  gcc -march=nocona -O2 -fomit-frame-pointer  20161227  20161220
124022561  ref  gcc -march=k8 -O2 -fomit-frame-pointer  20161227  20161220
124254880  ref  gcc -m64 -O2 -fomit-frame-pointer  20161227  20161220
124326148  ref  gcc -fno-schedule-insns -O2 -fomit-frame-pointer  20161227  20161220
124762365  ref  gcc -m64 -march=k8 -O2 -fomit-frame-pointer  20161227  20161220
124899008  ref  gcc -O2 -fomit-frame-pointer  20161227  20161220
130858124  ref  gcc -m64 -march=core2 -msse4.1 -Os -fomit-frame-pointer  20161227  20161220
131114861  ref  gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv  20161227  20161220
131960087  ref  gcc -m64 -march=core2 -O2 -fomit-frame-pointer  20161227  20161220
132107003  ref  gcc -m64 -march=native -mtune=native -O2 -fomit-frame-pointer  20161227  20161220
132148485  ref  gcc -m64 -march=corei7 -O2 -fomit-frame-pointer  20161227  20161220
132269409  ref  gcc -m64 -march=core2 -msse4.1 -O2 -fomit-frame-pointer  20161227  20161220
132315029  ref  gcc -Os -fomit-frame-pointer  20161227  20161220
132603001  ref  gcc -m64 -march=core2 -Os -fomit-frame-pointer  20161227  20161220
132631701  ref  gcc -m64 -Os -fomit-frame-pointer  20161227  20161220
132771690  ref  gcc -march=nocona -Os -fomit-frame-pointer  20161227  20161220
132858096  ref  gcc -m64 -march=core2 -msse4 -O2 -fomit-frame-pointer  20161227  20161220
132926035  ref  gcc -m64 -march=native -mtune=native -Os -fomit-frame-pointer  20161227  20161220
133022422  ref  gcc -m64 -march=barcelona -O2 -fomit-frame-pointer  20161227  20161220
133218712  ref  gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv  20161227  20161220
133241246  ref  gcc -march=barcelona -O2 -fomit-frame-pointer  20161227  20161220
133258553  ref  gcc -m64 -march=nocona -Os -fomit-frame-pointer  20161227  20161220
133396592  ref  gcc -march=k8 -Os -fomit-frame-pointer  20161227  20161220
133422257  ref  gcc -m64 -march=k8 -Os -fomit-frame-pointer  20161227  20161220
133944971  ref  gcc -m64 -march=barcelona -Os -fomit-frame-pointer  20161227  20161220
134534043  ref  gcc -fno-schedule-insns -Os -fomit-frame-pointer  20161227  20161220
136059072  ref  gcc -m64 -march=corei7 -Os -fomit-frame-pointer  20161227  20161220
138873680  ref  gcc -m64 -march=core2 -msse4.1 -O -fomit-frame-pointer  20161227  20161220
138889464  ref  gcc -m64 -march=core2 -msse4 -O -fomit-frame-pointer  20161227  20161220
138992995  ref  gcc -m64 -march=core2 -O -fomit-frame-pointer  20161227  20161220
144432381  ref  gcc -funroll-loops -m64 -march=barcelona -Os -fomit-frame-pointer  20161227  20161220
144564101  ref  gcc -funroll-loops -m64 -march=nocona -Os -fomit-frame-pointer  20161227  20161220
144567776  ref  gcc -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer  20161227  20161220
144572456  ref  gcc -funroll-loops -march=nocona -Os -fomit-frame-pointer  20161227  20161220
144730603  ref  gcc -funroll-loops -m64 -Os -fomit-frame-pointer  20161227  20161220
145178016  ref  gcc -funroll-loops -march=barcelona -Os -fomit-frame-pointer  20161227  20161220
145326464  ref  gcc -funroll-loops -m64 -march=k8 -Os -fomit-frame-pointer  20161227  20161220
145589712  ref  gcc -funroll-loops -march=k8 -Os -fomit-frame-pointer  20161227  20161220
145658443  ref  gcc -funroll-loops -Os -fomit-frame-pointer  20161227  20161220
147640958  ref  gcc -m64 -march=k8 -O -fomit-frame-pointer  20161227  20161220
147684775  ref  gcc -march=k8 -O -fomit-frame-pointer  20161227  20161220
148228665  ref  gcc -m64 -march=native -mtune=native -O -fomit-frame-pointer  20161227  20161220
148232032  ref  gcc -m64 -march=barcelona -O -fomit-frame-pointer  20161227  20161220
148240513  ref  gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv  20161227  20161220
148995781  ref  gcc -march=barcelona -O -fomit-frame-pointer  20161227  20161220
150097144  ref  gcc -march=nocona -O -fomit-frame-pointer  20161227  20161220
150693878  ref  gcc -m64 -march=nocona -O -fomit-frame-pointer  20161227  20161220
154741685  ref  gcc -fno-schedule-insns -O -fomit-frame-pointer  20161227  20161220
154902450  ref  gcc -m64 -O -fomit-frame-pointer  20161227  20161220
155386163  ref  gcc -O -fomit-frame-pointer  20161227  20161220
158363958  ref  gcc -march=barcelona -Os -fomit-frame-pointer  20161227  20161220
158388708  ref  gcc -m64 -march=core2 -msse4 -Os -fomit-frame-pointer  20161227  20161220
166152392  ref  gcc -m64 -march=corei7 -O -fomit-frame-pointer  20161227  20161220
203078227  ref  clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments  20161227  20161220
204516481  ref  clang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments  20161227  20161220
208171685  ref  clang -mcpu=cortex-a8 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments  20161227  20161220
208176050  ref  clang -mcpu=cortex-a9 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments  20161227  20161220
208185902  ref  clang -O3 -fomit-frame-pointer -Qunused-arguments  20161227  20161220
208224880  ref  clang -mcpu=native -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments  20161227  20161220
622363959  ref  gcc  20161227  20161220
622371821  ref  cc  20161227  20161220
622596667  ref  gcc -funroll-loops  20161227  20161220

Test failure

Implementation: crypto_aead/led80n6t4silcv3/vperm
Compiler: gcc -m64 -march=core2 -O2 -fomit-frame-pointer
error 111

Number of similar (compiler,implementation) pairs: 14, namely:
Compiler  Implementations
gcc -m64 -march=core2 -O2 -fomit-frame-pointer vperm
gcc -m64 -march=core2 -O3 -fomit-frame-pointer ref vperm
gcc -m64 -march=core2 -O -fomit-frame-pointer vperm
gcc -m64 -march=core2 -Os -fomit-frame-pointer vperm
gcc -m64 -march=core2 -msse4.1 -O2 -fomit-frame-pointer vperm
gcc -m64 -march=core2 -msse4.1 -O -fomit-frame-pointer vperm
gcc -m64 -march=core2 -msse4.1 -Os -fomit-frame-pointer vperm
gcc -m64 -march=core2 -msse4 -O2 -fomit-frame-pointer vperm
gcc -m64 -march=core2 -msse4 -O -fomit-frame-pointer vperm
gcc -m64 -march=core2 -msse4 -Os -fomit-frame-pointer vperm
gcc -m64 -march=corei7 -O2 -fomit-frame-pointer vperm
gcc -m64 -march=corei7 -O -fomit-frame-pointer vperm
gcc -m64 -march=corei7 -Os -fomit-frame-pointer vperm

Compiler output

Implementation: crypto_aead/led80n6t4silcv3/vperm
Compiler: cc
led.c: In file included from common.h:7:0,
led.c: from led.c:3:
led.c: led.c: In function 'MixColumnWithSbox':
led.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/tmmintrin.h:136:1: error: inlining failed in call to always_inline '_mm_shuffle_epi8': target specific option mismatch
led.c: _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
led.c: ^
led.c: led.c:200:7: error: called from here
led.c: tmp2 = PSHUFB(LOAD(Mbox8), tmp3);
led.c: ^
led.c: In file included from common.h:7:0,
led.c: from led.c:3:
led.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/tmmintrin.h:136:1: error: inlining failed in call to always_inline '_mm_shuffle_epi8': target specific option mismatch
led.c: _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
led.c: ^
led.c: led.c:199:7: error: called from here
led.c: tmp1 = PSHUFB(LOAD(Mbox7), tmp3);
led.c: ^
led.c: In file included from common.h:7:0,
led.c: from led.c:3:
led.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/tmmintrin.h:136:1: error: inlining failed in call to always_inline '_mm_shuffle_epi8': target specific option mismatch
led.c: _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
led.c: ^
led.c: led.c:191:7: error: called from here
led.c: tmp2 = PSHUFB(LOAD(Mbox6), tmp3);
led.c: ^
led.c: ...

Number of similar (compiler,implementation) pairs: 79, namely:
Compiler  Implementations
cc vperm
gcc vperm
gcc -O2 -fomit-frame-pointer vperm
gcc -O3 -fomit-frame-pointer vperm
gcc -O -fomit-frame-pointer vperm
gcc -Os -fomit-frame-pointer vperm
gcc -fno-schedule-insns -O2 -fomit-frame-pointer vperm
gcc -fno-schedule-insns -O3 -fomit-frame-pointer vperm
gcc -fno-schedule-insns -O -fomit-frame-pointer vperm
gcc -fno-schedule-insns -Os -fomit-frame-pointer vperm
gcc -funroll-loops vperm
gcc -funroll-loops -O2 -fomit-frame-pointer vperm
gcc -funroll-loops -O3 -fomit-frame-pointer vperm
gcc -funroll-loops -O -fomit-frame-pointer vperm
gcc -funroll-loops -Os -fomit-frame-pointer vperm
gcc -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer vperm
gcc -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer vperm
gcc -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer vperm
gcc -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -O2 -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -O3 -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -O -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -Os -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -march=barcelona -O2 -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -march=barcelona -O3 -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -march=barcelona -O -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -march=barcelona -Os -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -march=k8 -O2 -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -march=k8 -O3 -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -march=k8 -O -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -march=k8 -Os -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -march=nocona -O2 -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -march=nocona -O3 -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -march=nocona -O -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -march=nocona -Os -fomit-frame-pointer vperm
gcc -funroll-loops -march=barcelona -O2 -fomit-frame-pointer vperm
gcc -funroll-loops -march=barcelona -O3 -fomit-frame-pointer vperm
gcc -funroll-loops -march=barcelona -O -fomit-frame-pointer vperm
gcc -funroll-loops -march=barcelona -Os -fomit-frame-pointer vperm
gcc -funroll-loops -march=k8 -O2 -fomit-frame-pointer vperm
gcc -funroll-loops -march=k8 -O3 -fomit-frame-pointer vperm
gcc -funroll-loops -march=k8 -O -fomit-frame-pointer vperm
gcc -funroll-loops -march=k8 -Os -fomit-frame-pointer vperm
gcc -funroll-loops -march=nocona -O2 -fomit-frame-pointer vperm
gcc -funroll-loops -march=nocona -O3 -fomit-frame-pointer vperm
gcc -funroll-loops -march=nocona -O -fomit-frame-pointer vperm
gcc -funroll-loops -march=nocona -Os -fomit-frame-pointer vperm
gcc -m64 -O2 -fomit-frame-pointer vperm
gcc -m64 -O3 -fomit-frame-pointer vperm
gcc -m64 -O -fomit-frame-pointer vperm
gcc -m64 -Os -fomit-frame-pointer vperm
gcc -m64 -march=k8 -O2 -fomit-frame-pointer vperm
gcc -m64 -march=k8 -O3 -fomit-frame-pointer vperm
gcc -m64 -march=k8 -O -fomit-frame-pointer vperm
gcc -m64 -march=k8 -Os -fomit-frame-pointer vperm
gcc -m64 -march=native -mtune=native -O2 -fomit-frame-pointer vperm
gcc -m64 -march=native -mtune=native -O3 -fomit-frame-pointer vperm
gcc -m64 -march=native -mtune=native -O -fomit-frame-pointer vperm
gcc -m64 -march=native -mtune=native -Os -fomit-frame-pointer vperm
gcc -m64 -march=nocona -O2 -fomit-frame-pointer vperm
gcc -m64 -march=nocona -O3 -fomit-frame-pointer vperm
gcc -m64 -march=nocona -O -fomit-frame-pointer vperm
gcc -m64 -march=nocona -Os -fomit-frame-pointer vperm
gcc -march=barcelona -O2 -fomit-frame-pointer vperm
gcc -march=barcelona -O3 -fomit-frame-pointer vperm
gcc -march=barcelona -O -fomit-frame-pointer vperm
gcc -march=barcelona -Os -fomit-frame-pointer vperm
gcc -march=k8 -O2 -fomit-frame-pointer vperm
gcc -march=k8 -O3 -fomit-frame-pointer vperm
gcc -march=k8 -O -fomit-frame-pointer vperm
gcc -march=k8 -Os -fomit-frame-pointer vperm
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv vperm
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv vperm
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv vperm
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv vperm
gcc -march=nocona -O2 -fomit-frame-pointer vperm
gcc -march=nocona -O3 -fomit-frame-pointer vperm
gcc -march=nocona -O -fomit-frame-pointer vperm
gcc -march=nocona -Os -fomit-frame-pointer vperm
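
The "target specific option mismatch" errors above arise because led.c calls _mm_shuffle_epi8, an SSSE3 intrinsic, while none of the listed flag sets enables SSSE3 on this machine. One common way to keep such a file compiling is a compile-time guard on the predefined __SSSE3__ macro with a scalar fallback. The following is only a minimal sketch of that idea, not the vperm implementation's code: the helper name shuffle_bytes16 is hypothetical, and the fallback simply mirrors the documented PSHUFB semantics (an index byte with bit 7 set yields 0, otherwise the low 4 bits select a table byte).

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

#if defined(__SSSE3__)
#include <tmmintrin.h>   /* only pulled in when the compiler targets SSSE3 */
#endif

/* Hypothetical helper (not from led.c): 16-byte table lookup with PSHUFB
 * semantics. out[i] = 0 if bit 7 of idx[i] is set, else table[idx[i] & 15]. */
static void shuffle_bytes16(uint8_t out[16], const uint8_t table[16],
                            const uint8_t idx[16])
{
    assert(out != NULL && table != NULL && idx != NULL);
#if defined(__SSSE3__)
    /* SSSE3 path: a single PSHUFB instruction via the intrinsic. */
    __m128i t = _mm_loadu_si128((const __m128i *)table);
    __m128i x = _mm_loadu_si128((const __m128i *)idx);
    _mm_storeu_si128((__m128i *)out, _mm_shuffle_epi8(t, x));
#else
    /* Scalar fallback: same result, one byte at a time. The temporary
     * buffer keeps the result correct even if out aliases table or idx. */
    uint8_t tmp[16];
    for (int i = 0; i < 16; i++)
        tmp[i] = (idx[i] & 0x80) ? 0 : table[idx[i] & 0x0f];
    memcpy(out, tmp, sizeof tmp);
#endif
}
```

With this shape the file builds under flags such as -march=k8 or -march=barcelona, at the cost of the slower byte loop on those targets.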

Compiler output

Implementation: crypto_aead/led80n6t4silcv3/vperm
Compiler: clang -O3 -fomit-frame-pointer -Qunused-arguments
led.c: led.c:172:16: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'MixColumnWithSbox' that is compiled without support for 'ssse3'
led.c: dqword tmp1 = PSHUFB(LOAD(Mbox1), sum);
led.c: ^
led.c: ./common.h:42:22: note: expanded from macro 'PSHUFB'
led.c: #define PSHUFB(s, x) _mm_shuffle_epi8((s), (x)) /*return s(x)*/
led.c: ^
led.c: led.c:174:16: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'MixColumnWithSbox' that is compiled without support for 'ssse3'
led.c: dqword tmp2 = PSHUFB(LOAD(Mbox2), sum);
led.c: ^
led.c: ./common.h:42:22: note: expanded from macro 'PSHUFB'
led.c: #define PSHUFB(s, x) _mm_shuffle_epi8((s), (x)) /*return s(x)*/
led.c: ^
led.c: led.c:181:9: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'MixColumnWithSbox' that is compiled without support for 'ssse3'
led.c: tmp1 = PSHUFB(LOAD(Mbox3), tmp3);
led.c: ^
led.c: ./common.h:42:22: note: expanded from macro 'PSHUFB'
led.c: #define PSHUFB(s, x) _mm_shuffle_epi8((s), (x)) /*return s(x)*/
led.c: ^
led.c: led.c:182:9: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'MixColumnWithSbox' that is compiled without support for 'ssse3'
led.c: tmp2 = PSHUFB(LOAD(Mbox4), tmp3);
led.c: ^
led.c: ./common.h:42:22: note: expanded from macro 'PSHUFB'
led.c: #define PSHUFB(s, x) _mm_shuffle_epi8((s), (x)) /*return s(x)*/
led.c: ^
led.c: led.c:190:9: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'MixColumnWithSbox' that is compiled without support for 'ssse3'
led.c: ...

Number of similar (compiler,implementation) pairs: 6, namely:
Compiler  Implementations
clang -O3 -fomit-frame-pointer -Qunused-arguments vperm
clang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments vperm
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments vperm
clang -mcpu=cortex-a8 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments vperm
clang -mcpu=cortex-a9 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments vperm
clang -mcpu=native -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments vperm
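
The clang diagnostic states the requirement directly: _mm_shuffle_epi8 can only be inlined into a function compiled with the 'ssse3' target feature. Besides passing -mssse3 for the whole file, GCC and clang both accept a per-function target attribute, which can be combined with a runtime CPU check. The sketch below illustrates that pattern under stated assumptions: it needs a compiler whose intrinsic headers can be included without -mssse3 (GCC 4.9 or later, or a recent clang), and the function names shuffle16_ssse3, shuffle16_scalar and shuffle16_dispatch are hypothetical, not taken from led.c.

```c
#include <stdint.h>
#include <string.h>

#if (defined(__GNUC__) || defined(__clang__)) && \
    (defined(__x86_64__) || defined(__i386__))
#include <tmmintrin.h>
#define HAVE_X86_TARGET_ATTR 1

/* Only this function is compiled with SSSE3 enabled, so the always_inline
 * intrinsic can be inlined here even without -mssse3 on the command line. */
static __attribute__((target("ssse3")))
void shuffle16_ssse3(uint8_t out[16], const uint8_t table[16],
                     const uint8_t idx[16])
{
    __m128i t = _mm_loadu_si128((const __m128i *)table);
    __m128i x = _mm_loadu_si128((const __m128i *)idx);
    _mm_storeu_si128((__m128i *)out, _mm_shuffle_epi8(t, x));
}
#endif

/* Portable fallback with PSHUFB semantics. */
static void shuffle16_scalar(uint8_t out[16], const uint8_t table[16],
                             const uint8_t idx[16])
{
    uint8_t tmp[16];
    for (int i = 0; i < 16; i++)
        tmp[i] = (idx[i] & 0x80) ? 0 : table[idx[i] & 0x0f];
    memcpy(out, tmp, sizeof tmp);
}

/* Chooses the SSSE3 path only when the running CPU reports the feature. */
static void shuffle16_dispatch(uint8_t out[16], const uint8_t table[16],
                               const uint8_t idx[16])
{
#ifdef HAVE_X86_TARGET_ATTR
    __builtin_cpu_init();
    if (__builtin_cpu_supports("ssse3")) {
        shuffle16_ssse3(out, table, idx);
        return;
    }
#endif
    shuffle16_scalar(out, table, idx);
}
```

On a CPU without SSSE3 the dispatcher would always take the scalar path, so this pattern avoids both the compile error and an illegal-instruction fault at run time; whether it is worth the extra branch depends on how hot the call site is.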

Compiler output

Implementation: crypto_aead/led80n6t4silcv3/vperm
Compiler: gcc -m64 -march=barcelona -O2 -fomit-frame-pointer
led.c: In file included from common.h:7:0,
led.c: from led.c:3:
led.c: led.c: In function 'MixColumnWithSbox':
led.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/tmmintrin.h:136:1: error: inlining failed in call to always_inline '_mm_shuffle_epi8': target specific option mismatch
led.c: _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
led.c: ^
led.c: led.c:200:7: error: called from here
led.c: tmp2 = PSHUFB(LOAD(Mbox8), tmp3);
led.c: ^
led.c: In file included from common.h:7:0,
led.c: from led.c:3:
led.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/tmmintrin.h:136:1: error: inlining failed in call to always_inline '_mm_shuffle_epi8': target specific option mismatch
led.c: _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
led.c: ^
led.c: led.c:199:7: error: called from here
led.c: tmp1 = PSHUFB(LOAD(Mbox7), tmp3);
led.c: ^
led.c: In file included from common.h:7:0,
led.c: from led.c:3:
led.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/tmmintrin.h:136:1: error: inlining failed in call to always_inline '_mm_shuffle_epi8': target specific option mismatch
led.c: _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
led.c: ^
led.c: led.c:191:7: error: called from here
led.c: tmp2 = PSHUFB(LOAD(Mbox6), tmp3);
led.c: ^
led.c: ...
led.c: In file included from common.h:7:0,
led.c: from led.c:3:
led.c: led.c: In function 'MixColumnWithSbox':
led.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/tmmintrin.h:136:1: error: inlining failed in call to always_inline '_mm_shuffle_epi8': target specific option mismatch
led.c: _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
led.c: ^
led.c: led.c:200:7: error: called from here
led.c: tmp2 = PSHUFB(LOAD(Mbox8), tmp3);
led.c: ^
led.c: In file included from common.h:7:0,
led.c: from led.c:3:
led.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/tmmintrin.h:136:1: error: inlining failed in call to always_inline '_mm_shuffle_epi8': target specific option mismatch
led.c: _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
led.c: ^
led.c: led.c:199:7: error: called from here
led.c: tmp1 = PSHUFB(LOAD(Mbox7), tmp3);
led.c: ^
led.c: In file included from common.h:7:0,
led.c: from led.c:3:
led.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/tmmintrin.h:136:1: error: inlining failed in call to always_inline '_mm_shuffle_epi8': target specific option mismatch
led.c: _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
led.c: ^
led.c: led.c:191:7: error: called from here
led.c: tmp2 = PSHUFB(LOAD(Mbox6), tmp3);
led.c: ^
led.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
Compiler  Implementations
gcc -m64 -march=barcelona -O2 -fomit-frame-pointer vperm
gcc -m64 -march=barcelona -O3 -fomit-frame-pointer vperm
gcc -m64 -march=barcelona -O -fomit-frame-pointer vperm
gcc -m64 -march=barcelona -Os -fomit-frame-pointer vperm