Implementation notes: amd64, glyme, crypto_aead/led80n6t4silcv3

Computer: glyme
Architecture: amd64
CPU ID: GenuineIntel-00020652-bfebfbff
SUPERCOP version: 201720170105
Operation: crypto_aead
Primitive: led80n6t4silcv3
TimeImplementationCompilerBenchmark dateSUPERCOP version
1013572vpermclang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments2017020420170105
1013576vpermclang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments2017020420170105
1019884vpermgcc -m64 -march=core2 -msse4.1 -O3 -fomit-frame-pointer2017020420170105
1033888vpermgcc -m64 -march=core2 -O3 -fomit-frame-pointer2017020420170105
1042120vpermgcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv2017020420170105
1125220vpermgcc -m64 -march=native -mtune=native -O3 -fomit-frame-pointer2017020420170105
1209512vpermgcc -m64 -march=core2 -msse4 -O3 -fomit-frame-pointer2017020420170105
1281828vpermgcc -m64 -march=corei7 -O3 -fomit-frame-pointer2017020420170105
1417900vpermgcc -m64 -march=native -mtune=native -O2 -fomit-frame-pointer2017020420170105
1421896vpermgcc -m64 -march=corei7 -O2 -fomit-frame-pointer2017020420170105
1463032vpermgcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv2017020420170105
1526060vpermgcc -m64 -march=core2 -msse4.1 -O2 -fomit-frame-pointer2017020420170105
1529784vpermgcc -m64 -march=core2 -msse4 -O2 -fomit-frame-pointer2017020420170105
1534912vpermgcc -m64 -march=core2 -O2 -fomit-frame-pointer2017020420170105
1718940vpermgcc -m64 -march=corei7 -O -fomit-frame-pointer2017020420170105
1723036vpermgcc -m64 -march=native -mtune=native -O -fomit-frame-pointer2017020420170105
1756396vpermgcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv2017020420170105
1791372vpermgcc -m64 -march=core2 -O -fomit-frame-pointer2017020420170105
1803592vpermgcc -m64 -march=core2 -msse4.1 -O -fomit-frame-pointer2017020420170105
1803604vpermgcc -m64 -march=core2 -msse4 -O -fomit-frame-pointer2017020420170105
2147320vpermgcc -m64 -march=native -mtune=native -Os -fomit-frame-pointer2017020420170105
2147572vpermgcc -m64 -march=corei7 -Os -fomit-frame-pointer2017020420170105
2147584vpermgcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv2017020420170105
2184516vpermgcc -m64 -march=core2 -msse4.1 -Os -fomit-frame-pointer2017020420170105
2184760vpermgcc -m64 -march=core2 -msse4 -Os -fomit-frame-pointer2017020420170105
2184916vpermgcc -m64 -march=core2 -Os -fomit-frame-pointer2017020420170105
54544784refgcc -funroll-loops -march=nocona -O2 -fomit-frame-pointer2017020420170105
54545500refgcc -funroll-loops -m64 -march=nocona -O2 -fomit-frame-pointer2017020420170105
62846496refgcc -funroll-loops -march=k8 -O3 -fomit-frame-pointer2017020420170105
62859180refgcc -funroll-loops -march=nocona -O3 -fomit-frame-pointer2017020420170105
62896456refgcc -funroll-loops -m64 -march=k8 -O3 -fomit-frame-pointer2017020420170105
62990868refgcc -funroll-loops -m64 -march=nocona -O3 -fomit-frame-pointer2017020420170105
63172940refgcc -funroll-loops -march=barcelona -O3 -fomit-frame-pointer2017020420170105
63189796refgcc -funroll-loops -m64 -march=barcelona -O3 -fomit-frame-pointer2017020420170105
63626632refgcc -funroll-loops -O3 -fomit-frame-pointer2017020420170105
63652316refgcc -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer2017020420170105
63665372refgcc -funroll-loops -m64 -O3 -fomit-frame-pointer2017020420170105
63935840refgcc -march=nocona -O3 -fomit-frame-pointer2017020420170105
63981936refgcc -m64 -march=nocona -O3 -fomit-frame-pointer2017020420170105
64091944refgcc -m64 -march=core2 -O3 -fomit-frame-pointer2017020420170105
64316072refgcc -m64 -march=corei7 -O3 -fomit-frame-pointer2017020420170105
64318680refgcc -m64 -march=core2 -msse4 -O3 -fomit-frame-pointer2017020420170105
64351120refgcc -O3 -fomit-frame-pointer2017020420170105
64354612refgcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv2017020420170105
64394352refgcc -m64 -march=native -mtune=native -O3 -fomit-frame-pointer2017020420170105
64408844refgcc -m64 -march=core2 -msse4.1 -O3 -fomit-frame-pointer2017020420170105
64448560refgcc -m64 -O3 -fomit-frame-pointer2017020420170105
64563568refgcc -march=barcelona -O3 -fomit-frame-pointer2017020420170105
64581980refgcc -fno-schedule-insns -O3 -fomit-frame-pointer2017020420170105
64661796refgcc -m64 -march=barcelona -O3 -fomit-frame-pointer2017020420170105
64817924refgcc -march=k8 -O3 -fomit-frame-pointer2017020420170105
64833716refgcc -m64 -march=k8 -O3 -fomit-frame-pointer2017020420170105
66088036refgcc -funroll-loops -march=nocona -O -fomit-frame-pointer2017020420170105
66090552refgcc -funroll-loops -m64 -march=nocona -O -fomit-frame-pointer2017020420170105
68909916refgcc -funroll-loops -m64 -march=barcelona -O2 -fomit-frame-pointer2017020420170105
68911236refgcc -funroll-loops -march=barcelona -O2 -fomit-frame-pointer2017020420170105
68992444refgcc -funroll-loops -m64 -march=k8 -O2 -fomit-frame-pointer2017020420170105
69001128refgcc -funroll-loops -march=k8 -O2 -fomit-frame-pointer2017020420170105
71895000refgcc -funroll-loops -m64 -O2 -fomit-frame-pointer2017020420170105
71899924refgcc -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer2017020420170105
71906356refgcc -funroll-loops -O2 -fomit-frame-pointer2017020420170105
84430316refgcc -funroll-loops -march=barcelona -O -fomit-frame-pointer2017020420170105
84431780refgcc -funroll-loops -m64 -march=k8 -O -fomit-frame-pointer2017020420170105
84455112refgcc -funroll-loops -march=k8 -O -fomit-frame-pointer2017020420170105
84513372refgcc -funroll-loops -m64 -march=barcelona -O -fomit-frame-pointer2017020420170105
84605864refgcc -funroll-loops -m64 -O -fomit-frame-pointer2017020420170105
84607596refgcc -funroll-loops -O -fomit-frame-pointer2017020420170105
84614156refgcc -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer2017020420170105
100737520refgcc -m64 -march=nocona -O2 -fomit-frame-pointer2017020420170105
100760436refgcc -march=nocona -O2 -fomit-frame-pointer2017020420170105
105387820refgcc -m64 -march=corei7 -O2 -fomit-frame-pointer2017020420170105
106342016refgcc -m64 -march=k8 -O2 -fomit-frame-pointer2017020420170105
106418840refgcc -m64 -march=native -mtune=native -O2 -fomit-frame-pointer2017020420170105
106454656refgcc -march=k8 -O2 -fomit-frame-pointer2017020420170105
106496772refgcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv2017020420170105
106629724refgcc -m64 -march=barcelona -O2 -fomit-frame-pointer2017020420170105
106649744refgcc -march=barcelona -O2 -fomit-frame-pointer2017020420170105
106724168refgcc -m64 -march=core2 -msse4 -O2 -fomit-frame-pointer2017020420170105
106747404refgcc -m64 -march=core2 -msse4.1 -O2 -fomit-frame-pointer2017020420170105
106752524refgcc -m64 -march=core2 -O2 -fomit-frame-pointer2017020420170105
106795260refgcc -O2 -fomit-frame-pointer2017020420170105
106831440refgcc -m64 -O2 -fomit-frame-pointer2017020420170105
106922840refgcc -fno-schedule-insns -O2 -fomit-frame-pointer2017020420170105
119618472refgcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv2017020420170105
119635176refgcc -m64 -march=corei7 -O -fomit-frame-pointer2017020420170105
119694228refgcc -m64 -march=native -mtune=native -O -fomit-frame-pointer2017020420170105
120006232refgcc -m64 -march=core2 -O -fomit-frame-pointer2017020420170105
120014020refgcc -m64 -march=core2 -msse4 -O -fomit-frame-pointer2017020420170105
120100060refgcc -m64 -march=core2 -msse4.1 -O -fomit-frame-pointer2017020420170105
120389236refgcc -O -fomit-frame-pointer2017020420170105
120541012refgcc -fno-schedule-insns -O -fomit-frame-pointer2017020420170105
120542768refgcc -m64 -O -fomit-frame-pointer2017020420170105
120611956refgcc -m64 -march=barcelona -O -fomit-frame-pointer2017020420170105
120668928refgcc -march=barcelona -O -fomit-frame-pointer2017020420170105
121721848refgcc -march=k8 -O -fomit-frame-pointer2017020420170105
121812832refgcc -m64 -march=k8 -O -fomit-frame-pointer2017020420170105
198642820refgcc -march=nocona -O -fomit-frame-pointer2017020420170105
198951092refgcc -m64 -march=nocona -O -fomit-frame-pointer2017020420170105
208992904refgcc -fno-schedule-insns -Os -fomit-frame-pointer2017020420170105
209032320refgcc -march=nocona -Os -fomit-frame-pointer2017020420170105
209050968refgcc -m64 -march=core2 -Os -fomit-frame-pointer2017020420170105
209123856refgcc -march=k8 -Os -fomit-frame-pointer2017020420170105
209125576refgcc -m64 -march=core2 -msse4 -Os -fomit-frame-pointer2017020420170105
209253372refgcc -m64 -march=nocona -Os -fomit-frame-pointer2017020420170105
209303988refgcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv2017020420170105
209355840refgcc -m64 -Os -fomit-frame-pointer2017020420170105
209524808refgcc -funroll-loops -m64 -march=nocona -Os -fomit-frame-pointer2017020420170105
209528572refgcc -funroll-loops -march=nocona -Os -fomit-frame-pointer2017020420170105
209538432refgcc -m64 -march=native -mtune=native -Os -fomit-frame-pointer2017020420170105
209725656refgcc -m64 -march=k8 -Os -fomit-frame-pointer2017020420170105
209807208refgcc -m64 -march=core2 -msse4.1 -Os -fomit-frame-pointer2017020420170105
209852332refgcc -m64 -march=corei7 -Os -fomit-frame-pointer2017020420170105
209887328refgcc -m64 -march=barcelona -Os -fomit-frame-pointer2017020420170105
209929148refgcc -Os -fomit-frame-pointer2017020420170105
209932396refgcc -march=barcelona -Os -fomit-frame-pointer2017020420170105
213093488refgcc -funroll-loops -m64 -march=barcelona -Os -fomit-frame-pointer2017020420170105
213101156refgcc -funroll-loops -m64 -march=k8 -Os -fomit-frame-pointer2017020420170105
213110704refgcc -funroll-loops -m64 -Os -fomit-frame-pointer2017020420170105
213112864refgcc -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer2017020420170105
213147592refgcc -funroll-loops -march=k8 -Os -fomit-frame-pointer2017020420170105
213165648refgcc -funroll-loops -Os -fomit-frame-pointer2017020420170105
213189492refgcc -funroll-loops -march=barcelona -Os -fomit-frame-pointer2017020420170105
222506876refclang -mcpu=cortex-a8 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments2017020420170105
223170204refclang -O3 -fomit-frame-pointer -Qunused-arguments2017020420170105
223271936refclang -mcpu=cortex-a9 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments2017020420170105
223517592refclang -mcpu=native -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments2017020420170105
224945808refclang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments2017020420170105
225087724refclang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments2017020420170105
602565484refcc2017020420170105
631891052refgcc -funroll-loops2017020420170105
635358744refgcc2017020420170105

Compiler output

Implementation: crypto_aead/led80n6t4silcv3/vperm
Compiler: cc
led.c: led.c:172:16: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'MixColumnWithSbox' that is compiled without support for 'ssse3'
led.c: dqword tmp1 = PSHUFB(LOAD(Mbox1), sum);
led.c: ^
led.c: ./common.h:42:22: note: expanded from macro 'PSHUFB'
led.c: #define PSHUFB(s, x) _mm_shuffle_epi8((s), (x)) /*return s(x)*/
led.c: ^
led.c: led.c:174:16: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'MixColumnWithSbox' that is compiled without support for 'ssse3'
led.c: dqword tmp2 = PSHUFB(LOAD(Mbox2), sum);
led.c: ^
led.c: ./common.h:42:22: note: expanded from macro 'PSHUFB'
led.c: #define PSHUFB(s, x) _mm_shuffle_epi8((s), (x)) /*return s(x)*/
led.c: ^
led.c: led.c:181:9: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'MixColumnWithSbox' that is compiled without support for 'ssse3'
led.c: tmp1 = PSHUFB(LOAD(Mbox3), tmp3);
led.c: ^
led.c: ./common.h:42:22: note: expanded from macro 'PSHUFB'
led.c: #define PSHUFB(s, x) _mm_shuffle_epi8((s), (x)) /*return s(x)*/
led.c: ^
led.c: led.c:182:9: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'MixColumnWithSbox' that is compiled without support for 'ssse3'
led.c: tmp2 = PSHUFB(LOAD(Mbox4), tmp3);
led.c: ^
led.c: ./common.h:42:22: note: expanded from macro 'PSHUFB'
led.c: #define PSHUFB(s, x) _mm_shuffle_epi8((s), (x)) /*return s(x)*/
led.c: ^
led.c: led.c:190:9: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'MixColumnWithSbox' that is compiled without support for 'ssse3'
led.c: ...

Number of similar (compiler,implementation) pairs: 5, namely:
CompilerImplementations
cc vperm
clang -O3 -fomit-frame-pointer -Qunused-arguments vperm
clang -mcpu=cortex-a8 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments vperm
clang -mcpu=cortex-a9 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments vperm
clang -mcpu=native -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments vperm

Compiler output

Implementation: crypto_aead/led80n6t4silcv3/vperm
Compiler: gcc
led.c: In file included from common.h:7:0,
led.c: from led.c:3:
led.c: led.c: In function 'MixColumnWithSbox':
led.c: /usr/local/lib/gcc6/gcc/x86_64-portbld-freebsd11.0/6.3.0/include/tmmintrin.h:136:1: error: inlining failed in call to always_inline '_mm_shuffle_epi8': target specific option mismatch
led.c: _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
led.c: ^~~~~~~~~~~~~~~~
led.c: led.c:200:7: note: called from here
led.c: tmp2 = PSHUFB(LOAD(Mbox8), tmp3);
led.c:
led.c: In file included from common.h:7:0,
led.c: from led.c:3:
led.c: /usr/local/lib/gcc6/gcc/x86_64-portbld-freebsd11.0/6.3.0/include/tmmintrin.h:136:1: error: inlining failed in call to always_inline '_mm_shuffle_epi8': target specific option mismatch
led.c: _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
led.c: ^~~~~~~~~~~~~~~~
led.c: led.c:199:7: note: called from here
led.c: tmp1 = PSHUFB(LOAD(Mbox7), tmp3);
led.c:
led.c: In file included from common.h:7:0,
led.c: from led.c:3:
led.c: /usr/local/lib/gcc6/gcc/x86_64-portbld-freebsd11.0/6.3.0/include/tmmintrin.h:136:1: error: inlining failed in call to always_inline '_mm_shuffle_epi8': target specific option mismatch
led.c: _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
led.c: ^~~~~~~~~~~~~~~~
led.c: led.c:191:7: note: called from here
led.c: tmp2 = PSHUFB(LOAD(Mbox6), tmp3);
led.c:
led.c: ...

Number of similar (compiler,implementation) pairs: 70, namely:
CompilerImplementations
gcc vperm
gcc -O2 -fomit-frame-pointer vperm
gcc -O3 -fomit-frame-pointer vperm
gcc -O -fomit-frame-pointer vperm
gcc -Os -fomit-frame-pointer vperm
gcc -fno-schedule-insns -O2 -fomit-frame-pointer vperm
gcc -fno-schedule-insns -O3 -fomit-frame-pointer vperm
gcc -fno-schedule-insns -O -fomit-frame-pointer vperm
gcc -fno-schedule-insns -Os -fomit-frame-pointer vperm
gcc -funroll-loops vperm
gcc -funroll-loops -O2 -fomit-frame-pointer vperm
gcc -funroll-loops -O3 -fomit-frame-pointer vperm
gcc -funroll-loops -O -fomit-frame-pointer vperm
gcc -funroll-loops -Os -fomit-frame-pointer vperm
gcc -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer vperm
gcc -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer vperm
gcc -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer vperm
gcc -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -O2 -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -O3 -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -O -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -Os -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -march=barcelona -O2 -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -march=barcelona -O3 -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -march=barcelona -O -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -march=barcelona -Os -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -march=k8 -O2 -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -march=k8 -O3 -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -march=k8 -O -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -march=k8 -Os -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -march=nocona -O2 -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -march=nocona -O3 -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -march=nocona -O -fomit-frame-pointer vperm
gcc -funroll-loops -m64 -march=nocona -Os -fomit-frame-pointer vperm
gcc -funroll-loops -march=barcelona -O2 -fomit-frame-pointer vperm
gcc -funroll-loops -march=barcelona -O3 -fomit-frame-pointer vperm
gcc -funroll-loops -march=barcelona -O -fomit-frame-pointer vperm
gcc -funroll-loops -march=barcelona -Os -fomit-frame-pointer vperm
gcc -funroll-loops -march=k8 -O2 -fomit-frame-pointer vperm
gcc -funroll-loops -march=k8 -O3 -fomit-frame-pointer vperm
gcc -funroll-loops -march=k8 -O -fomit-frame-pointer vperm
gcc -funroll-loops -march=k8 -Os -fomit-frame-pointer vperm
gcc -funroll-loops -march=nocona -O2 -fomit-frame-pointer vperm
gcc -funroll-loops -march=nocona -O3 -fomit-frame-pointer vperm
gcc -funroll-loops -march=nocona -O -fomit-frame-pointer vperm
gcc -funroll-loops -march=nocona -Os -fomit-frame-pointer vperm
gcc -m64 -O2 -fomit-frame-pointer vperm
gcc -m64 -O3 -fomit-frame-pointer vperm
gcc -m64 -O -fomit-frame-pointer vperm
gcc -m64 -Os -fomit-frame-pointer vperm
gcc -m64 -march=k8 -O2 -fomit-frame-pointer vperm
gcc -m64 -march=k8 -O3 -fomit-frame-pointer vperm
gcc -m64 -march=k8 -O -fomit-frame-pointer vperm
gcc -m64 -march=k8 -Os -fomit-frame-pointer vperm
gcc -m64 -march=nocona -O2 -fomit-frame-pointer vperm
gcc -m64 -march=nocona -O3 -fomit-frame-pointer vperm
gcc -m64 -march=nocona -O -fomit-frame-pointer vperm
gcc -m64 -march=nocona -Os -fomit-frame-pointer vperm
gcc -march=barcelona -O2 -fomit-frame-pointer vperm
gcc -march=barcelona -O3 -fomit-frame-pointer vperm
gcc -march=barcelona -O -fomit-frame-pointer vperm
gcc -march=barcelona -Os -fomit-frame-pointer vperm
gcc -march=k8 -O2 -fomit-frame-pointer vperm
gcc -march=k8 -O3 -fomit-frame-pointer vperm
gcc -march=k8 -O -fomit-frame-pointer vperm
gcc -march=k8 -Os -fomit-frame-pointer vperm
gcc -march=nocona -O2 -fomit-frame-pointer vperm
gcc -march=nocona -O3 -fomit-frame-pointer vperm
gcc -march=nocona -O -fomit-frame-pointer vperm
gcc -march=nocona -Os -fomit-frame-pointer vperm

Compiler output

Implementation: crypto_aead/led80n6t4silcv3/vperm
Compiler: gcc -m64 -march=barcelona -O2 -fomit-frame-pointer
led.c: In file included from common.h:7:0,
led.c: from led.c:3:
led.c: led.c: In function 'MixColumnWithSbox':
led.c: /usr/local/lib/gcc6/gcc/x86_64-portbld-freebsd11.0/6.3.0/include/tmmintrin.h:136:1: error: inlining failed in call to always_inline '_mm_shuffle_epi8': target specific option mismatch
led.c: _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
led.c: ^~~~~~~~~~~~~~~~
led.c: led.c:200:7: note: called from here
led.c: tmp2 = PSHUFB(LOAD(Mbox8), tmp3);
led.c:
led.c: In file included from common.h:7:0,
led.c: from led.c:3:
led.c: /usr/local/lib/gcc6/gcc/x86_64-portbld-freebsd11.0/6.3.0/include/tmmintrin.h:136:1: error: inlining failed in call to always_inline '_mm_shuffle_epi8': target specific option mismatch
led.c: _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
led.c: ^~~~~~~~~~~~~~~~
led.c: led.c:199:7: note: called from here
led.c: tmp1 = PSHUFB(LOAD(Mbox7), tmp3);
led.c:
led.c: In file included from common.h:7:0,
led.c: from led.c:3:
led.c: /usr/local/lib/gcc6/gcc/x86_64-portbld-freebsd11.0/6.3.0/include/tmmintrin.h:136:1: error: inlining failed in call to always_inline '_mm_shuffle_epi8': target specific option mismatch
led.c: _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
led.c: ^~~~~~~~~~~~~~~~
led.c: led.c:191:7: note: called from here
led.c: tmp2 = PSHUFB(LOAD(Mbox6), tmp3);
led.c:
led.c: ...
led.c: In file included from common.h:7:0,
led.c: from led.c:3:
led.c: led.c: In function 'MixColumnWithSbox':
led.c: /usr/local/lib/gcc6/gcc/x86_64-portbld-freebsd11.0/6.3.0/include/tmmintrin.h:136:1: error: inlining failed in call to always_inline '_mm_shuffle_epi8': target specific option mismatch
led.c: _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
led.c: ^~~~~~~~~~~~~~~~
led.c: led.c:200:7: note: called from here
led.c: tmp2 = PSHUFB(LOAD(Mbox8), tmp3);
led.c:
led.c: In file included from common.h:7:0,
led.c: from led.c:3:
led.c: /usr/local/lib/gcc6/gcc/x86_64-portbld-freebsd11.0/6.3.0/include/tmmintrin.h:136:1: error: inlining failed in call to always_inline '_mm_shuffle_epi8': target specific option mismatch
led.c: _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
led.c: ^~~~~~~~~~~~~~~~
led.c: led.c:199:7: note: called from here
led.c: tmp1 = PSHUFB(LOAD(Mbox7), tmp3);
led.c:
led.c: In file included from common.h:7:0,
led.c: from led.c:3:
led.c: /usr/local/lib/gcc6/gcc/x86_64-portbld-freebsd11.0/6.3.0/include/tmmintrin.h:136:1: error: inlining failed in call to always_inline '_mm_shuffle_epi8': target specific option mismatch
led.c: _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
led.c: ^~~~~~~~~~~~~~~~
led.c: led.c:191:7: note: called from here
led.c: tmp2 = PSHUFB(LOAD(Mbox6), tmp3);
led.c:
led.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -m64 -march=barcelona -O2 -fomit-frame-pointer vperm
gcc -m64 -march=barcelona -O3 -fomit-frame-pointer vperm
gcc -m64 -march=barcelona -O -fomit-frame-pointer vperm
gcc -m64 -march=barcelona -Os -fomit-frame-pointer vperm