Implementation notes: amd64, oki, crypto_sign/gravitysphincss

Computer: oki
Architecture: amd64
CPU ID: GenuineIntel-00050654-bfebfbff
SUPERCOP version: 20181123
Operation: crypto_sign
Primitive: gravitysphincss
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
8813528? ? ?? ? ?fastgcc_-m64_-march=native_-mtune=native_-O2_-fomit-frame-pointer2019011420181123
8863014? ? ?? ? ?fastgcc_-m64_-march=native_-mtune=native_-O_-fomit-frame-pointer2019011420181123
8882894? ? ?? ? ?fastgcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv2019011420181123
8914060? ? ?? ? ?fastgcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv2019011420181123
9094840? ? ?? ? ?fastgcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv2019011420181123
9137436? ? ?? ? ?fastgcc_-m64_-march=native_-mtune=native_-Os_-fomit-frame-pointer2019011420181123
9142180? ? ?? ? ?fastgcc_-m64_-march=core-avx2_-O3_-fomit-frame-pointer2019011420181123
9143640? ? ?? ? ?fastgcc_-m64_-march=corei7-avx_-O3_-fomit-frame-pointer2019011420181123
9311518? ? ?? ? ?fastgcc_-m64_-march=native_-mtune=native_-O3_-fomit-frame-pointer2019011420181123
9403474? ? ?? ? ?fastgcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv2019011420181123
9465750? ? ?? ? ?fastclang_-O3_-fwrapv_-march=x86-64_-mcpu=core-avx2_-mavx2_-maes_-mpclmul_-fomit-frame-pointer_-Qunused-arguments2019011420181123
9526638? ? ?? ? ?fastgcc_-m64_-march=core-avx2_-O2_-fomit-frame-pointer2019011420181123
9532356? ? ?? ? ?fastclang_-O3_-fwrapv_-mavx_-maes_-mpclmul_-fomit-frame-pointer_-Qunused-arguments2019011420181123
9560988? ? ?? ? ?fastgcc_-m64_-march=core-avx-i_-O3_-fomit-frame-pointer2019011420181123
9643542? ? ?? ? ?fastgcc_-m64_-march=corei7-avx_-O_-fomit-frame-pointer2019011420181123
9661596? ? ?? ? ?fastgcc_-m64_-march=corei7-avx_-O2_-fomit-frame-pointer2019011420181123
9683066? ? ?? ? ?fastgcc_-m64_-march=core-avx-i_-O_-fomit-frame-pointer2019011420181123
9839122? ? ?? ? ?fastgcc_-m64_-march=core-avx2_-Os_-fomit-frame-pointer2019011420181123
9882286? ? ?? ? ?fastgcc_-m64_-march=core-avx2_-O_-fomit-frame-pointer2019011420181123
9939852? ? ?? ? ?fastgcc_-m64_-march=core-avx-i_-O2_-fomit-frame-pointer2019011420181123
9967306? ? ?? ? ?fastgcc_-m64_-march=core-avx-i_-Os_-fomit-frame-pointer2019011420181123
10317624? ? ?? ? ?fastgcc_-m64_-march=corei7-avx_-Os_-fomit-frame-pointer2019011420181123

Compiler output

Implementation: fast
Security model: unknown
Compiler: cc
aes.c: In file included from aes.c:6:0:
aes.c: /usr/lib64/gcc/x86_64-suse-linux/4.8/include/smmintrin.h:31:3: error: #error "SSE4.1 instruction set not enabled"
aes.c: # error "SSE4.1 instruction set not enabled"
aes.c: ^
aes.c: In file included from aes.c:9:0:
aes.c: /usr/lib64/gcc/x86_64-suse-linux/4.8/include/wmmintrin.h:34:3: error: #error "AES/PCLMUL instructions not enabled"
aes.c: # error "AES/PCLMUL instructions not enabled"
aes.c: ^
aes.c: aes.c: In function 'expand256':
aes.c: aes.c:43:5: error: incompatible type for argument 2 of 'assist256_1'
aes.c: rkeys[2] = s = assist256_1 (s, _mm_aeskeygenassist_si128 (t, 0x01));
aes.c: ^
aes.c: aes.c:12:16: note: expected '__m128i' but argument is of type 'int'
aes.c: static __m128i assist256_1 (__m128i a, __m128i b) {
aes.c: ^
aes.c: aes.c:44:5: error: incompatible type for argument 2 of 'assist256_2'
aes.c: rkeys[3] = t = assist256_2 (t, _mm_aeskeygenassist_si128 (s, 0x01));
aes.c: ^
aes.c: aes.c:25:16: note: expected '__m128i' but argument is of type 'int'
aes.c: static __m128i assist256_2 (__m128i a, __m128i b) {
aes.c: ^
aes.c: aes.c:45:5: error: incompatible type for argument 2 of 'assist256_1'
aes.c: rkeys[4] = s = assist256_1 (s, _mm_aeskeygenassist_si128 (t, 0x02));
aes.c: ^
aes.c: aes.c:12:16: note: expected '__m128i' but argument is of type 'int'
aes.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
cc fast

Compiler output

Implementation: fast
Security model: unknown
Compiler: clang -O3 -fomit-frame-pointer -Qunused-arguments
aes.c: aes.c:89:14: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'aesctr256_direct_x4' that is compiled without support for 'aes'
aes.c: s1 = _mm_aesenc_si128 (s1, rkeys[1]);
aes.c: ^
aes.c: aes.c:90:14: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'aesctr256_direct_x4' that is compiled without support for 'aes'
aes.c: s2 = _mm_aesenc_si128 (s2, rkeys[1]);
aes.c: ^
aes.c: aes.c:91:14: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'aesctr256_direct_x4' that is compiled without support for 'aes'
aes.c: s3 = _mm_aesenc_si128 (s3, rkeys[1]);
aes.c: ^
aes.c: aes.c:92:14: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'aesctr256_direct_x4' that is compiled without support for 'aes'
aes.c: s4 = _mm_aesenc_si128 (s4, rkeys[1]);
aes.c: ^
aes.c: aes.c:93:14: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'aesctr256_direct_x4' that is compiled without support for 'aes'
aes.c: s1 = _mm_aesenc_si128 (s1, rkeys[2]);
aes.c: ^
aes.c: aes.c:94:14: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'aesctr256_direct_x4' that is compiled without support for 'aes'
aes.c: s2 = _mm_aesenc_si128 (s2, rkeys[2]);
aes.c: ^
aes.c: aes.c:95:14: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'aesctr256_direct_x4' that is compiled without support for 'aes'
aes.c: s3 = _mm_aesenc_si128 (s3, rkeys[2]);
aes.c: ^
aes.c: aes.c:96:14: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'aesctr256_direct_x4' that is compiled without support for 'aes'
aes.c: s4 = _mm_aesenc_si128 (s4, rkeys[2]);
aes.c: ^
aes.c: aes.c:97:14: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'aesctr256_direct_x4' that is compiled without support for 'aes'
aes.c: ...

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
clang -O3 -fomit-frame-pointer -Qunused-arguments fast
clang -O3 -fwrapv -mavx2 -fomit-frame-pointer -Qunused-arguments fast
clang -O3 -fwrapv -mavx -fomit-frame-pointer -Qunused-arguments fast
clang -mcpu=cortex-a8 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments fast
clang -mcpu=cortex-a9 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments fast
clang -mcpu=native -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments fast

Compiler output

Implementation: fast
Security model: unknown
Compiler: clang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments
try.c: fatal error: error in backend: Cannot select: 0x268cca0: v4i64 = X86ISD::VTRUNC 0x268cad0
try.c: 0x268cad0: v16i32 = vselect 0x2689710, 0x2630a10, 0x268c9a0
try.c: 0x2689710: v4i1 = X86ISD::PCMPGTM 0x2675990, 0x26317c0
try.c: 0x2675990: v4i64 = X86ISD::VBROADCAST 0x2631b50
try.c: 0x2631b50: i64,ch = load<LD8[%uglygep72]> 0x256fdd0, 0x2627340, undef:i64
try.c: 0x2627340: i64 = add 0x2676870, 0x26222d0
try.c: 0x2676870: i64 = X86ISD::Wrapper TargetGlobalAddress:i64<[64 x i64]* @cycles> 0
try.c: 0x2664790: i64 = TargetGlobalAddress<[64 x i64]* @cycles> 0
try.c: 0x26222d0: i64 = shl 0x2663f40, Constant:i8<3>
try.c: 0x2663f40: i64,ch = CopyFromReg 0x256fdd0, Register:i64 %vreg50
try.c: 0x2676d30: i64 = Register %vreg50
try.c: 0x26159e0: i8 = Constant<3>
try.c: 0x26737f0: i64 = undef
try.c: 0x26317c0: v4i64,ch = CopyFromReg 0x256fdd0, Register:v4i64 %vreg13
try.c: 0x2611160: v4i64 = Register %vreg13
try.c: 0x2630a10: v16i32 = X86ISD::VBROADCAST 0x26641a0
try.c: 0x26641a0: i32,ch = load<LD4[ConstantPool]> 0x256fdd0, 0x268c610, undef:i64
try.c: 0x268c610: i64 = X86ISD::Wrapper TargetConstantPool:i64<i32 1> 0
try.c: 0x260ec80: i64 = TargetConstantPool<i32 1> 0
try.c: 0x26737f0: i64 = undef
try.c: 0x268c9a0: v16i32 = BUILD_VECTOR Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>
try.c: 0x268c870: i32 = Constant<0>
try.c: 0x268c870: i32 = Constant<0>
try.c: 0x268c870: i32 = Constant<0>
try.c: 0x268c870: i32 = Constant<0>
try.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments fast

Compiler output

Implementation: fast
Security model: unknown
Compiler: clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments
try.c: fatal error: error in backend: Cannot select: 0x1303060: v4i64 = X86ISD::VTRUNC 0x1302e90
try.c: 0x1302e90: v16i32 = vselect 0x1309e30, 0x129cdf0, 0x1302d60
try.c: 0x1309e30: v4i1 = X86ISD::PCMPGTM 0x12ee570, 0x129dba0
try.c: 0x12ee570: v4i64 = X86ISD::VBROADCAST 0x129df30
try.c: 0x129df30: i64,ch = load<LD8[%uglygep72]> 0x11e7dd0, 0x1284b60, undef:i64
try.c: 0x1284b60: i64 = add 0x12ef450, 0x127d800
try.c: 0x12ef450: i64 = X86ISD::Wrapper TargetGlobalAddress:i64<[64 x i64]* @cycles> 0
try.c: 0x12de600: i64 = TargetGlobalAddress<[64 x i64]* @cycles> 0
try.c: 0x127d800: i64 = shl 0x12dddb0, Constant:i8<3>
try.c: 0x12dddb0: i64,ch = CopyFromReg 0x11e7dd0, Register:i64 %vreg50
try.c: 0x12ef910: i64 = Register %vreg50
try.c: 0x128d770: i8 = Constant<3>
try.c: 0x12ecbe0: i64 = undef
try.c: 0x129dba0: v4i64,ch = CopyFromReg 0x11e7dd0, Register:v4i64 %vreg13
try.c: 0x12822b0: v4i64 = Register %vreg13
try.c: 0x129cdf0: v16i32 = X86ISD::VBROADCAST 0x12de010
try.c: 0x12de010: i32,ch = load<LD4[ConstantPool]> 0x11e7dd0, 0x13029d0, undef:i64
try.c: 0x13029d0: i64 = X86ISD::Wrapper TargetConstantPool:i64<i32 1> 0
try.c: 0x1292830: i64 = TargetConstantPool<i32 1> 0
try.c: 0x12ecbe0: i64 = undef
try.c: 0x1302d60: v16i32 = BUILD_VECTOR Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>
try.c: 0x1302c30: i32 = Constant<0>
try.c: 0x1302c30: i32 = Constant<0>
try.c: 0x1302c30: i32 = Constant<0>
try.c: 0x1302c30: i32 = Constant<0>
try.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments fast

Compiler output

Implementation: fast
Security model: unknown
Compiler: gcc
aes.c: In file included from /usr/lib64/gcc/x86_64-suse-linux/7/include/smmintrin.h:32:0,
aes.c: from aes.c:6:
aes.c: aes.c: In function 'increment_be':
aes.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/tmmintrin.h:136:1: error: inlining failed in call to always_inline '_mm_shuffle_epi8': target specific option mismatch
aes.c: _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
aes.c: ^~~~~~~~~~~~~~~~
aes.c: aes.c:63:7: note: called from here
aes.c: x = _mm_shuffle_epi8 (x, swap);
aes.c: ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~
aes.c: In file included from /usr/lib64/gcc/x86_64-suse-linux/7/include/smmintrin.h:32:0,
aes.c: from aes.c:6:
aes.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/tmmintrin.h:136:1: error: inlining failed in call to always_inline '_mm_shuffle_epi8': target specific option mismatch
aes.c: _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
aes.c: ^~~~~~~~~~~~~~~~
aes.c: aes.c:61:7: note: called from here
aes.c: x = _mm_shuffle_epi8 (x, swap);
aes.c: ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~
aes.c: In file included from /usr/lib64/gcc/x86_64-suse-linux/7/include/smmintrin.h:32:0,
aes.c: from aes.c:6:
aes.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/tmmintrin.h:136:1: error: inlining failed in call to always_inline '_mm_shuffle_epi8': target specific option mismatch
aes.c: _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
aes.c: ^~~~~~~~~~~~~~~~
aes.c: aes.c:61:7: note: called from here
aes.c: x = _mm_shuffle_epi8 (x, swap);
aes.c: ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~
aes.c: ...

Number of similar (compiler,implementation) pairs: 2, namely:
CompilerImplementations
gcc fast
gcc -funroll-loops fast

Compiler output

Implementation: fast
Security model: unknown
Compiler: gcc -O2 -fomit-frame-pointer
aes.c: In file included from aes.c:9:0:
aes.c: aes.c: In function 'expand256':
aes.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/wmmintrin.h:87:1: error: inlining failed in call to always_inline '_mm_aeskeygenassist_si128': target specific option mismatch
aes.c: _mm_aeskeygenassist_si128 (__m128i __X, const int __C)
aes.c: ^~~~~~~~~~~~~~~~~~~~~~~~~
aes.c: aes.c:55:21: note: called from here
aes.c: rkeys[14] = s = assist256_1 (s, _mm_aeskeygenassist_si128 (t, 0x40));
aes.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
aes.c: In file included from aes.c:9:0:
aes.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/wmmintrin.h:87:1: error: inlining failed in call to always_inline '_mm_aeskeygenassist_si128': target specific option mismatch
aes.c: _mm_aeskeygenassist_si128 (__m128i __X, const int __C)
aes.c: ^~~~~~~~~~~~~~~~~~~~~~~~~
aes.c: aes.c:54:21: note: called from here
aes.c: rkeys[13] = t = assist256_2 (t, _mm_aeskeygenassist_si128 (s, 0x20));
aes.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
aes.c: In file included from aes.c:9:0:
aes.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/wmmintrin.h:87:1: error: inlining failed in call to always_inline '_mm_aeskeygenassist_si128': target specific option mismatch
aes.c: _mm_aeskeygenassist_si128 (__m128i __X, const int __C)
aes.c: ^~~~~~~~~~~~~~~~~~~~~~~~~
aes.c: aes.c:53:21: note: called from here
aes.c: rkeys[12] = s = assist256_1 (s, _mm_aeskeygenassist_si128 (t, 0x20));
aes.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
aes.c: In file included from aes.c:9:0:
aes.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/wmmintrin.h:87:1: error: inlining failed in call to always_inline '_mm_aeskeygenassist_si128': target specific option mismatch
aes.c: _mm_aeskeygenassist_si128 (__m128i __X, const int __C)
aes.c: ...

Number of similar (compiler,implementation) pairs: 84, namely:
CompilerImplementations
gcc -O2 -fomit-frame-pointer fast
gcc -O3 -fomit-frame-pointer fast
gcc -O -fomit-frame-pointer fast
gcc -Os -fomit-frame-pointer fast
gcc -fno-schedule-insns -O2 -fomit-frame-pointer fast
gcc -fno-schedule-insns -O3 -fomit-frame-pointer fast
gcc -fno-schedule-insns -O -fomit-frame-pointer fast
gcc -fno-schedule-insns -Os -fomit-frame-pointer fast
gcc -funroll-loops -O2 -fomit-frame-pointer fast
gcc -funroll-loops -O3 -fomit-frame-pointer fast
gcc -funroll-loops -O -fomit-frame-pointer fast
gcc -funroll-loops -Os -fomit-frame-pointer fast
gcc -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer fast
gcc -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer fast
gcc -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer fast
gcc -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer fast
gcc -funroll-loops -m64 -O2 -fomit-frame-pointer fast
gcc -funroll-loops -m64 -O3 -fomit-frame-pointer fast
gcc -funroll-loops -m64 -O -fomit-frame-pointer fast
gcc -funroll-loops -m64 -Os -fomit-frame-pointer fast
gcc -funroll-loops -m64 -march=barcelona -O2 -fomit-frame-pointer fast
gcc -funroll-loops -m64 -march=barcelona -O3 -fomit-frame-pointer fast
gcc -funroll-loops -m64 -march=barcelona -O -fomit-frame-pointer fast
gcc -funroll-loops -m64 -march=barcelona -Os -fomit-frame-pointer fast
gcc -funroll-loops -m64 -march=k8 -O2 -fomit-frame-pointer fast
gcc -funroll-loops -m64 -march=k8 -O3 -fomit-frame-pointer fast
gcc -funroll-loops -m64 -march=k8 -O -fomit-frame-pointer fast
gcc -funroll-loops -m64 -march=k8 -Os -fomit-frame-pointer fast
gcc -funroll-loops -m64 -march=nocona -O2 -fomit-frame-pointer fast
gcc -funroll-loops -m64 -march=nocona -O3 -fomit-frame-pointer fast
gcc -funroll-loops -m64 -march=nocona -O -fomit-frame-pointer fast
gcc -funroll-loops -m64 -march=nocona -Os -fomit-frame-pointer fast
gcc -funroll-loops -march=barcelona -O2 -fomit-frame-pointer fast
gcc -funroll-loops -march=barcelona -O3 -fomit-frame-pointer fast
gcc -funroll-loops -march=barcelona -O -fomit-frame-pointer fast
gcc -funroll-loops -march=barcelona -Os -fomit-frame-pointer fast
gcc -funroll-loops -march=k8 -O2 -fomit-frame-pointer fast
gcc -funroll-loops -march=k8 -O3 -fomit-frame-pointer fast
gcc -funroll-loops -march=k8 -O -fomit-frame-pointer fast
gcc -funroll-loops -march=k8 -Os -fomit-frame-pointer fast
gcc -funroll-loops -march=nocona -O2 -fomit-frame-pointer fast
gcc -funroll-loops -march=nocona -O3 -fomit-frame-pointer fast
gcc -funroll-loops -march=nocona -O -fomit-frame-pointer fast
gcc -funroll-loops -march=nocona -Os -fomit-frame-pointer fast
gcc -m64 -O2 -fomit-frame-pointer fast
gcc -m64 -O3 -fomit-frame-pointer fast
gcc -m64 -O -fomit-frame-pointer fast
gcc -m64 -Os -fomit-frame-pointer fast
gcc -m64 -march=core2 -O2 -fomit-frame-pointer fast
gcc -m64 -march=core2 -O3 -fomit-frame-pointer fast
gcc -m64 -march=core2 -O -fomit-frame-pointer fast
gcc -m64 -march=core2 -Os -fomit-frame-pointer fast
gcc -m64 -march=core2 -msse4.1 -O2 -fomit-frame-pointer fast
gcc -m64 -march=core2 -msse4.1 -O3 -fomit-frame-pointer fast
gcc -m64 -march=core2 -msse4.1 -O -fomit-frame-pointer fast
gcc -m64 -march=core2 -msse4.1 -Os -fomit-frame-pointer fast
gcc -m64 -march=core2 -msse4 -O2 -fomit-frame-pointer fast
gcc -m64 -march=core2 -msse4 -O3 -fomit-frame-pointer fast
gcc -m64 -march=core2 -msse4 -O -fomit-frame-pointer fast
gcc -m64 -march=core2 -msse4 -Os -fomit-frame-pointer fast
gcc -m64 -march=corei7 -O2 -fomit-frame-pointer fast
gcc -m64 -march=corei7 -O3 -fomit-frame-pointer fast
gcc -m64 -march=corei7 -O -fomit-frame-pointer fast
gcc -m64 -march=corei7 -Os -fomit-frame-pointer fast
gcc -m64 -march=k8 -O2 -fomit-frame-pointer fast
gcc -m64 -march=k8 -O3 -fomit-frame-pointer fast
gcc -m64 -march=k8 -O -fomit-frame-pointer fast
gcc -m64 -march=k8 -Os -fomit-frame-pointer fast
gcc -m64 -march=nocona -O2 -fomit-frame-pointer fast
gcc -m64 -march=nocona -O3 -fomit-frame-pointer fast
gcc -m64 -march=nocona -O -fomit-frame-pointer fast
gcc -m64 -march=nocona -Os -fomit-frame-pointer fast
gcc -march=barcelona -O2 -fomit-frame-pointer fast
gcc -march=barcelona -O3 -fomit-frame-pointer fast
gcc -march=barcelona -O -fomit-frame-pointer fast
gcc -march=barcelona -Os -fomit-frame-pointer fast
gcc -march=k8 -O2 -fomit-frame-pointer fast
gcc -march=k8 -O3 -fomit-frame-pointer fast
gcc -march=k8 -O -fomit-frame-pointer fast
gcc -march=k8 -Os -fomit-frame-pointer fast
gcc -march=nocona -O2 -fomit-frame-pointer fast
gcc -march=nocona -O3 -fomit-frame-pointer fast
gcc -march=nocona -O -fomit-frame-pointer fast
gcc -march=nocona -Os -fomit-frame-pointer fast

Compiler output

Implementation: fast
Security model: unknown
Compiler: gcc -m64 -march=barcelona -O2 -fomit-frame-pointer
aes.c: In file included from aes.c:9:0:
aes.c: aes.c: In function 'expand256':
aes.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/wmmintrin.h:87:1: error: inlining failed in call to always_inline '_mm_aeskeygenassist_si128': target specific option mismatch
aes.c: _mm_aeskeygenassist_si128 (__m128i __X, const int __C)
aes.c: ^~~~~~~~~~~~~~~~~~~~~~~~~
aes.c: aes.c:55:21: note: called from here
aes.c: rkeys[14] = s = assist256_1 (s, _mm_aeskeygenassist_si128 (t, 0x40));
aes.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
aes.c: In file included from aes.c:9:0:
aes.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/wmmintrin.h:87:1: error: inlining failed in call to always_inline '_mm_aeskeygenassist_si128': target specific option mismatch
aes.c: _mm_aeskeygenassist_si128 (__m128i __X, const int __C)
aes.c: ^~~~~~~~~~~~~~~~~~~~~~~~~
aes.c: aes.c:54:21: note: called from here
aes.c: rkeys[13] = t = assist256_2 (t, _mm_aeskeygenassist_si128 (s, 0x20));
aes.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
aes.c: In file included from aes.c:9:0:
aes.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/wmmintrin.h:87:1: error: inlining failed in call to always_inline '_mm_aeskeygenassist_si128': target specific option mismatch
aes.c: _mm_aeskeygenassist_si128 (__m128i __X, const int __C)
aes.c: ^~~~~~~~~~~~~~~~~~~~~~~~~
aes.c: aes.c:53:21: note: called from here
aes.c: rkeys[12] = s = assist256_1 (s, _mm_aeskeygenassist_si128 (t, 0x20));
aes.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
aes.c: In file included from aes.c:9:0:
aes.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/wmmintrin.h:87:1: error: inlining failed in call to always_inline '_mm_aeskeygenassist_si128': target specific option mismatch
aes.c: _mm_aeskeygenassist_si128 (__m128i __X, const int __C)
aes.c: ...
aes.c: In file included from aes.c:9:0:
aes.c: aes.c: In function 'expand256':
aes.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/wmmintrin.h:87:1: error: inlining failed in call to always_inline '_mm_aeskeygenassist_si128': target specific option mismatch
aes.c: _mm_aeskeygenassist_si128 (__m128i __X, const int __C)
aes.c: ^~~~~~~~~~~~~~~~~~~~~~~~~
aes.c: aes.c:55:21: note: called from here
aes.c: rkeys[14] = s = assist256_1 (s, _mm_aeskeygenassist_si128 (t, 0x40));
aes.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
aes.c: In file included from aes.c:9:0:
aes.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/wmmintrin.h:87:1: error: inlining failed in call to always_inline '_mm_aeskeygenassist_si128': target specific option mismatch
aes.c: _mm_aeskeygenassist_si128 (__m128i __X, const int __C)
aes.c: ^~~~~~~~~~~~~~~~~~~~~~~~~
aes.c: aes.c:54:21: note: called from here
aes.c: rkeys[13] = t = assist256_2 (t, _mm_aeskeygenassist_si128 (s, 0x20));
aes.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
aes.c: In file included from aes.c:9:0:
aes.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/wmmintrin.h:87:1: error: inlining failed in call to always_inline '_mm_aeskeygenassist_si128': target specific option mismatch
aes.c: _mm_aeskeygenassist_si128 (__m128i __X, const int __C)
aes.c: ^~~~~~~~~~~~~~~~~~~~~~~~~
aes.c: aes.c:53:21: note: called from here
aes.c: rkeys[12] = s = assist256_1 (s, _mm_aeskeygenassist_si128 (t, 0x20));
aes.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
aes.c: In file included from aes.c:9:0:
aes.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/wmmintrin.h:87:1: error: inlining failed in call to always_inline '_mm_aeskeygenassist_si128': target specific option mismatch
aes.c: _mm_aeskeygenassist_si128 (__m128i __X, const int __C)
aes.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -m64 -march=barcelona -O2 -fomit-frame-pointer fast
gcc -m64 -march=barcelona -O3 -fomit-frame-pointer fast
gcc -m64 -march=barcelona -O -fomit-frame-pointer fast
gcc -m64 -march=barcelona -Os -fomit-frame-pointer fast