Implementation notes: amd64, oki, crypto_sign/gravitysphincsl

Computer: oki
Architecture: amd64
CPU ID: GenuineIntel-00050654-bfebfbff
SUPERCOP version: 20181123
Operation: crypto_sign
Primitive: gravitysphincsl
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
19656724? ? ?? ? ?fastgcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv2019011320181123
19669752? ? ?? ? ?fastgcc_-m64_-march=native_-mtune=native_-O_-fomit-frame-pointer2019011320181123
19719292? ? ?? ? ?fastgcc_-m64_-march=core-avx2_-O3_-fomit-frame-pointer2019011320181123
19809540? ? ?? ? ?fastgcc_-m64_-march=native_-mtune=native_-O2_-fomit-frame-pointer2019011320181123
19978558? ? ?? ? ?fastgcc_-m64_-march=corei7-avx_-O3_-fomit-frame-pointer2019011320181123
20093284? ? ?? ? ?fastgcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv2019011320181123
20125148? ? ?? ? ?fastclang_-O3_-fwrapv_-march=x86-64_-mcpu=core-avx2_-mavx2_-maes_-mpclmul_-fomit-frame-pointer_-Qunused-arguments2019011320181123
20142966? ? ?? ? ?fastgcc_-m64_-march=core-avx-i_-O3_-fomit-frame-pointer2019011320181123
20163712? ? ?? ? ?fastclang_-O3_-fwrapv_-mavx_-maes_-mpclmul_-fomit-frame-pointer_-Qunused-arguments2019011320181123
20202094? ? ?? ? ?fastgcc_-m64_-march=native_-mtune=native_-O3_-fomit-frame-pointer2019011320181123
20266830? ? ?? ? ?fastgcc_-m64_-march=core-avx-i_-O2_-fomit-frame-pointer2019011320181123
20373928? ? ?? ? ?fastgcc_-m64_-march=native_-mtune=native_-Os_-fomit-frame-pointer2019011320181123
20387678? ? ?? ? ?fastgcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv2019011320181123
20447718? ? ?? ? ?fastgcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv2019011320181123
20451966? ? ?? ? ?fastgcc_-m64_-march=corei7-avx_-O2_-fomit-frame-pointer2019011320181123
20555290? ? ?? ? ?fastgcc_-m64_-march=corei7-avx_-O_-fomit-frame-pointer2019011320181123
20744532? ? ?? ? ?fastgcc_-m64_-march=core-avx2_-O_-fomit-frame-pointer2019011320181123
20747970? ? ?? ? ?fastgcc_-m64_-march=core-avx-i_-O_-fomit-frame-pointer2019011320181123
21308264? ? ?? ? ?fastgcc_-m64_-march=core-avx2_-O2_-fomit-frame-pointer2019011320181123
22089670? ? ?? ? ?fastgcc_-m64_-march=core-avx-i_-Os_-fomit-frame-pointer2019011320181123
22200152? ? ?? ? ?fastgcc_-m64_-march=corei7-avx_-Os_-fomit-frame-pointer2019011320181123
22221198? ? ?? ? ?fastgcc_-m64_-march=core-avx2_-Os_-fomit-frame-pointer2019011320181123

Compiler output

Implementation: fast
Security model: unknown
Compiler: cc
aes.c: In file included from aes.c:6:0:
aes.c: /usr/lib64/gcc/x86_64-suse-linux/4.8/include/smmintrin.h:31:3: error: #error "SSE4.1 instruction set not enabled"
aes.c: # error "SSE4.1 instruction set not enabled"
aes.c: ^
aes.c: In file included from aes.c:9:0:
aes.c: /usr/lib64/gcc/x86_64-suse-linux/4.8/include/wmmintrin.h:34:3: error: #error "AES/PCLMUL instructions not enabled"
aes.c: # error "AES/PCLMUL instructions not enabled"
aes.c: ^
aes.c: aes.c: In function 'expand256':
aes.c: aes.c:43:5: error: incompatible type for argument 2 of 'assist256_1'
aes.c: rkeys[2] = s = assist256_1 (s, _mm_aeskeygenassist_si128 (t, 0x01));
aes.c: ^
aes.c: aes.c:12:16: note: expected '__m128i' but argument is of type 'int'
aes.c: static __m128i assist256_1 (__m128i a, __m128i b) {
aes.c: ^
aes.c: aes.c:44:5: error: incompatible type for argument 2 of 'assist256_2'
aes.c: rkeys[3] = t = assist256_2 (t, _mm_aeskeygenassist_si128 (s, 0x01));
aes.c: ^
aes.c: aes.c:25:16: note: expected '__m128i' but argument is of type 'int'
aes.c: static __m128i assist256_2 (__m128i a, __m128i b) {
aes.c: ^
aes.c: aes.c:45:5: error: incompatible type for argument 2 of 'assist256_1'
aes.c: rkeys[4] = s = assist256_1 (s, _mm_aeskeygenassist_si128 (t, 0x02));
aes.c: ^
aes.c: aes.c:12:16: note: expected '__m128i' but argument is of type 'int'
aes.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
cc fast

Compiler output

Implementation: fast
Security model: unknown
Compiler: clang -O3 -fomit-frame-pointer -Qunused-arguments
aes.c: aes.c:89:14: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'aesctr256_direct_x4' that is compiled without support for 'aes'
aes.c: s1 = _mm_aesenc_si128 (s1, rkeys[1]);
aes.c: ^
aes.c: aes.c:90:14: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'aesctr256_direct_x4' that is compiled without support for 'aes'
aes.c: s2 = _mm_aesenc_si128 (s2, rkeys[1]);
aes.c: ^
aes.c: aes.c:91:14: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'aesctr256_direct_x4' that is compiled without support for 'aes'
aes.c: s3 = _mm_aesenc_si128 (s3, rkeys[1]);
aes.c: ^
aes.c: aes.c:92:14: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'aesctr256_direct_x4' that is compiled without support for 'aes'
aes.c: s4 = _mm_aesenc_si128 (s4, rkeys[1]);
aes.c: ^
aes.c: aes.c:93:14: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'aesctr256_direct_x4' that is compiled without support for 'aes'
aes.c: s1 = _mm_aesenc_si128 (s1, rkeys[2]);
aes.c: ^
aes.c: aes.c:94:14: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'aesctr256_direct_x4' that is compiled without support for 'aes'
aes.c: s2 = _mm_aesenc_si128 (s2, rkeys[2]);
aes.c: ^
aes.c: aes.c:95:14: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'aesctr256_direct_x4' that is compiled without support for 'aes'
aes.c: s3 = _mm_aesenc_si128 (s3, rkeys[2]);
aes.c: ^
aes.c: aes.c:96:14: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'aesctr256_direct_x4' that is compiled without support for 'aes'
aes.c: s4 = _mm_aesenc_si128 (s4, rkeys[2]);
aes.c: ^
aes.c: aes.c:97:14: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'aesctr256_direct_x4' that is compiled without support for 'aes'
aes.c: ...

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
clang -O3 -fomit-frame-pointer -Qunused-arguments fast
clang -O3 -fwrapv -mavx2 -fomit-frame-pointer -Qunused-arguments fast
clang -O3 -fwrapv -mavx -fomit-frame-pointer -Qunused-arguments fast
clang -mcpu=cortex-a8 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments fast
clang -mcpu=cortex-a9 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments fast
clang -mcpu=native -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments fast

Compiler output

Implementation: fast
Security model: unknown
Compiler: clang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments
try.c: fatal error: error in backend: Cannot select: 0xf983d0: v4i64 = X86ISD::VTRUNC 0xf98200
try.c: 0xf98200: v16i32 = vselect 0xf9c680, 0xf33790, 0xf980d0
try.c: 0xf9c680: v4i1 = X86ISD::PCMPGTM 0xf828d0, 0xf34540
try.c: 0xf828d0: v4i64 = X86ISD::VBROADCAST 0xf348d0
try.c: 0xf348d0: i64,ch = load<LD8[%uglygep72]> 0xe7ddc0, 0xf38360, undef:i64
try.c: 0xf38360: i64 = add 0xf837b0, 0xf1e270
try.c: 0xf837b0: i64 = X86ISD::Wrapper TargetGlobalAddress:i64<[64 x i64]* @cycles> 0
try.c: 0xf43ec0: i64 = TargetGlobalAddress<[64 x i64]* @cycles> 0
try.c: 0xf1e270: i64 = shl 0xf43670, Constant:i8<3>
try.c: 0xf43670: i64,ch = CopyFromReg 0xe7ddc0, Register:i64 %vreg50
try.c: 0xf83c70: i64 = Register %vreg50
try.c: 0xf1b850: i8 = Constant<3>
try.c: 0xf80f40: i64 = undef
try.c: 0xf34540: v4i64,ch = CopyFromReg 0xe7ddc0, Register:v4i64 %vreg13
try.c: 0xf3e1b0: v4i64 = Register %vreg13
try.c: 0xf33790: v16i32 = X86ISD::VBROADCAST 0xf438d0
try.c: 0xf438d0: i32,ch = load<LD4[ConstantPool]> 0xe7ddc0, 0xf97d40, undef:i64
try.c: 0xf97d40: i64 = X86ISD::Wrapper TargetConstantPool:i64<i32 1> 0
try.c: 0xf302d0: i64 = TargetConstantPool<i32 1> 0
try.c: 0xf80f40: i64 = undef
try.c: 0xf980d0: v16i32 = BUILD_VECTOR Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>
try.c: 0xf97fa0: i32 = Constant<0>
try.c: 0xf97fa0: i32 = Constant<0>
try.c: 0xf97fa0: i32 = Constant<0>
try.c: 0xf97fa0: i32 = Constant<0>
try.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments fast

Compiler output

Implementation: fast
Security model: unknown
Compiler: clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments
try.c: fatal error: error in backend: Cannot select: 0x1f2afe0: v4i64 = X86ISD::VTRUNC 0x1f2ae10
try.c: 0x1f2ae10: v16i32 = vselect 0x1f27a50, 0x1ea2d40, 0x1f2ace0
try.c: 0x1f27a50: v4i1 = X86ISD::PCMPGTM 0x1f130c0, 0x1ea8f50
try.c: 0x1f130c0: v4i64 = X86ISD::VBROADCAST 0x1ea92e0
try.c: 0x1ea92e0: i64,ch = load<LD8[%uglygep72]> 0x1e0ddd0, 0x1f0aa30, undef:i64
try.c: 0x1f0aa30: i64 = add 0x1f13fa0, 0x1ec0950
try.c: 0x1f13fa0: i64 = X86ISD::Wrapper TargetGlobalAddress:i64<[64 x i64]* @cycles> 0
try.c: 0x1f0bdd0: i64 = TargetGlobalAddress<[64 x i64]* @cycles> 0
try.c: 0x1ec0950: i64 = shl 0x1f0b580, Constant:i8<3>
try.c: 0x1f0b580: i64,ch = CopyFromReg 0x1e0ddd0, Register:i64 %vreg50
try.c: 0x1f14460: i64 = Register %vreg50
try.c: 0x1ebdf30: i8 = Constant<3>
try.c: 0x1f10f20: i64 = undef
try.c: 0x1ea8f50: v4i64,ch = CopyFromReg 0x1e0ddd0, Register:v4i64 %vreg13
try.c: 0x1eca850: v4i64 = Register %vreg13
try.c: 0x1ea2d40: v16i32 = X86ISD::VBROADCAST 0x1f0b7e0
try.c: 0x1f0b7e0: i32,ch = load<LD4[ConstantPool]> 0x1e0ddd0, 0x1f2a950, undef:i64
try.c: 0x1f2a950: i64 = X86ISD::Wrapper TargetConstantPool:i64<i32 1> 0
try.c: 0x1ec8370: i64 = TargetConstantPool<i32 1> 0
try.c: 0x1f10f20: i64 = undef
try.c: 0x1f2ace0: v16i32 = BUILD_VECTOR Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>
try.c: 0x1f2abb0: i32 = Constant<0>
try.c: 0x1f2abb0: i32 = Constant<0>
try.c: 0x1f2abb0: i32 = Constant<0>
try.c: 0x1f2abb0: i32 = Constant<0>
try.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments fast

Compiler output

Implementation: fast
Security model: unknown
Compiler: gcc
aes.c: In file included from /usr/lib64/gcc/x86_64-suse-linux/7/include/smmintrin.h:32:0,
aes.c: from aes.c:6:
aes.c: aes.c: In function 'increment_be':
aes.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/tmmintrin.h:136:1: error: inlining failed in call to always_inline '_mm_shuffle_epi8': target specific option mismatch
aes.c: _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
aes.c: ^~~~~~~~~~~~~~~~
aes.c: aes.c:63:7: note: called from here
aes.c: x = _mm_shuffle_epi8 (x, swap);
aes.c: ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~
aes.c: In file included from /usr/lib64/gcc/x86_64-suse-linux/7/include/smmintrin.h:32:0,
aes.c: from aes.c:6:
aes.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/tmmintrin.h:136:1: error: inlining failed in call to always_inline '_mm_shuffle_epi8': target specific option mismatch
aes.c: _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
aes.c: ^~~~~~~~~~~~~~~~
aes.c: aes.c:61:7: note: called from here
aes.c: x = _mm_shuffle_epi8 (x, swap);
aes.c: ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~
aes.c: In file included from /usr/lib64/gcc/x86_64-suse-linux/7/include/smmintrin.h:32:0,
aes.c: from aes.c:6:
aes.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/tmmintrin.h:136:1: error: inlining failed in call to always_inline '_mm_shuffle_epi8': target specific option mismatch
aes.c: _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
aes.c: ^~~~~~~~~~~~~~~~
aes.c: aes.c:61:7: note: called from here
aes.c: x = _mm_shuffle_epi8 (x, swap);
aes.c: ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~
aes.c: ...

Number of similar (compiler,implementation) pairs: 2, namely:
CompilerImplementations
gcc fast
gcc -funroll-loops fast

Compiler output

Implementation: fast
Security model: unknown
Compiler: gcc -O2 -fomit-frame-pointer
aes.c: In file included from aes.c:9:0:
aes.c: aes.c: In function 'expand256':
aes.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/wmmintrin.h:87:1: error: inlining failed in call to always_inline '_mm_aeskeygenassist_si128': target specific option mismatch
aes.c: _mm_aeskeygenassist_si128 (__m128i __X, const int __C)
aes.c: ^~~~~~~~~~~~~~~~~~~~~~~~~
aes.c: aes.c:55:21: note: called from here
aes.c: rkeys[14] = s = assist256_1 (s, _mm_aeskeygenassist_si128 (t, 0x40));
aes.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
aes.c: In file included from aes.c:9:0:
aes.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/wmmintrin.h:87:1: error: inlining failed in call to always_inline '_mm_aeskeygenassist_si128': target specific option mismatch
aes.c: _mm_aeskeygenassist_si128 (__m128i __X, const int __C)
aes.c: ^~~~~~~~~~~~~~~~~~~~~~~~~
aes.c: aes.c:54:21: note: called from here
aes.c: rkeys[13] = t = assist256_2 (t, _mm_aeskeygenassist_si128 (s, 0x20));
aes.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
aes.c: In file included from aes.c:9:0:
aes.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/wmmintrin.h:87:1: error: inlining failed in call to always_inline '_mm_aeskeygenassist_si128': target specific option mismatch
aes.c: _mm_aeskeygenassist_si128 (__m128i __X, const int __C)
aes.c: ^~~~~~~~~~~~~~~~~~~~~~~~~
aes.c: aes.c:53:21: note: called from here
aes.c: rkeys[12] = s = assist256_1 (s, _mm_aeskeygenassist_si128 (t, 0x20));
aes.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
aes.c: In file included from aes.c:9:0:
aes.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/wmmintrin.h:87:1: error: inlining failed in call to always_inline '_mm_aeskeygenassist_si128': target specific option mismatch
aes.c: _mm_aeskeygenassist_si128 (__m128i __X, const int __C)
aes.c: ...

Number of similar (compiler,implementation) pairs: 84, namely:
CompilerImplementations
gcc -O2 -fomit-frame-pointer fast
gcc -O3 -fomit-frame-pointer fast
gcc -O -fomit-frame-pointer fast
gcc -Os -fomit-frame-pointer fast
gcc -fno-schedule-insns -O2 -fomit-frame-pointer fast
gcc -fno-schedule-insns -O3 -fomit-frame-pointer fast
gcc -fno-schedule-insns -O -fomit-frame-pointer fast
gcc -fno-schedule-insns -Os -fomit-frame-pointer fast
gcc -funroll-loops -O2 -fomit-frame-pointer fast
gcc -funroll-loops -O3 -fomit-frame-pointer fast
gcc -funroll-loops -O -fomit-frame-pointer fast
gcc -funroll-loops -Os -fomit-frame-pointer fast
gcc -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer fast
gcc -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer fast
gcc -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer fast
gcc -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer fast
gcc -funroll-loops -m64 -O2 -fomit-frame-pointer fast
gcc -funroll-loops -m64 -O3 -fomit-frame-pointer fast
gcc -funroll-loops -m64 -O -fomit-frame-pointer fast
gcc -funroll-loops -m64 -Os -fomit-frame-pointer fast
gcc -funroll-loops -m64 -march=barcelona -O2 -fomit-frame-pointer fast
gcc -funroll-loops -m64 -march=barcelona -O3 -fomit-frame-pointer fast
gcc -funroll-loops -m64 -march=barcelona -O -fomit-frame-pointer fast
gcc -funroll-loops -m64 -march=barcelona -Os -fomit-frame-pointer fast
gcc -funroll-loops -m64 -march=k8 -O2 -fomit-frame-pointer fast
gcc -funroll-loops -m64 -march=k8 -O3 -fomit-frame-pointer fast
gcc -funroll-loops -m64 -march=k8 -O -fomit-frame-pointer fast
gcc -funroll-loops -m64 -march=k8 -Os -fomit-frame-pointer fast
gcc -funroll-loops -m64 -march=nocona -O2 -fomit-frame-pointer fast
gcc -funroll-loops -m64 -march=nocona -O3 -fomit-frame-pointer fast
gcc -funroll-loops -m64 -march=nocona -O -fomit-frame-pointer fast
gcc -funroll-loops -m64 -march=nocona -Os -fomit-frame-pointer fast
gcc -funroll-loops -march=barcelona -O2 -fomit-frame-pointer fast
gcc -funroll-loops -march=barcelona -O3 -fomit-frame-pointer fast
gcc -funroll-loops -march=barcelona -O -fomit-frame-pointer fast
gcc -funroll-loops -march=barcelona -Os -fomit-frame-pointer fast
gcc -funroll-loops -march=k8 -O2 -fomit-frame-pointer fast
gcc -funroll-loops -march=k8 -O3 -fomit-frame-pointer fast
gcc -funroll-loops -march=k8 -O -fomit-frame-pointer fast
gcc -funroll-loops -march=k8 -Os -fomit-frame-pointer fast
gcc -funroll-loops -march=nocona -O2 -fomit-frame-pointer fast
gcc -funroll-loops -march=nocona -O3 -fomit-frame-pointer fast
gcc -funroll-loops -march=nocona -O -fomit-frame-pointer fast
gcc -funroll-loops -march=nocona -Os -fomit-frame-pointer fast
gcc -m64 -O2 -fomit-frame-pointer fast
gcc -m64 -O3 -fomit-frame-pointer fast
gcc -m64 -O -fomit-frame-pointer fast
gcc -m64 -Os -fomit-frame-pointer fast
gcc -m64 -march=core2 -O2 -fomit-frame-pointer fast
gcc -m64 -march=core2 -O3 -fomit-frame-pointer fast
gcc -m64 -march=core2 -O -fomit-frame-pointer fast
gcc -m64 -march=core2 -Os -fomit-frame-pointer fast
gcc -m64 -march=core2 -msse4.1 -O2 -fomit-frame-pointer fast
gcc -m64 -march=core2 -msse4.1 -O3 -fomit-frame-pointer fast
gcc -m64 -march=core2 -msse4.1 -O -fomit-frame-pointer fast
gcc -m64 -march=core2 -msse4.1 -Os -fomit-frame-pointer fast
gcc -m64 -march=core2 -msse4 -O2 -fomit-frame-pointer fast
gcc -m64 -march=core2 -msse4 -O3 -fomit-frame-pointer fast
gcc -m64 -march=core2 -msse4 -O -fomit-frame-pointer fast
gcc -m64 -march=core2 -msse4 -Os -fomit-frame-pointer fast
gcc -m64 -march=corei7 -O2 -fomit-frame-pointer fast
gcc -m64 -march=corei7 -O3 -fomit-frame-pointer fast
gcc -m64 -march=corei7 -O -fomit-frame-pointer fast
gcc -m64 -march=corei7 -Os -fomit-frame-pointer fast
gcc -m64 -march=k8 -O2 -fomit-frame-pointer fast
gcc -m64 -march=k8 -O3 -fomit-frame-pointer fast
gcc -m64 -march=k8 -O -fomit-frame-pointer fast
gcc -m64 -march=k8 -Os -fomit-frame-pointer fast
gcc -m64 -march=nocona -O2 -fomit-frame-pointer fast
gcc -m64 -march=nocona -O3 -fomit-frame-pointer fast
gcc -m64 -march=nocona -O -fomit-frame-pointer fast
gcc -m64 -march=nocona -Os -fomit-frame-pointer fast
gcc -march=barcelona -O2 -fomit-frame-pointer fast
gcc -march=barcelona -O3 -fomit-frame-pointer fast
gcc -march=barcelona -O -fomit-frame-pointer fast
gcc -march=barcelona -Os -fomit-frame-pointer fast
gcc -march=k8 -O2 -fomit-frame-pointer fast
gcc -march=k8 -O3 -fomit-frame-pointer fast
gcc -march=k8 -O -fomit-frame-pointer fast
gcc -march=k8 -Os -fomit-frame-pointer fast
gcc -march=nocona -O2 -fomit-frame-pointer fast
gcc -march=nocona -O3 -fomit-frame-pointer fast
gcc -march=nocona -O -fomit-frame-pointer fast
gcc -march=nocona -Os -fomit-frame-pointer fast

Compiler output

Implementation: fast
Security model: unknown
Compiler: gcc -m64 -march=barcelona -O2 -fomit-frame-pointer
aes.c: In file included from aes.c:9:0:
aes.c: aes.c: In function 'expand256':
aes.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/wmmintrin.h:87:1: error: inlining failed in call to always_inline '_mm_aeskeygenassist_si128': target specific option mismatch
aes.c: _mm_aeskeygenassist_si128 (__m128i __X, const int __C)
aes.c: ^~~~~~~~~~~~~~~~~~~~~~~~~
aes.c: aes.c:55:21: note: called from here
aes.c: rkeys[14] = s = assist256_1 (s, _mm_aeskeygenassist_si128 (t, 0x40));
aes.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
aes.c: In file included from aes.c:9:0:
aes.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/wmmintrin.h:87:1: error: inlining failed in call to always_inline '_mm_aeskeygenassist_si128': target specific option mismatch
aes.c: _mm_aeskeygenassist_si128 (__m128i __X, const int __C)
aes.c: ^~~~~~~~~~~~~~~~~~~~~~~~~
aes.c: aes.c:54:21: note: called from here
aes.c: rkeys[13] = t = assist256_2 (t, _mm_aeskeygenassist_si128 (s, 0x20));
aes.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
aes.c: In file included from aes.c:9:0:
aes.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/wmmintrin.h:87:1: error: inlining failed in call to always_inline '_mm_aeskeygenassist_si128': target specific option mismatch
aes.c: _mm_aeskeygenassist_si128 (__m128i __X, const int __C)
aes.c: ^~~~~~~~~~~~~~~~~~~~~~~~~
aes.c: aes.c:53:21: note: called from here
aes.c: rkeys[12] = s = assist256_1 (s, _mm_aeskeygenassist_si128 (t, 0x20));
aes.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
aes.c: In file included from aes.c:9:0:
aes.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/wmmintrin.h:87:1: error: inlining failed in call to always_inline '_mm_aeskeygenassist_si128': target specific option mismatch
aes.c: _mm_aeskeygenassist_si128 (__m128i __X, const int __C)
aes.c: ...
aes.c: In file included from aes.c:9:0:
aes.c: aes.c: In function 'expand256':
aes.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/wmmintrin.h:87:1: error: inlining failed in call to always_inline '_mm_aeskeygenassist_si128': target specific option mismatch
aes.c: _mm_aeskeygenassist_si128 (__m128i __X, const int __C)
aes.c: ^~~~~~~~~~~~~~~~~~~~~~~~~
aes.c: aes.c:55:21: note: called from here
aes.c: rkeys[14] = s = assist256_1 (s, _mm_aeskeygenassist_si128 (t, 0x40));
aes.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
aes.c: In file included from aes.c:9:0:
aes.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/wmmintrin.h:87:1: error: inlining failed in call to always_inline '_mm_aeskeygenassist_si128': target specific option mismatch
aes.c: _mm_aeskeygenassist_si128 (__m128i __X, const int __C)
aes.c: ^~~~~~~~~~~~~~~~~~~~~~~~~
aes.c: aes.c:54:21: note: called from here
aes.c: rkeys[13] = t = assist256_2 (t, _mm_aeskeygenassist_si128 (s, 0x20));
aes.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
aes.c: In file included from aes.c:9:0:
aes.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/wmmintrin.h:87:1: error: inlining failed in call to always_inline '_mm_aeskeygenassist_si128': target specific option mismatch
aes.c: _mm_aeskeygenassist_si128 (__m128i __X, const int __C)
aes.c: ^~~~~~~~~~~~~~~~~~~~~~~~~
aes.c: aes.c:53:21: note: called from here
aes.c: rkeys[12] = s = assist256_1 (s, _mm_aeskeygenassist_si128 (t, 0x20));
aes.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
aes.c: In file included from aes.c:9:0:
aes.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/wmmintrin.h:87:1: error: inlining failed in call to always_inline '_mm_aeskeygenassist_si128': target specific option mismatch
aes.c: _mm_aeskeygenassist_si128 (__m128i __X, const int __C)
aes.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -m64 -march=barcelona -O2 -fomit-frame-pointer fast
gcc -m64 -march=barcelona -O3 -fomit-frame-pointer fast
gcc -m64 -march=barcelona -O -fomit-frame-pointer fast
gcc -m64 -march=barcelona -Os -fomit-frame-pointer fast