Implementation notes: amd64, oki, crypto_sign/sphincss128haraka

Computer: oki
Architecture: amd64
CPU ID: GenuineIntel-00050654-bfebfbff
SUPERCOP version: 20181123
Operation: crypto_sign
Primitive: sphincss128haraka
Time | Object size | Test size | Implementation | Compiler | Benchmark date | SUPERCOP version
505530534 | ? ? ? | ? ? ? | aesni | clang_-O3_-fwrapv_-mavx_-maes_-mpclmul_-fomit-frame-pointer_-Qunused-arguments | 20190115 | 20181123
505853858 | ? ? ? | ? ? ? | aesni | clang_-O3_-fwrapv_-march=x86-64_-mcpu=core-avx2_-mavx2_-maes_-mpclmul_-fomit-frame-pointer_-Qunused-arguments | 20190115 | 20181123
549071490 | ? ? ? | ? ? ? | aesni | gcc_-m64_-march=core-avx-i_-O3_-fomit-frame-pointer | 20190115 | 20181123
549257502 | ? ? ? | ? ? ? | aesni | gcc_-m64_-march=corei7-avx_-O3_-fomit-frame-pointer | 20190115 | 20181123
554701544 | ? ? ? | ? ? ? | aesni | gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv | 20190115 | 20181123
557637892 | ? ? ? | ? ? ? | aesni | gcc_-m64_-march=native_-mtune=native_-O3_-fomit-frame-pointer | 20190115 | 20181123
571283916 | ? ? ? | ? ? ? | aesni | gcc_-m64_-march=core-avx2_-O3_-fomit-frame-pointer | 20190115 | 20181123
637016238 | ? ? ? | ? ? ? | aesni | gcc_-m64_-march=corei7-avx_-O2_-fomit-frame-pointer | 20190115 | 20181123
639206274 | ? ? ? | ? ? ? | aesni | gcc_-m64_-march=core-avx-i_-O2_-fomit-frame-pointer | 20190115 | 20181123
641947948 | ? ? ? | ? ? ? | aesni | gcc_-m64_-march=core-avx2_-O2_-fomit-frame-pointer | 20190115 | 20181123
680265138 | ? ? ? | ? ? ? | aesni | gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv | 20190115 | 20181123
688749204 | ? ? ? | ? ? ? | aesni | gcc_-m64_-march=native_-mtune=native_-O2_-fomit-frame-pointer | 20190115 | 20181123
744421898 | ? ? ? | ? ? ? | aesni | gcc_-m64_-march=core-avx2_-O_-fomit-frame-pointer | 20190115 | 20181123
750991864 | ? ? ? | ? ? ? | aesni | gcc_-m64_-march=native_-mtune=native_-O_-fomit-frame-pointer | 20190115 | 20181123
766321338 | ? ? ? | ? ? ? | aesni | gcc_-m64_-march=core-avx-i_-O_-fomit-frame-pointer | 20190115 | 20181123
768971972 | ? ? ? | ? ? ? | aesni | gcc_-m64_-march=corei7-avx_-O_-fomit-frame-pointer | 20190115 | 20181123
809604028 | ? ? ? | ? ? ? | aesni | gcc_-m64_-march=native_-mtune=native_-Os_-fomit-frame-pointer | 20190115 | 20181123
829702366 | ? ? ? | ? ? ? | aesni | gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv | 20190115 | 20181123
838782834 | ? ? ? | ? ? ? | aesni | gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv | 20190115 | 20181123
867019514 | ? ? ? | ? ? ? | aesni | gcc_-m64_-march=corei7-avx_-Os_-fomit-frame-pointer | 20190115 | 20181123
869299542 | ? ? ? | ? ? ? | aesni | gcc_-m64_-march=core-avx-i_-Os_-fomit-frame-pointer | 20190115 | 20181123
903192348 | ? ? ? | ? ? ? | aesni | gcc_-m64_-march=core-avx2_-Os_-fomit-frame-pointer | 20190115 | 20181123
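
To put the spread of the timings above in perspective, the following is a minimal sketch that computes how much slower the slowest successful build is than the fastest one. It assumes the Time column is a cycle count, as is usual for SUPERCOP. The helper name slowdown_percent is hypothetical and not part of SUPERCOP.

```c
#include <stdint.h>

/* Times copied from the first and last rows of the table above
 * (assumed to be cycle counts). */
static const uint64_t fastest_cycles = 505530534ULL; /* clang -O3 -fwrapv -mavx -maes -mpclmul ... */
static const uint64_t slowest_cycles = 903192348ULL; /* gcc -m64 -march=core-avx2 -Os ... */

/* Hypothetical helper: slowdown of `cycles` relative to `baseline`,
 * in whole percent, rounded down. Assumes cycles >= baseline > 0. */
static uint64_t slowdown_percent(uint64_t cycles, uint64_t baseline)
{
    return (cycles - baseline) * 100 / baseline;
}
```

With the two values above, slowdown_percent(slowest_cycles, fastest_cycles) evaluates to 78, so on this machine the -Os build with -march=core-avx2 takes roughly 1.8 times as long as the fastest clang build.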

Compiler output

Implementation: aesni
Security model: unknown
Compiler: cc
haraka.c: haraka.c: In function 'haraka512_perm':
haraka.c: haraka.c:35:6: error: incompatible types when assigning to type '__m128i' from type 'int'
haraka.c: s0 = _mm_aesenc_si128(s0, *(rci)); \
haraka.c: ^
haraka.c: haraka.c:351:5: note: in expansion of macro 'AES4'
haraka.c: AES4(s[0], s[1], s[2], s[3], rc);
haraka.c: ^
haraka.c: haraka.c:36:6: error: incompatible types when assigning to type '__m128i' from type 'int'
haraka.c: s1 = _mm_aesenc_si128(s1, *(rci + 1)); \
haraka.c: ^
haraka.c: haraka.c:351:5: note: in expansion of macro 'AES4'
haraka.c: AES4(s[0], s[1], s[2], s[3], rc);
haraka.c: ^
haraka.c: haraka.c:37:6: error: incompatible types when assigning to type '__m128i' from type 'int'
haraka.c: s2 = _mm_aesenc_si128(s2, *(rci + 2)); \
haraka.c: ^
haraka.c: haraka.c:351:5: note: in expansion of macro 'AES4'
haraka.c: AES4(s[0], s[1], s[2], s[3], rc);
haraka.c: ^
haraka.c: haraka.c:38:6: error: incompatible types when assigning to type '__m128i' from type 'int'
haraka.c: s3 = _mm_aesenc_si128(s3, *(rci + 3)); \
haraka.c: ^
haraka.c: haraka.c:351:5: note: in expansion of macro 'AES4'
haraka.c: AES4(s[0], s[1], s[2], s[3], rc);
haraka.c: ^
haraka.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler Implementations
cc aesni

Compiler output

Implementation: aesni
Security model: unknown
Compiler: clang -O3 -fomit-frame-pointer -Qunused-arguments
haraka.c: haraka.c:351:5: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'haraka512_perm' that is compiled without support for 'aes'
haraka.c: AES4(s[0], s[1], s[2], s[3], rc);
haraka.c: ^
haraka.c: haraka.c:35:8: note: expanded from macro 'AES4'
haraka.c: s0 = _mm_aesenc_si128(s0, *(rci)); \
haraka.c: ^
haraka.c: haraka.c:351:5: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'haraka512_perm' that is compiled without support for 'aes'
haraka.c: haraka.c:36:8: note: expanded from macro 'AES4'
haraka.c: s1 = _mm_aesenc_si128(s1, *(rci + 1)); \
haraka.c: ^
haraka.c: haraka.c:351:5: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'haraka512_perm' that is compiled without support for 'aes'
haraka.c: haraka.c:37:8: note: expanded from macro 'AES4'
haraka.c: s2 = _mm_aesenc_si128(s2, *(rci + 2)); \
haraka.c: ^
haraka.c: haraka.c:351:5: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'haraka512_perm' that is compiled without support for 'aes'
haraka.c: haraka.c:38:8: note: expanded from macro 'AES4'
haraka.c: s3 = _mm_aesenc_si128(s3, *(rci + 3)); \
haraka.c: ^
haraka.c: haraka.c:351:5: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'haraka512_perm' that is compiled without support for 'aes'
haraka.c: haraka.c:39:8: note: expanded from macro 'AES4'
haraka.c: s0 = _mm_aesenc_si128(s0, *(rci + 4)); \
haraka.c: ^
haraka.c: haraka.c:351:5: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'haraka512_perm' that is compiled without support for 'aes'
haraka.c: haraka.c:40:8: note: expanded from macro 'AES4'
haraka.c: s1 = _mm_aesenc_si128(s1, *(rci + 5)); \
haraka.c: ...

Number of similar (compiler,implementation) pairs: 6, namely:
Compiler Implementations
clang -O3 -fomit-frame-pointer -Qunused-arguments aesni
clang -O3 -fwrapv -mavx2 -fomit-frame-pointer -Qunused-arguments aesni
clang -O3 -fwrapv -mavx -fomit-frame-pointer -Qunused-arguments aesni
clang -mcpu=cortex-a8 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments aesni
clang -mcpu=cortex-a9 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments aesni
clang -mcpu=native -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments aesni
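
The clang diagnostic above says that _mm_aesenc_si128 needs the 'aes' target feature, which the calling function was compiled without. One common way to deal with this, besides passing -maes on the command line, is to enable the feature for a single function and check for CPU support at run time. The following is a minimal sketch under that assumption. It is not how haraka.c is written, and the names aes_round_hw and aes_round_if_supported are hypothetical. It relies on the GCC/clang extensions __attribute__((target(...))) and __builtin_cpu_supports.

```c
#include <stdint.h>
#include <string.h>

#if defined(__x86_64__) || defined(__i386__)
#include <immintrin.h>

/* Hypothetical helper: one AES encryption round on a 16-byte block.
 * The target attribute enables AES-NI and SSE2 for this function only,
 * so the file can be built without -maes. */
__attribute__((target("aes,sse2")))
static void aes_round_hw(uint8_t out[16], const uint8_t in[16],
                         const uint8_t rk[16])
{
    __m128i s = _mm_loadu_si128((const __m128i *)in);
    __m128i k = _mm_loadu_si128((const __m128i *)rk);
    s = _mm_aesenc_si128(s, k);
    _mm_storeu_si128((__m128i *)out, s);
}
#endif

/* Returns 1 and writes the round output if AES-NI is usable at run time.
 * Otherwise zeroes `out` and returns 0. */
static int aes_round_if_supported(uint8_t out[16], const uint8_t in[16],
                                  const uint8_t rk[16])
{
#if defined(__x86_64__) || defined(__i386__)
    if (__builtin_cpu_supports("aes")) {
        aes_round_hw(out, in, rk);
        return 1;
    }
#endif
    (void)in;
    (void)rk;
    memset(out, 0, 16);
    return 0;
}
```

For an all-zero block and an all-zero round key, a single AES round yields 0x63 in every byte, which makes a convenient sanity check for the hardware path.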

Compiler output

Implementation: aesni
Security model: unknown
Compiler: clang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments
try.c: fatal error: error in backend: Cannot select: 0xb472f0: v4i64 = X86ISD::VTRUNC 0xb47120
try.c: 0xb47120: v16i32 = vselect 0xb23410, 0xac4dd0, 0xb46ff0
try.c: 0xb23410: v4i1 = X86ISD::PCMPGTM 0xb2d3c0, 0xac5b80
try.c: 0xb2d3c0: v4i64 = X86ISD::VBROADCAST 0xac5f10
try.c: 0xac5f10: i64,ch = load<LD8[%uglygep72]> 0xa28dd0, 0xb1d6a0, undef:i64
try.c: 0xb1d6a0: i64 = add 0xb2e2a0, 0xad6540
try.c: 0xb2e2a0: i64 = X86ISD::Wrapper TargetGlobalAddress:i64<[64 x i64]* @cycles> 0
try.c: 0xb24d40: i64 = TargetGlobalAddress<[64 x i64]* @cycles> 0
try.c: 0xad6540: i64 = shl 0xb244f0, Constant:i8<3>
try.c: 0xb244f0: i64,ch = CopyFromReg 0xa28dd0, Register:i64 %vreg50
try.c: 0xb2e760: i64 = Register %vreg50
try.c: 0xace370: i8 = Constant<3>
try.c: 0xb2b220: i64 = undef
try.c: 0xac5b80: v4i64,ch = CopyFromReg 0xa28dd0, Register:v4i64 %vreg13
try.c: 0xaeec60: v4i64 = Register %vreg13
try.c: 0xac4dd0: v16i32 = X86ISD::VBROADCAST 0xb24750
try.c: 0xb24750: i32,ch = load<LD4[ConstantPool]> 0xa28dd0, 0xb46c60, undef:i64
try.c: 0xb46c60: i64 = X86ISD::Wrapper TargetConstantPool:i64<i32 1> 0
try.c: 0xaf11b0: i64 = TargetConstantPool<i32 1> 0
try.c: 0xb2b220: i64 = undef
try.c: 0xb46ff0: v16i32 = BUILD_VECTOR Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>
try.c: 0xb46ec0: i32 = Constant<0>
try.c: 0xb46ec0: i32 = Constant<0>
try.c: 0xb46ec0: i32 = Constant<0>
try.c: 0xb46ec0: i32 = Constant<0>
try.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler Implementations
clang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments aesni

Compiler output

Implementation: aesni
Security model: unknown
Compiler: clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments
try.c: fatal error: error in backend: Cannot select: 0x9f1110: v4i64 = X86ISD::VTRUNC 0x9f0f40
try.c: 0x9f0f40: v16i32 = vselect 0x9f8120, 0x958a20, 0x9f0e10
try.c: 0x9f8120: v4i1 = X86ISD::PCMPGTM 0x9db610, 0x973160
try.c: 0x9db610: v4i64 = X86ISD::VBROADCAST 0x9734f0
try.c: 0x9734f0: i64,ch = load<LD8[%uglygep72]> 0x8d4dd0, 0x9ba400, undef:i64
try.c: 0x9ba400: i64 = add 0x9dc4f0, 0x95b610
try.c: 0x9dc4f0: i64 = X86ISD::Wrapper TargetGlobalAddress:i64<[64 x i64]* @cycles> 0
try.c: 0x9d39a0: i64 = TargetGlobalAddress<[64 x i64]* @cycles> 0
try.c: 0x95b610: i64 = shl 0x9d3150, Constant:i8<3>
try.c: 0x9d3150: i64,ch = CopyFromReg 0x8d4dd0, Register:i64 %vreg50
try.c: 0x9dc9b0: i64 = Register %vreg50
try.c: 0x9697d0: i8 = Constant<3>
try.c: 0x9d9c80: i64 = undef
try.c: 0x973160: v4i64,ch = CopyFromReg 0x8d4dd0, Register:v4i64 %vreg13
try.c: 0x99ce40: v4i64 = Register %vreg13
try.c: 0x958a20: v16i32 = X86ISD::VBROADCAST 0x9d33b0
try.c: 0x9d33b0: i32,ch = load<LD4[ConstantPool]> 0x8d4dd0, 0x9f0a80, undef:i64
try.c: 0x9f0a80: i64 = X86ISD::Wrapper TargetConstantPool:i64<i32 1> 0
try.c: 0x99a960: i64 = TargetConstantPool<i32 1> 0
try.c: 0x9d9c80: i64 = undef
try.c: 0x9f0e10: v16i32 = BUILD_VECTOR Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>
try.c: 0x9f0ce0: i32 = Constant<0>
try.c: 0x9f0ce0: i32 = Constant<0>
try.c: 0x9f0ce0: i32 = Constant<0>
try.c: 0x9f0ce0: i32 = Constant<0>
try.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler Implementations
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments aesni

Compiler output

Implementation: aesni
Security model: unknown
Compiler: gcc
haraka.c: In file included from /usr/lib64/gcc/x86_64-suse-linux/7/include/immintrin.h:39:0,
haraka.c: from haraka.c:10:
haraka.c: haraka.c: In function 'haraka512_perm':
haraka.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/wmmintrin.h:61:1: error: inlining failed in call to always_inline '_mm_aesenc_si128': target specific option mismatch
haraka.c: _mm_aesenc_si128 (__m128i __X, __m128i __Y)
haraka.c: ^~~~~~~~~~~~~~~~
haraka.c: haraka.c:42:8: note: called from here
haraka.c: s3 = _mm_aesenc_si128(s3, *(rci + 7));
haraka.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
haraka.c: haraka.c:363:5: note: in expansion of macro 'AES4'
haraka.c: AES4(s[0], s[1], s[2], s[3], rc + 32);
haraka.c: ^~~~
haraka.c: In file included from /usr/lib64/gcc/x86_64-suse-linux/7/include/immintrin.h:39:0,
haraka.c: from haraka.c:10:
haraka.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/wmmintrin.h:61:1: error: inlining failed in call to always_inline '_mm_aesenc_si128': target specific option mismatch
haraka.c: _mm_aesenc_si128 (__m128i __X, __m128i __Y)
haraka.c: ^~~~~~~~~~~~~~~~
haraka.c: haraka.c:41:8: note: called from here
haraka.c: s2 = _mm_aesenc_si128(s2, *(rci + 6)); \
haraka.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
haraka.c: haraka.c:363:5: note: in expansion of macro 'AES4'
haraka.c: AES4(s[0], s[1], s[2], s[3], rc + 32);
haraka.c: ^~~~
haraka.c: In file included from /usr/lib64/gcc/x86_64-suse-linux/7/include/immintrin.h:39:0,
haraka.c: from haraka.c:10:
haraka.c: ...

Number of similar (compiler,implementation) pairs: 86, namely:
Compiler Implementations
gcc aesni
gcc -O2 -fomit-frame-pointer aesni
gcc -O3 -fomit-frame-pointer aesni
gcc -O -fomit-frame-pointer aesni
gcc -Os -fomit-frame-pointer aesni
gcc -fno-schedule-insns -O2 -fomit-frame-pointer aesni
gcc -fno-schedule-insns -O3 -fomit-frame-pointer aesni
gcc -fno-schedule-insns -O -fomit-frame-pointer aesni
gcc -fno-schedule-insns -Os -fomit-frame-pointer aesni
gcc -funroll-loops aesni
gcc -funroll-loops -O2 -fomit-frame-pointer aesni
gcc -funroll-loops -O3 -fomit-frame-pointer aesni
gcc -funroll-loops -O -fomit-frame-pointer aesni
gcc -funroll-loops -Os -fomit-frame-pointer aesni
gcc -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer aesni
gcc -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer aesni
gcc -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer aesni
gcc -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer aesni
gcc -funroll-loops -m64 -O2 -fomit-frame-pointer aesni
gcc -funroll-loops -m64 -O3 -fomit-frame-pointer aesni
gcc -funroll-loops -m64 -O -fomit-frame-pointer aesni
gcc -funroll-loops -m64 -Os -fomit-frame-pointer aesni
gcc -funroll-loops -m64 -march=barcelona -O2 -fomit-frame-pointer aesni
gcc -funroll-loops -m64 -march=barcelona -O3 -fomit-frame-pointer aesni
gcc -funroll-loops -m64 -march=barcelona -O -fomit-frame-pointer aesni
gcc -funroll-loops -m64 -march=barcelona -Os -fomit-frame-pointer aesni
gcc -funroll-loops -m64 -march=k8 -O2 -fomit-frame-pointer aesni
gcc -funroll-loops -m64 -march=k8 -O3 -fomit-frame-pointer aesni
gcc -funroll-loops -m64 -march=k8 -O -fomit-frame-pointer aesni
gcc -funroll-loops -m64 -march=k8 -Os -fomit-frame-pointer aesni
gcc -funroll-loops -m64 -march=nocona -O2 -fomit-frame-pointer aesni
gcc -funroll-loops -m64 -march=nocona -O3 -fomit-frame-pointer aesni
gcc -funroll-loops -m64 -march=nocona -O -fomit-frame-pointer aesni
gcc -funroll-loops -m64 -march=nocona -Os -fomit-frame-pointer aesni
gcc -funroll-loops -march=barcelona -O2 -fomit-frame-pointer aesni
gcc -funroll-loops -march=barcelona -O3 -fomit-frame-pointer aesni
gcc -funroll-loops -march=barcelona -O -fomit-frame-pointer aesni
gcc -funroll-loops -march=barcelona -Os -fomit-frame-pointer aesni
gcc -funroll-loops -march=k8 -O2 -fomit-frame-pointer aesni
gcc -funroll-loops -march=k8 -O3 -fomit-frame-pointer aesni
gcc -funroll-loops -march=k8 -O -fomit-frame-pointer aesni
gcc -funroll-loops -march=k8 -Os -fomit-frame-pointer aesni
gcc -funroll-loops -march=nocona -O2 -fomit-frame-pointer aesni
gcc -funroll-loops -march=nocona -O3 -fomit-frame-pointer aesni
gcc -funroll-loops -march=nocona -O -fomit-frame-pointer aesni
gcc -funroll-loops -march=nocona -Os -fomit-frame-pointer aesni
gcc -m64 -O2 -fomit-frame-pointer aesni
gcc -m64 -O3 -fomit-frame-pointer aesni
gcc -m64 -O -fomit-frame-pointer aesni
gcc -m64 -Os -fomit-frame-pointer aesni
gcc -m64 -march=core2 -O2 -fomit-frame-pointer aesni
gcc -m64 -march=core2 -O3 -fomit-frame-pointer aesni
gcc -m64 -march=core2 -O -fomit-frame-pointer aesni
gcc -m64 -march=core2 -Os -fomit-frame-pointer aesni
gcc -m64 -march=core2 -msse4.1 -O2 -fomit-frame-pointer aesni
gcc -m64 -march=core2 -msse4.1 -O3 -fomit-frame-pointer aesni
gcc -m64 -march=core2 -msse4.1 -O -fomit-frame-pointer aesni
gcc -m64 -march=core2 -msse4.1 -Os -fomit-frame-pointer aesni
gcc -m64 -march=core2 -msse4 -O2 -fomit-frame-pointer aesni
gcc -m64 -march=core2 -msse4 -O3 -fomit-frame-pointer aesni
gcc -m64 -march=core2 -msse4 -O -fomit-frame-pointer aesni
gcc -m64 -march=core2 -msse4 -Os -fomit-frame-pointer aesni
gcc -m64 -march=corei7 -O2 -fomit-frame-pointer aesni
gcc -m64 -march=corei7 -O3 -fomit-frame-pointer aesni
gcc -m64 -march=corei7 -O -fomit-frame-pointer aesni
gcc -m64 -march=corei7 -Os -fomit-frame-pointer aesni
gcc -m64 -march=k8 -O2 -fomit-frame-pointer aesni
gcc -m64 -march=k8 -O3 -fomit-frame-pointer aesni
gcc -m64 -march=k8 -O -fomit-frame-pointer aesni
gcc -m64 -march=k8 -Os -fomit-frame-pointer aesni
gcc -m64 -march=nocona -O2 -fomit-frame-pointer aesni
gcc -m64 -march=nocona -O3 -fomit-frame-pointer aesni
gcc -m64 -march=nocona -O -fomit-frame-pointer aesni
gcc -m64 -march=nocona -Os -fomit-frame-pointer aesni
gcc -march=barcelona -O2 -fomit-frame-pointer aesni
gcc -march=barcelona -O3 -fomit-frame-pointer aesni
gcc -march=barcelona -O -fomit-frame-pointer aesni
gcc -march=barcelona -Os -fomit-frame-pointer aesni
gcc -march=k8 -O2 -fomit-frame-pointer aesni
gcc -march=k8 -O3 -fomit-frame-pointer aesni
gcc -march=k8 -O -fomit-frame-pointer aesni
gcc -march=k8 -Os -fomit-frame-pointer aesni
gcc -march=nocona -O2 -fomit-frame-pointer aesni
gcc -march=nocona -O3 -fomit-frame-pointer aesni
gcc -march=nocona -O -fomit-frame-pointer aesni
gcc -march=nocona -Os -fomit-frame-pointer aesni
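
The gcc message "target specific option mismatch" in the output above indicates that AES-NI was not enabled for the translation unit. None of the option sets in this list pass -maes, and the -march values used (k8, nocona, core2, corei7, barcelona) likely do not imply it, whereas the builds that succeeded used -maes or a -march value such as corei7-avx, core-avx-i, core-avx2 or native. A minimal sketch of a compile-time check follows, assuming the __AES__ macro that GCC and clang predefine when AES instructions are enabled. The function name built_with_aesni is hypothetical.

```c
/* Hypothetical helper: reports whether this translation unit was compiled
 * with AES-NI enabled (for example via -maes, or a -march value that
 * implies it). GCC and clang predefine __AES__ in that case. */
static int built_with_aesni(void)
{
#if defined(__AES__)
    return 1;
#else
    return 0;
#endif
}
```

A build system could use the same macro to select between an AES-NI implementation and a portable fallback instead of failing at the first intrinsic call.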

Compiler output

Implementation: aesni
Security model: unknown
Compiler: gcc -m64 -march=barcelona -O2 -fomit-frame-pointer
haraka.c: In file included from /usr/lib64/gcc/x86_64-suse-linux/7/include/immintrin.h:39:0,
haraka.c: from haraka.c:10:
haraka.c: haraka.c: In function 'haraka512_perm':
haraka.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/wmmintrin.h:61:1: error: inlining failed in call to always_inline '_mm_aesenc_si128': target specific option mismatch
haraka.c: _mm_aesenc_si128 (__m128i __X, __m128i __Y)
haraka.c: ^~~~~~~~~~~~~~~~
haraka.c: haraka.c:42:8: note: called from here
haraka.c: s3 = _mm_aesenc_si128(s3, *(rci + 7));
haraka.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
haraka.c: haraka.c:363:5: note: in expansion of macro 'AES4'
haraka.c: AES4(s[0], s[1], s[2], s[3], rc + 32);
haraka.c: ^~~~
haraka.c: In file included from /usr/lib64/gcc/x86_64-suse-linux/7/include/immintrin.h:39:0,
haraka.c: from haraka.c:10:
haraka.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/wmmintrin.h:61:1: error: inlining failed in call to always_inline '_mm_aesenc_si128': target specific option mismatch
haraka.c: _mm_aesenc_si128 (__m128i __X, __m128i __Y)
haraka.c: ^~~~~~~~~~~~~~~~
haraka.c: haraka.c:41:8: note: called from here
haraka.c: s2 = _mm_aesenc_si128(s2, *(rci + 6)); \
haraka.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
haraka.c: haraka.c:363:5: note: in expansion of macro 'AES4'
haraka.c: AES4(s[0], s[1], s[2], s[3], rc + 32);
haraka.c: ^~~~
haraka.c: In file included from /usr/lib64/gcc/x86_64-suse-linux/7/include/immintrin.h:39:0,
haraka.c: from haraka.c:10:
haraka.c: ...
haraka.c: In file included from /usr/lib64/gcc/x86_64-suse-linux/7/include/immintrin.h:39:0,
haraka.c: from haraka.c:10:
haraka.c: haraka.c: In function 'haraka512_perm':
haraka.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/wmmintrin.h:61:1: error: inlining failed in call to always_inline '_mm_aesenc_si128': target specific option mismatch
haraka.c: _mm_aesenc_si128 (__m128i __X, __m128i __Y)
haraka.c: ^~~~~~~~~~~~~~~~
haraka.c: haraka.c:42:8: note: called from here
haraka.c: s3 = _mm_aesenc_si128(s3, *(rci + 7));
haraka.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
haraka.c: haraka.c:363:5: note: in expansion of macro 'AES4'
haraka.c: AES4(s[0], s[1], s[2], s[3], rc + 32);
haraka.c: ^~~~
haraka.c: In file included from /usr/lib64/gcc/x86_64-suse-linux/7/include/immintrin.h:39:0,
haraka.c: from haraka.c:10:
haraka.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/wmmintrin.h:61:1: error: inlining failed in call to always_inline '_mm_aesenc_si128': target specific option mismatch
haraka.c: _mm_aesenc_si128 (__m128i __X, __m128i __Y)
haraka.c: ^~~~~~~~~~~~~~~~
haraka.c: haraka.c:41:8: note: called from here
haraka.c: s2 = _mm_aesenc_si128(s2, *(rci + 6)); \
haraka.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
haraka.c: haraka.c:363:5: note: in expansion of macro 'AES4'
haraka.c: AES4(s[0], s[1], s[2], s[3], rc + 32);
haraka.c: ^~~~
haraka.c: In file included from /usr/lib64/gcc/x86_64-suse-linux/7/include/immintrin.h:39:0,
haraka.c: from haraka.c:10:
haraka.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
Compiler Implementations
gcc -m64 -march=barcelona -O2 -fomit-frame-pointer aesni
gcc -m64 -march=barcelona -O3 -fomit-frame-pointer aesni
gcc -m64 -march=barcelona -O -fomit-frame-pointer aesni
gcc -m64 -march=barcelona -Os -fomit-frame-pointer aesni