Implementation notes: amd64, oki, crypto_sign/sphincsf128haraka

Computer: oki
Architecture: amd64
CPU ID: GenuineIntel-00050654-bfebfbff
SUPERCOP version: 20181123
Operation: crypto_sign
Primitive: sphincsf128haraka
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
31738000? ? ?? ? ?aesniclang_-O3_-fwrapv_-mavx_-maes_-mpclmul_-fomit-frame-pointer_-Qunused-arguments2019011420181123
31765892? ? ?? ? ?aesniclang_-O3_-fwrapv_-march=x86-64_-mcpu=core-avx2_-mavx2_-maes_-mpclmul_-fomit-frame-pointer_-Qunused-arguments2019011420181123
34403556? ? ?? ? ?aesnigcc_-m64_-march=native_-mtune=native_-O3_-fomit-frame-pointer2019011420181123
34773202? ? ?? ? ?aesnigcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv2019011420181123
35562660? ? ?? ? ?aesnigcc_-m64_-march=core-avx-i_-O3_-fomit-frame-pointer2019011420181123
35603396? ? ?? ? ?aesnigcc_-m64_-march=corei7-avx_-O3_-fomit-frame-pointer2019011420181123
36410230? ? ?? ? ?aesnigcc_-m64_-march=core-avx2_-O3_-fomit-frame-pointer2019011420181123
39802418? ? ?? ? ?aesnigcc_-m64_-march=native_-mtune=native_-O2_-fomit-frame-pointer2019011420181123
41071556? ? ?? ? ?aesnigcc_-m64_-march=core-avx-i_-O2_-fomit-frame-pointer2019011420181123
41085732? ? ?? ? ?aesnigcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv2019011420181123
41090330? ? ?? ? ?aesnigcc_-m64_-march=corei7-avx_-O2_-fomit-frame-pointer2019011420181123
42816348? ? ?? ? ?aesnigcc_-m64_-march=core-avx-i_-O_-fomit-frame-pointer2019011420181123
42884424? ? ?? ? ?aesnigcc_-m64_-march=corei7-avx_-O_-fomit-frame-pointer2019011420181123
43807550? ? ?? ? ?aesnigcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv2019011420181123
44370040? ? ?? ? ?aesnigcc_-m64_-march=core-avx2_-O_-fomit-frame-pointer2019011420181123
45017466? ? ?? ? ?aesnigcc_-m64_-march=core-avx2_-O2_-fomit-frame-pointer2019011420181123
46128374? ? ?? ? ?aesnigcc_-m64_-march=native_-mtune=native_-O_-fomit-frame-pointer2019011420181123
49575602? ? ?? ? ?aesnigcc_-m64_-march=native_-mtune=native_-Os_-fomit-frame-pointer2019011420181123
50719908? ? ?? ? ?aesnigcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv2019011420181123
52061572? ? ?? ? ?aesnigcc_-m64_-march=core-avx2_-Os_-fomit-frame-pointer2019011420181123
53298006? ? ?? ? ?aesnigcc_-m64_-march=core-avx-i_-Os_-fomit-frame-pointer2019011420181123
53364128? ? ?? ? ?aesnigcc_-m64_-march=corei7-avx_-Os_-fomit-frame-pointer2019011420181123

Compiler output

Implementation: aesni
Security model: unknown
Compiler: cc
haraka.c: haraka.c: In function 'haraka512_perm':
haraka.c: haraka.c:35:6: error: incompatible types when assigning to type '__m128i' from type 'int'
haraka.c: s0 = _mm_aesenc_si128(s0, *(rci)); \
haraka.c: ^
haraka.c: haraka.c:351:5: note: in expansion of macro 'AES4'
haraka.c: AES4(s[0], s[1], s[2], s[3], rc);
haraka.c: ^
haraka.c: haraka.c:36:6: error: incompatible types when assigning to type '__m128i' from type 'int'
haraka.c: s1 = _mm_aesenc_si128(s1, *(rci + 1)); \
haraka.c: ^
haraka.c: haraka.c:351:5: note: in expansion of macro 'AES4'
haraka.c: AES4(s[0], s[1], s[2], s[3], rc);
haraka.c: ^
haraka.c: haraka.c:37:6: error: incompatible types when assigning to type '__m128i' from type 'int'
haraka.c: s2 = _mm_aesenc_si128(s2, *(rci + 2)); \
haraka.c: ^
haraka.c: haraka.c:351:5: note: in expansion of macro 'AES4'
haraka.c: AES4(s[0], s[1], s[2], s[3], rc);
haraka.c: ^
haraka.c: haraka.c:38:6: error: incompatible types when assigning to type '__m128i' from type 'int'
haraka.c: s3 = _mm_aesenc_si128(s3, *(rci + 3)); \
haraka.c: ^
haraka.c: haraka.c:351:5: note: in expansion of macro 'AES4'
haraka.c: AES4(s[0], s[1], s[2], s[3], rc);
haraka.c: ^
haraka.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
cc aesni

Compiler output

Implementation: aesni
Security model: unknown
Compiler: clang -O3 -fomit-frame-pointer -Qunused-arguments
haraka.c: haraka.c:351:5: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'haraka512_perm' that is compiled without support for 'aes'
haraka.c: AES4(s[0], s[1], s[2], s[3], rc);
haraka.c: ^
haraka.c: haraka.c:35:8: note: expanded from macro 'AES4'
haraka.c: s0 = _mm_aesenc_si128(s0, *(rci)); \
haraka.c: ^
haraka.c: haraka.c:351:5: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'haraka512_perm' that is compiled without support for 'aes'
haraka.c: haraka.c:36:8: note: expanded from macro 'AES4'
haraka.c: s1 = _mm_aesenc_si128(s1, *(rci + 1)); \
haraka.c: ^
haraka.c: haraka.c:351:5: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'haraka512_perm' that is compiled without support for 'aes'
haraka.c: haraka.c:37:8: note: expanded from macro 'AES4'
haraka.c: s2 = _mm_aesenc_si128(s2, *(rci + 2)); \
haraka.c: ^
haraka.c: haraka.c:351:5: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'haraka512_perm' that is compiled without support for 'aes'
haraka.c: haraka.c:38:8: note: expanded from macro 'AES4'
haraka.c: s3 = _mm_aesenc_si128(s3, *(rci + 3)); \
haraka.c: ^
haraka.c: haraka.c:351:5: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'haraka512_perm' that is compiled without support for 'aes'
haraka.c: haraka.c:39:8: note: expanded from macro 'AES4'
haraka.c: s0 = _mm_aesenc_si128(s0, *(rci + 4)); \
haraka.c: ^
haraka.c: haraka.c:351:5: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'haraka512_perm' that is compiled without support for 'aes'
haraka.c: haraka.c:40:8: note: expanded from macro 'AES4'
haraka.c: s1 = _mm_aesenc_si128(s1, *(rci + 5)); \
haraka.c: ...

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
clang -O3 -fomit-frame-pointer -Qunused-arguments aesni
clang -O3 -fwrapv -mavx2 -fomit-frame-pointer -Qunused-arguments aesni
clang -O3 -fwrapv -mavx -fomit-frame-pointer -Qunused-arguments aesni
clang -mcpu=cortex-a8 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments aesni
clang -mcpu=cortex-a9 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments aesni
clang -mcpu=native -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments aesni

Compiler output

Implementation: aesni
Security model: unknown
Compiler: clang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments
try.c: fatal error: error in backend: Cannot select: 0x21367d0: v4i64 = X86ISD::VTRUNC 0x2136600
try.c: 0x2136600: v16i32 = vselect 0x213f9d0, 0x20cc290, 0x21364d0
try.c: 0x213f9d0: v4i1 = X86ISD::PCMPGTM 0x21210d0, 0x20cd040
try.c: 0x21210d0: v4i64 = X86ISD::VBROADCAST 0x20cd3d0
try.c: 0x20cd3d0: i64,ch = load<LD8[%uglygep72]> 0x201adc0, 0x210e400, undef:i64
try.c: 0x210e400: i64 = add 0x2121fb0, 0x20bfe20
try.c: 0x2121fb0: i64 = X86ISD::Wrapper TargetGlobalAddress:i64<[64 x i64]* @cycles> 0
try.c: 0x207bd00: i64 = TargetGlobalAddress<[64 x i64]* @cycles> 0
try.c: 0x20bfe20: i64 = shl 0x207b4b0, Constant:i8<3>
try.c: 0x207b4b0: i64,ch = CopyFromReg 0x201adc0, Register:i64 %vreg50
try.c: 0x2122470: i64 = Register %vreg50
try.c: 0x209c830: i8 = Constant<3>
try.c: 0x211f740: i64 = undef
try.c: 0x20cd040: v4i64,ch = CopyFromReg 0x201adc0, Register:v4i64 %vreg13
try.c: 0x20b9b20: v4i64 = Register %vreg13
try.c: 0x20cc290: v16i32 = X86ISD::VBROADCAST 0x207b710
try.c: 0x207b710: i32,ch = load<LD4[ConstantPool]> 0x201adc0, 0x2136140, undef:i64
try.c: 0x2136140: i64 = X86ISD::Wrapper TargetConstantPool:i64<i32 1> 0
try.c: 0x20c64c0: i64 = TargetConstantPool<i32 1> 0
try.c: 0x211f740: i64 = undef
try.c: 0x21364d0: v16i32 = BUILD_VECTOR Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>
try.c: 0x21363a0: i32 = Constant<0>
try.c: 0x21363a0: i32 = Constant<0>
try.c: 0x21363a0: i32 = Constant<0>
try.c: 0x21363a0: i32 = Constant<0>
try.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments aesni

Compiler output

Implementation: aesni
Security model: unknown
Compiler: clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments
try.c: fatal error: error in backend: Cannot select: 0x2178020: v4i64 = X86ISD::VTRUNC 0x2177e50
try.c: 0x2177e50: v16i32 = vselect 0x2174a90, 0x20f6210, 0x2177d20
try.c: 0x2174a90: v4i1 = X86ISD::PCMPGTM 0x215f8f0, 0x20f6fc0
try.c: 0x215f8f0: v4i64 = X86ISD::VBROADCAST 0x20f7350
try.c: 0x20f7350: i64,ch = load<LD8[%uglygep72]> 0x205ade0, 0x215c7d0, undef:i64
try.c: 0x215c7d0: i64 = add 0x21607d0, 0x20db450
try.c: 0x21607d0: i64 = X86ISD::Wrapper TargetGlobalAddress:i64<[64 x i64]* @cycles> 0
try.c: 0x2151890: i64 = TargetGlobalAddress<[64 x i64]* @cycles> 0
try.c: 0x20db450: i64 = shl 0x2151040, Constant:i8<3>
try.c: 0x2151040: i64,ch = CopyFromReg 0x205ade0, Register:i64 %vreg50
try.c: 0x2160c90: i64 = Register %vreg50
try.c: 0x2110cd0: i8 = Constant<3>
try.c: 0x21570e0: i64 = undef
try.c: 0x20f6fc0: v4i64,ch = CopyFromReg 0x205ade0, Register:v4i64 %vreg13
try.c: 0x210d270: v4i64 = Register %vreg13
try.c: 0x20f6210: v16i32 = X86ISD::VBROADCAST 0x21512a0
try.c: 0x21512a0: i32,ch = load<LD4[ConstantPool]> 0x205ade0, 0x2177990, undef:i64
try.c: 0x2177990: i64 = X86ISD::Wrapper TargetConstantPool:i64<i32 1> 0
try.c: 0x20fa9b0: i64 = TargetConstantPool<i32 1> 0
try.c: 0x21570e0: i64 = undef
try.c: 0x2177d20: v16i32 = BUILD_VECTOR Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>
try.c: 0x2177bf0: i32 = Constant<0>
try.c: 0x2177bf0: i32 = Constant<0>
try.c: 0x2177bf0: i32 = Constant<0>
try.c: 0x2177bf0: i32 = Constant<0>
try.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments aesni

Compiler output

Implementation: aesni
Security model: unknown
Compiler: gcc
haraka.c: In file included from /usr/lib64/gcc/x86_64-suse-linux/7/include/immintrin.h:39:0,
haraka.c: from haraka.c:10:
haraka.c: haraka.c: In function 'haraka512_perm':
haraka.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/wmmintrin.h:61:1: error: inlining failed in call to always_inline '_mm_aesenc_si128': target specific option mismatch
haraka.c: _mm_aesenc_si128 (__m128i __X, __m128i __Y)
haraka.c: ^~~~~~~~~~~~~~~~
haraka.c: haraka.c:42:8: note: called from here
haraka.c: s3 = _mm_aesenc_si128(s3, *(rci + 7));
haraka.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
haraka.c: haraka.c:363:5: note: in expansion of macro 'AES4'
haraka.c: AES4(s[0], s[1], s[2], s[3], rc + 32);
haraka.c: ^~~~
haraka.c: In file included from /usr/lib64/gcc/x86_64-suse-linux/7/include/immintrin.h:39:0,
haraka.c: from haraka.c:10:
haraka.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/wmmintrin.h:61:1: error: inlining failed in call to always_inline '_mm_aesenc_si128': target specific option mismatch
haraka.c: _mm_aesenc_si128 (__m128i __X, __m128i __Y)
haraka.c: ^~~~~~~~~~~~~~~~
haraka.c: haraka.c:41:8: note: called from here
haraka.c: s2 = _mm_aesenc_si128(s2, *(rci + 6)); \
haraka.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
haraka.c: haraka.c:363:5: note: in expansion of macro 'AES4'
haraka.c: AES4(s[0], s[1], s[2], s[3], rc + 32);
haraka.c: ^~~~
haraka.c: In file included from /usr/lib64/gcc/x86_64-suse-linux/7/include/immintrin.h:39:0,
haraka.c: from haraka.c:10:
haraka.c: ...

Number of similar (compiler,implementation) pairs: 86, namely:
CompilerImplementations
gcc aesni
gcc -O2 -fomit-frame-pointer aesni
gcc -O3 -fomit-frame-pointer aesni
gcc -O -fomit-frame-pointer aesni
gcc -Os -fomit-frame-pointer aesni
gcc -fno-schedule-insns -O2 -fomit-frame-pointer aesni
gcc -fno-schedule-insns -O3 -fomit-frame-pointer aesni
gcc -fno-schedule-insns -O -fomit-frame-pointer aesni
gcc -fno-schedule-insns -Os -fomit-frame-pointer aesni
gcc -funroll-loops aesni
gcc -funroll-loops -O2 -fomit-frame-pointer aesni
gcc -funroll-loops -O3 -fomit-frame-pointer aesni
gcc -funroll-loops -O -fomit-frame-pointer aesni
gcc -funroll-loops -Os -fomit-frame-pointer aesni
gcc -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer aesni
gcc -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer aesni
gcc -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer aesni
gcc -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer aesni
gcc -funroll-loops -m64 -O2 -fomit-frame-pointer aesni
gcc -funroll-loops -m64 -O3 -fomit-frame-pointer aesni
gcc -funroll-loops -m64 -O -fomit-frame-pointer aesni
gcc -funroll-loops -m64 -Os -fomit-frame-pointer aesni
gcc -funroll-loops -m64 -march=barcelona -O2 -fomit-frame-pointer aesni
gcc -funroll-loops -m64 -march=barcelona -O3 -fomit-frame-pointer aesni
gcc -funroll-loops -m64 -march=barcelona -O -fomit-frame-pointer aesni
gcc -funroll-loops -m64 -march=barcelona -Os -fomit-frame-pointer aesni
gcc -funroll-loops -m64 -march=k8 -O2 -fomit-frame-pointer aesni
gcc -funroll-loops -m64 -march=k8 -O3 -fomit-frame-pointer aesni
gcc -funroll-loops -m64 -march=k8 -O -fomit-frame-pointer aesni
gcc -funroll-loops -m64 -march=k8 -Os -fomit-frame-pointer aesni
gcc -funroll-loops -m64 -march=nocona -O2 -fomit-frame-pointer aesni
gcc -funroll-loops -m64 -march=nocona -O3 -fomit-frame-pointer aesni
gcc -funroll-loops -m64 -march=nocona -O -fomit-frame-pointer aesni
gcc -funroll-loops -m64 -march=nocona -Os -fomit-frame-pointer aesni
gcc -funroll-loops -march=barcelona -O2 -fomit-frame-pointer aesni
gcc -funroll-loops -march=barcelona -O3 -fomit-frame-pointer aesni
gcc -funroll-loops -march=barcelona -O -fomit-frame-pointer aesni
gcc -funroll-loops -march=barcelona -Os -fomit-frame-pointer aesni
gcc -funroll-loops -march=k8 -O2 -fomit-frame-pointer aesni
gcc -funroll-loops -march=k8 -O3 -fomit-frame-pointer aesni
gcc -funroll-loops -march=k8 -O -fomit-frame-pointer aesni
gcc -funroll-loops -march=k8 -Os -fomit-frame-pointer aesni
gcc -funroll-loops -march=nocona -O2 -fomit-frame-pointer aesni
gcc -funroll-loops -march=nocona -O3 -fomit-frame-pointer aesni
gcc -funroll-loops -march=nocona -O -fomit-frame-pointer aesni
gcc -funroll-loops -march=nocona -Os -fomit-frame-pointer aesni
gcc -m64 -O2 -fomit-frame-pointer aesni
gcc -m64 -O3 -fomit-frame-pointer aesni
gcc -m64 -O -fomit-frame-pointer aesni
gcc -m64 -Os -fomit-frame-pointer aesni
gcc -m64 -march=core2 -O2 -fomit-frame-pointer aesni
gcc -m64 -march=core2 -O3 -fomit-frame-pointer aesni
gcc -m64 -march=core2 -O -fomit-frame-pointer aesni
gcc -m64 -march=core2 -Os -fomit-frame-pointer aesni
gcc -m64 -march=core2 -msse4.1 -O2 -fomit-frame-pointer aesni
gcc -m64 -march=core2 -msse4.1 -O3 -fomit-frame-pointer aesni
gcc -m64 -march=core2 -msse4.1 -O -fomit-frame-pointer aesni
gcc -m64 -march=core2 -msse4.1 -Os -fomit-frame-pointer aesni
gcc -m64 -march=core2 -msse4 -O2 -fomit-frame-pointer aesni
gcc -m64 -march=core2 -msse4 -O3 -fomit-frame-pointer aesni
gcc -m64 -march=core2 -msse4 -O -fomit-frame-pointer aesni
gcc -m64 -march=core2 -msse4 -Os -fomit-frame-pointer aesni
gcc -m64 -march=corei7 -O2 -fomit-frame-pointer aesni
gcc -m64 -march=corei7 -O3 -fomit-frame-pointer aesni
gcc -m64 -march=corei7 -O -fomit-frame-pointer aesni
gcc -m64 -march=corei7 -Os -fomit-frame-pointer aesni
gcc -m64 -march=k8 -O2 -fomit-frame-pointer aesni
gcc -m64 -march=k8 -O3 -fomit-frame-pointer aesni
gcc -m64 -march=k8 -O -fomit-frame-pointer aesni
gcc -m64 -march=k8 -Os -fomit-frame-pointer aesni
gcc -m64 -march=nocona -O2 -fomit-frame-pointer aesni
gcc -m64 -march=nocona -O3 -fomit-frame-pointer aesni
gcc -m64 -march=nocona -O -fomit-frame-pointer aesni
gcc -m64 -march=nocona -Os -fomit-frame-pointer aesni
gcc -march=barcelona -O2 -fomit-frame-pointer aesni
gcc -march=barcelona -O3 -fomit-frame-pointer aesni
gcc -march=barcelona -O -fomit-frame-pointer aesni
gcc -march=barcelona -Os -fomit-frame-pointer aesni
gcc -march=k8 -O2 -fomit-frame-pointer aesni
gcc -march=k8 -O3 -fomit-frame-pointer aesni
gcc -march=k8 -O -fomit-frame-pointer aesni
gcc -march=k8 -Os -fomit-frame-pointer aesni
gcc -march=nocona -O2 -fomit-frame-pointer aesni
gcc -march=nocona -O3 -fomit-frame-pointer aesni
gcc -march=nocona -O -fomit-frame-pointer aesni
gcc -march=nocona -Os -fomit-frame-pointer aesni

Compiler output

Implementation: aesni
Security model: unknown
Compiler: gcc -m64 -march=barcelona -O2 -fomit-frame-pointer
haraka.c: In file included from /usr/lib64/gcc/x86_64-suse-linux/7/include/immintrin.h:39:0,
haraka.c: from haraka.c:10:
haraka.c: haraka.c: In function 'haraka512_perm':
haraka.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/wmmintrin.h:61:1: error: inlining failed in call to always_inline '_mm_aesenc_si128': target specific option mismatch
haraka.c: _mm_aesenc_si128 (__m128i __X, __m128i __Y)
haraka.c: ^~~~~~~~~~~~~~~~
haraka.c: haraka.c:42:8: note: called from here
haraka.c: s3 = _mm_aesenc_si128(s3, *(rci + 7));
haraka.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
haraka.c: haraka.c:363:5: note: in expansion of macro 'AES4'
haraka.c: AES4(s[0], s[1], s[2], s[3], rc + 32);
haraka.c: ^~~~
haraka.c: In file included from /usr/lib64/gcc/x86_64-suse-linux/7/include/immintrin.h:39:0,
haraka.c: from haraka.c:10:
haraka.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/wmmintrin.h:61:1: error: inlining failed in call to always_inline '_mm_aesenc_si128': target specific option mismatch
haraka.c: _mm_aesenc_si128 (__m128i __X, __m128i __Y)
haraka.c: ^~~~~~~~~~~~~~~~
haraka.c: haraka.c:41:8: note: called from here
haraka.c: s2 = _mm_aesenc_si128(s2, *(rci + 6)); \
haraka.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
haraka.c: haraka.c:363:5: note: in expansion of macro 'AES4'
haraka.c: AES4(s[0], s[1], s[2], s[3], rc + 32);
haraka.c: ^~~~
haraka.c: In file included from /usr/lib64/gcc/x86_64-suse-linux/7/include/immintrin.h:39:0,
haraka.c: from haraka.c:10:
haraka.c: ...
haraka.c: In file included from /usr/lib64/gcc/x86_64-suse-linux/7/include/immintrin.h:39:0,
haraka.c: from haraka.c:10:
haraka.c: haraka.c: In function 'haraka512_perm':
haraka.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/wmmintrin.h:61:1: error: inlining failed in call to always_inline '_mm_aesenc_si128': target specific option mismatch
haraka.c: _mm_aesenc_si128 (__m128i __X, __m128i __Y)
haraka.c: ^~~~~~~~~~~~~~~~
haraka.c: haraka.c:42:8: note: called from here
haraka.c: s3 = _mm_aesenc_si128(s3, *(rci + 7));
haraka.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
haraka.c: haraka.c:363:5: note: in expansion of macro 'AES4'
haraka.c: AES4(s[0], s[1], s[2], s[3], rc + 32);
haraka.c: ^~~~
haraka.c: In file included from /usr/lib64/gcc/x86_64-suse-linux/7/include/immintrin.h:39:0,
haraka.c: from haraka.c:10:
haraka.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/wmmintrin.h:61:1: error: inlining failed in call to always_inline '_mm_aesenc_si128': target specific option mismatch
haraka.c: _mm_aesenc_si128 (__m128i __X, __m128i __Y)
haraka.c: ^~~~~~~~~~~~~~~~
haraka.c: haraka.c:41:8: note: called from here
haraka.c: s2 = _mm_aesenc_si128(s2, *(rci + 6)); \
haraka.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
haraka.c: haraka.c:363:5: note: in expansion of macro 'AES4'
haraka.c: AES4(s[0], s[1], s[2], s[3], rc + 32);
haraka.c: ^~~~
haraka.c: In file included from /usr/lib64/gcc/x86_64-suse-linux/7/include/immintrin.h:39:0,
haraka.c: from haraka.c:10:
haraka.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -m64 -march=barcelona -O2 -fomit-frame-pointer aesni
gcc -m64 -march=barcelona -O3 -fomit-frame-pointer aesni
gcc -m64 -march=barcelona -O -fomit-frame-pointer aesni
gcc -m64 -march=barcelona -Os -fomit-frame-pointer aesni