Implementation notes: amd64, h4atom, crypto_core/aes256decrypt

Computer: h4atom
Architecture: amd64
CPU ID: GenuineIntel-000106ca-bfe9fbff
SUPERCOP version: 20160806
Operation: crypto_core
Primitive: aes256decrypt
TimeImplementationCompilerBenchmark dateSUPERCOP version
2112dolbeau/std-4rt-nodkgcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv2016081120160806
2112dolbeau/std-4rt-nodkgcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv2016081120160806
2128dolbeau/std-2rt-nodkgcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv2016081120160806
2128dolbeau/std-2rt-nodkgcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv2016081120160806
2136dolbeau/std-1rt-nodkgcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv2016081120160806
2136dolbeau/std-1rt-nodkgcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv2016081120160806
2152dolbeau/std-2rt-nodkgcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv2016081120160806
2152dolbeau/std-4rt-nodkgcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv2016081120160806
2328dolbeau/std-1rt-nodkgcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv2016081120160806
2424dolbeau/std-4rt-nodkclang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments2016081120160806
2432dolbeau/std-4rt-nodkgcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv2016081120160806
2464dolbeau/std-2rt-nodkclang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments2016081120160806
2480dolbeau/std-1rt-nodkgcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv2016081120160806
2512dolbeau/std-2rt-nodkgcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv2016081120160806
2680dolbeau/std-1rt-nodkclang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments2016081120160806
3192opensslclang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments2016081120160806
3192opensslgcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv2016081120160806
3200opensslgcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv2016081120160806
3208opensslgcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv2016081120160806
3384opensslgcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv2016081120160806
690824refgcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv2016081120160806
2009784refclang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments2016081120160806
3272448refgcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv2016081120160806
3370056refgcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv2016081120160806
3670016refgcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv2016081120160806

Compiler output

Implementation: crypto_core/aes256decrypt/dolbeau/aesenc-int
Compiler: clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments
core.c: core.c:13:34: warning: incompatible pointer types passing 'const unsigned int *' to parameter of type 'const __m128i *' [-Wincompatible-pointer-types]
core.c: __m128i key0 = _mm_loadu_si128((const unsigned int *)(key+0));
core.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~
core.c: /usr/lib/llvm-3.8/bin/../lib/clang/3.8.0/include/emmintrin.h:1112:32: note: passing argument to parameter '__p' here
core.c: _mm_loadu_si128(__m128i const *__p)
core.c: ^
core.c: core.c:14:34: warning: incompatible pointer types passing 'const unsigned int *' to parameter of type 'const __m128i *' [-Wincompatible-pointer-types]
core.c: __m128i key1 = _mm_loadu_si128((const unsigned int *)(key+16));
core.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
core.c: /usr/lib/llvm-3.8/bin/../lib/clang/3.8.0/include/emmintrin.h:1112:32: note: passing argument to parameter '__p' here
core.c: _mm_loadu_si128(__m128i const *__p)
core.c: ^
core.c: core.c:72:16: error: always_inline function '_mm_aesimc_si128' requires target feature 'aes', but would be inlined into function 'aes256ni_setkey_decrypt' that is compiled without support for 'aes'
core.c: rkeys[i] = _mm_aesimc_si128(tkeys[14-i]);
core.c: ^
core.c: core.c:43:3: error: '__builtin_ia32_aeskeygenassist128' needs target feature aes
core.c: BLOCK1(0x01);
core.c: ^
core.c: core.c:24:11: note: expanded from macro 'BLOCK1'
core.c: temp1 = _mm_aeskeygenassist_si128(temp2, IMM); \
core.c: ^
core.c: /usr/lib/llvm-3.8/bin/../lib/clang/3.8.0/include/__wmmintrin_aes.h:62:12: note: expanded from macro '_mm_aeskeygenassist_si128'
core.c: (__m128i)__builtin_ia32_aeskeygenassist128((__v2di)(__m128i)(C), (int)(R))
core.c: ^
core.c: core.c:44:3: error: '__builtin_ia32_aeskeygenassist128' needs target feature aes
core.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments dolbeau/aesenc-int

Compiler output

Implementation: crypto_core/aes256decrypt/dolbeau/aesenc-int
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv
core.c: core.c: In function 'aes256ni_setkey_encrypt':
core.c: core.c:13:34: warning: passing argument 1 of '_mm_loadu_si128' from incompatible pointer type [-Wincompatible-pointer-types]
core.c: __m128i key0 = _mm_loadu_si128((const unsigned int *)(key+0));
core.c: ^
core.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/xmmintrin.h:1249:0,
core.c: from /usr/lib/gcc/x86_64-linux-gnu/5/include/immintrin.h:29,
core.c: from core.c:9:
core.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/emmintrin.h:696:1: note: expected 'const __vector(2) long long int *' but argument is of type 'const unsigned int *'
core.c: _mm_loadu_si128 (__m128i const *__P)
core.c: ^
core.c: core.c:14:34: warning: passing argument 1 of '_mm_loadu_si128' from incompatible pointer type [-Wincompatible-pointer-types]
core.c: __m128i key1 = _mm_loadu_si128((const unsigned int *)(key+16));
core.c: ^
core.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/xmmintrin.h:1249:0,
core.c: from /usr/lib/gcc/x86_64-linux-gnu/5/include/immintrin.h:29,
core.c: from core.c:9:
core.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/emmintrin.h:696:1: note: expected 'const __vector(2) long long int *' but argument is of type 'const unsigned int *'
core.c: _mm_loadu_si128 (__m128i const *__P)
core.c: ^
core.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/immintrin.h:39:0,
core.c: from core.c:9:
core.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:87:1: error: inlining failed in call to always_inline '_mm_aeskeygenassist_si128': target specific option mismatch
core.c: _mm_aeskeygenassist_si128 (__m128i __X, const int __C)
core.c: ^
core.c: core.c:24:9: error: called from here
core.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv dolbeau/aesenc-int
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv dolbeau/aesenc-int
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv dolbeau/aesenc-int
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv dolbeau/aesenc-int