Implementation notes: amd64, katana, crypto_aead/paeq128t

Computer: katana
Architecture: amd64
CPU ID: GenuineIntel-000006f6-bfebfbff
SUPERCOP version: 20170105
Operation: crypto_aead
Primitive: paeq128t
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
3967008? ? ?? ? ?refclang++_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments2016102420161009
3971224? ? ?? ? ?refclang++_-O3_-fomit-frame-pointer_-Qunused-arguments2016102420161009
3975112? ? ?? ? ?refclang++_-mcpu=native_-mfpu=neon_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments2016102420161009
3977792? ? ?? ? ?refclang++_-mcpu=cortex-a9_-mfpu=neon_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments2016102420161009
4013848? ? ?? ? ?refclang++_-mcpu=cortex-a8_-mfpu=neon_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments2016102420161009
4014224? ? ?? ? ?refclang++_-O3_-fwrapv_-march=native_-fomit-frame-pointer_-Qunused-arguments2016102420161009
6364848? ? ?? ? ?refg++_-funroll-loops_-fno-schedule-insns_-O3_-fomit-frame-pointer2016102420161009
6418928? ? ?? ? ?refg++_-m64_-march=native_-mtune=native_-O3_-fomit-frame-pointer2016102420161009
6429960? ? ?? ? ?refg++_-m64_-march=core2_-O3_-fomit-frame-pointer2016102420161009
6451728? ? ?? ? ?refg++_-funroll-loops_-fno-schedule-insns_-O2_-fomit-frame-pointer2016102420161009
6457904? ? ?? ? ?refg++_-fno-schedule-insns_-O3_-fomit-frame-pointer2016102420161009
6463096? ? ?? ? ?refg++_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv2016102420161009
6473456? ? ?? ? ?refg++_-O3_-fomit-frame-pointer2016102420161009
6479096? ? ?? ? ?refg++_-m64_-O3_-fomit-frame-pointer2016102420161009
8882600? ? ?? ? ?refg++_-m64_-march=barcelona_-O3_-fomit-frame-pointer2016102420161009
8885056? ? ?? ? ?refg++_-m64_-march=k8_-O3_-fomit-frame-pointer2016102420161009
8889440? ? ?? ? ?refg++_-march=barcelona_-O3_-fomit-frame-pointer2016102420161009
8945488? ? ?? ? ?refg++_-march=k8_-O3_-fomit-frame-pointer2016102420161009
9056768? ? ?? ? ?refg++_-m64_-march=nocona_-O3_-fomit-frame-pointer2016102420161009
9058904? ? ?? ? ?refg++_-march=nocona_-O3_-fomit-frame-pointer2016102420161009
10123160? ? ?? ? ?refg++_-funroll-loops_-fno-schedule-insns_-O_-fomit-frame-pointer2016102420161009
13070072? ? ?? ? ?refg++_-m64_-march=nocona_-O2_-fomit-frame-pointer2016102420161009
13195952? ? ?? ? ?refg++_-march=k8_-O2_-fomit-frame-pointer2016102420161009
13201032? ? ?? ? ?refg++_-m64_-march=k8_-O2_-fomit-frame-pointer2016102420161009
13219616? ? ?? ? ?refg++_-m64_-march=core2_-msse4_-O2_-fomit-frame-pointer2016102420161009
13220216? ? ?? ? ?refg++_-m64_-march=core2_-O2_-fomit-frame-pointer2016102420161009
13227216? ? ?? ? ?refg++_-march=barcelona_-O2_-fomit-frame-pointer2016102420161009
13227400? ? ?? ? ?refg++_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv2016102420161009
13228656? ? ?? ? ?refg++_-m64_-march=native_-mtune=native_-O2_-fomit-frame-pointer2016102420161009
13230480? ? ?? ? ?refg++_-m64_-march=core2_-msse4.1_-O2_-fomit-frame-pointer2016102420161009
13233256? ? ?? ? ?refg++_-m64_-march=barcelona_-O2_-fomit-frame-pointer2016102420161009
13243136? ? ?? ? ?refg++_-m64_-O2_-fomit-frame-pointer2016102420161009
13249552? ? ?? ? ?refg++_-O2_-fomit-frame-pointer2016102420161009
13253280? ? ?? ? ?refg++_-fno-schedule-insns_-O2_-fomit-frame-pointer2016102420161009
13272224? ? ?? ? ?refg++_-m64_-march=corei7_-O2_-fomit-frame-pointer2016102420161009
13310592? ? ?? ? ?refg++_-march=nocona_-O2_-fomit-frame-pointer2016102420161009
13508656? ? ?? ? ?refg++_-march=barcelona_-O_-fomit-frame-pointer2016102420161009
13529152? ? ?? ? ?refg++_-m64_-march=barcelona_-O_-fomit-frame-pointer2016102420161009
13592336? ? ?? ? ?refg++_-m64_-march=nocona_-O_-fomit-frame-pointer2016102420161009
13594592? ? ?? ? ?refg++_-march=nocona_-O_-fomit-frame-pointer2016102420161009
13625784? ? ?? ? ?refg++_-m64_-march=core2_-msse4_-O_-fomit-frame-pointer2016102420161009
13628328? ? ?? ? ?refg++_-fno-schedule-insns_-O_-fomit-frame-pointer2016102420161009
13628424? ? ?? ? ?refg++_-m64_-O_-fomit-frame-pointer2016102420161009
13635968? ? ?? ? ?refg++_-m64_-march=core2_-msse4.1_-O_-fomit-frame-pointer2016102420161009
13639912? ? ?? ? ?refg++_-m64_-march=core2_-O_-fomit-frame-pointer2016102420161009
13640880? ? ?? ? ?refg++_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv2016102420161009
13671464? ? ?? ? ?refg++_-m64_-march=k8_-O_-fomit-frame-pointer2016102420161009
13855352? ? ?? ? ?refg++_-O_-fomit-frame-pointer2016102420161009
13861928? ? ?? ? ?refg++_-march=k8_-O_-fomit-frame-pointer2016102420161009
14002968? ? ?? ? ?refg++_-m64_-march=native_-mtune=native_-O_-fomit-frame-pointer2016102420161009
14025360? ? ?? ? ?refg++_-m64_-march=corei7_-O_-fomit-frame-pointer2016102420161009
18761496? ? ?? ? ?refg++_-funroll-loops_-fno-schedule-insns_-Os_-fomit-frame-pointer2016102420161009
19599032? ? ?? ? ?refg++_-march=nocona_-Os_-fomit-frame-pointer2016102420161009
19599448? ? ?? ? ?refg++_-m64_-march=nocona_-Os_-fomit-frame-pointer2016102420161009
20115488? ? ?? ? ?refg++_-march=k8_-Os_-fomit-frame-pointer2016102420161009
20116024? ? ?? ? ?refg++_-Os_-fomit-frame-pointer2016102420161009
20116824? ? ?? ? ?refg++_-m64_-Os_-fomit-frame-pointer2016102420161009
20118336? ? ?? ? ?refg++_-march=barcelona_-Os_-fomit-frame-pointer2016102420161009
20122248? ? ?? ? ?refg++_-m64_-march=barcelona_-Os_-fomit-frame-pointer2016102420161009
20122416? ? ?? ? ?refg++_-m64_-march=corei7_-Os_-fomit-frame-pointer2016102420161009
20122584? ? ?? ? ?refg++_-m64_-march=core2_-msse4_-Os_-fomit-frame-pointer2016102420161009
20123384? ? ?? ? ?refg++_-m64_-march=core2_-msse4.1_-Os_-fomit-frame-pointer2016102420161009
20123720? ? ?? ? ?refg++_-fno-schedule-insns_-Os_-fomit-frame-pointer2016102420161009
20124328? ? ?? ? ?refg++_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv2016102420161009
20140024? ? ?? ? ?refg++_-m64_-march=native_-mtune=native_-Os_-fomit-frame-pointer2016102420161009
20464896? ? ?? ? ?refg++_-m64_-march=core2_-Os_-fomit-frame-pointer2016102420161009
20472000? ? ?? ? ?refg++_-m64_-march=k8_-Os_-fomit-frame-pointer2016102420161009
51044768? ? ?? ? ?refg++2016102420161009

Compiler output

Implementation: crypto_aead/paeq128t/aesni
Compiler: clang++ -O3 -fomit-frame-pointer -Qunused-arguments
encrypt.cpp: encrypt.cpp:168:11: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'DoubleFPermAsm' that is compiled without support for 'aes'
encrypt.cpp: acc0 = _mm_aesenc_si128(acc0, roundkey0);
encrypt.cpp: ^
encrypt.cpp: encrypt.cpp:169:11: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'DoubleFPermAsm' that is compiled without support for 'aes'
encrypt.cpp: acc1 = _mm_aesenc_si128(acc1, roundkey1);
encrypt.cpp: ^
encrypt.cpp: encrypt.cpp:170:11: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'DoubleFPermAsm' that is compiled without support for 'aes'
encrypt.cpp: acc2 = _mm_aesenc_si128(acc2, roundkey2);
encrypt.cpp: ^
encrypt.cpp: encrypt.cpp:171:11: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'DoubleFPermAsm' that is compiled without support for 'aes'
encrypt.cpp: acc3 = _mm_aesenc_si128(acc3, roundkey3);
encrypt.cpp: ^
encrypt.cpp: encrypt.cpp:173:11: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'DoubleFPermAsm' that is compiled without support for 'aes'
encrypt.cpp: acc4 = _mm_aesenc_si128(acc4, roundkey0);
encrypt.cpp: ^
encrypt.cpp: encrypt.cpp:174:11: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'DoubleFPermAsm' that is compiled without support for 'aes'
encrypt.cpp: acc5 = _mm_aesenc_si128(acc5, roundkey1);
encrypt.cpp: ^
encrypt.cpp: encrypt.cpp:175:11: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'DoubleFPermAsm' that is compiled without support for 'aes'
encrypt.cpp: acc6 = _mm_aesenc_si128(acc6, roundkey2);
encrypt.cpp: ^
encrypt.cpp: encrypt.cpp:176:11: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'DoubleFPermAsm' that is compiled without support for 'aes'
encrypt.cpp: acc7 = _mm_aesenc_si128(acc7, roundkey3);
encrypt.cpp: ^
encrypt.cpp: 8 errors generated.

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
clang++ -O3 -fomit-frame-pointer -Qunused-arguments aesni
clang++ -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments aesni
clang++ -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments aesni
clang++ -mcpu=cortex-a8 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments aesni
clang++ -mcpu=cortex-a9 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments aesni
clang++ -mcpu=native -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments aesni

Compiler output

Implementation: crypto_aead/paeq128t/aesni
Compiler: g++
encrypt.cpp: In file included from encrypt.cpp:14:0:
encrypt.cpp: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h: In function 'void DoubleFPermAsm(const __m128i*, const __m128i*, __m128i*, __m128i*)':
encrypt.cpp: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:61:1: error: inlining failed in call to always_inline '__m128i _mm_aesenc_si128(__m128i, __m128i)': target specific option mismatch
encrypt.cpp: _mm_aesenc_si128 (__m128i __X, __m128i __Y)
encrypt.cpp: ^
encrypt.cpp: encrypt.cpp:176:44: error: called from here
encrypt.cpp: acc7 = _mm_aesenc_si128(acc7, roundkey3);
encrypt.cpp: ^
encrypt.cpp: In file included from encrypt.cpp:14:0:
encrypt.cpp: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:61:1: error: inlining failed in call to always_inline '__m128i _mm_aesenc_si128(__m128i, __m128i)': target specific option mismatch
encrypt.cpp: _mm_aesenc_si128 (__m128i __X, __m128i __Y)
encrypt.cpp: ^
encrypt.cpp: encrypt.cpp:175:44: error: called from here
encrypt.cpp: acc6 = _mm_aesenc_si128(acc6, roundkey2);
encrypt.cpp: ^
encrypt.cpp: In file included from encrypt.cpp:14:0:
encrypt.cpp: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:61:1: error: inlining failed in call to always_inline '__m128i _mm_aesenc_si128(__m128i, __m128i)': target specific option mismatch
encrypt.cpp: _mm_aesenc_si128 (__m128i __X, __m128i __Y)
encrypt.cpp: ^
encrypt.cpp: encrypt.cpp:174:44: error: called from here
encrypt.cpp: acc5 = _mm_aesenc_si128(acc5, roundkey1);
encrypt.cpp: ^
encrypt.cpp: In file included from encrypt.cpp:14:0:
encrypt.cpp: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:61:1: error: inlining failed in call to always_inline '__m128i _mm_aesenc_si128(__m128i, __m128i)': target specific option mismatch
encrypt.cpp: _mm_aesenc_si128 (__m128i __X, __m128i __Y)
encrypt.cpp: ...

Number of similar (compiler,implementation) pairs: 58, namely:
CompilerImplementations
g++ aesni
g++ -O2 -fomit-frame-pointer aesni
g++ -O3 -fomit-frame-pointer aesni
g++ -O -fomit-frame-pointer aesni
g++ -Os -fomit-frame-pointer aesni
g++ -fno-schedule-insns -O2 -fomit-frame-pointer aesni
g++ -fno-schedule-insns -O3 -fomit-frame-pointer aesni
g++ -fno-schedule-insns -O -fomit-frame-pointer aesni
g++ -fno-schedule-insns -Os -fomit-frame-pointer aesni
g++ -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer aesni
g++ -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer aesni
g++ -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer aesni
g++ -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer aesni
g++ -m64 -O2 -fomit-frame-pointer aesni
g++ -m64 -O3 -fomit-frame-pointer aesni
g++ -m64 -O -fomit-frame-pointer aesni
g++ -m64 -Os -fomit-frame-pointer aesni
g++ -m64 -march=core2 -O2 -fomit-frame-pointer aesni
g++ -m64 -march=core2 -O3 -fomit-frame-pointer aesni
g++ -m64 -march=core2 -O -fomit-frame-pointer aesni
g++ -m64 -march=core2 -Os -fomit-frame-pointer aesni
g++ -m64 -march=core2 -msse4.1 -O2 -fomit-frame-pointer aesni
g++ -m64 -march=core2 -msse4.1 -O -fomit-frame-pointer aesni
g++ -m64 -march=core2 -msse4.1 -Os -fomit-frame-pointer aesni
g++ -m64 -march=core2 -msse4 -O2 -fomit-frame-pointer aesni
g++ -m64 -march=core2 -msse4 -O -fomit-frame-pointer aesni
g++ -m64 -march=core2 -msse4 -Os -fomit-frame-pointer aesni
g++ -m64 -march=corei7 -O2 -fomit-frame-pointer aesni
g++ -m64 -march=corei7 -O -fomit-frame-pointer aesni
g++ -m64 -march=corei7 -Os -fomit-frame-pointer aesni
g++ -m64 -march=k8 -O2 -fomit-frame-pointer aesni
g++ -m64 -march=k8 -O3 -fomit-frame-pointer aesni
g++ -m64 -march=k8 -O -fomit-frame-pointer aesni
g++ -m64 -march=k8 -Os -fomit-frame-pointer aesni
g++ -m64 -march=native -mtune=native -O2 -fomit-frame-pointer aesni
g++ -m64 -march=native -mtune=native -O3 -fomit-frame-pointer aesni
g++ -m64 -march=native -mtune=native -O -fomit-frame-pointer aesni
g++ -m64 -march=native -mtune=native -Os -fomit-frame-pointer aesni
g++ -m64 -march=nocona -O2 -fomit-frame-pointer aesni
g++ -m64 -march=nocona -O3 -fomit-frame-pointer aesni
g++ -m64 -march=nocona -O -fomit-frame-pointer aesni
g++ -m64 -march=nocona -Os -fomit-frame-pointer aesni
g++ -march=barcelona -O2 -fomit-frame-pointer aesni
g++ -march=barcelona -O3 -fomit-frame-pointer aesni
g++ -march=barcelona -O -fomit-frame-pointer aesni
g++ -march=barcelona -Os -fomit-frame-pointer aesni
g++ -march=k8 -O2 -fomit-frame-pointer aesni
g++ -march=k8 -O3 -fomit-frame-pointer aesni
g++ -march=k8 -O -fomit-frame-pointer aesni
g++ -march=k8 -Os -fomit-frame-pointer aesni
g++ -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv aesni
g++ -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv aesni
g++ -march=native -mtune=native -O -fomit-frame-pointer -fwrapv aesni
g++ -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv aesni
g++ -march=nocona -O2 -fomit-frame-pointer aesni
g++ -march=nocona -O3 -fomit-frame-pointer aesni
g++ -march=nocona -O -fomit-frame-pointer aesni
g++ -march=nocona -Os -fomit-frame-pointer aesni

Compiler output

Implementation: crypto_aead/paeq128t/aesni
Compiler: g++ -m64 -march=barcelona -O2 -fomit-frame-pointer
encrypt.cpp: In file included from encrypt.cpp:14:0:
encrypt.cpp: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h: In function 'void DoubleFPermAsm(const __m128i*, const __m128i*, __m128i*, __m128i*)':
encrypt.cpp: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:61:1: error: inlining failed in call to always_inline '__m128i _mm_aesenc_si128(__m128i, __m128i)': target specific option mismatch
encrypt.cpp: _mm_aesenc_si128 (__m128i __X, __m128i __Y)
encrypt.cpp: ^
encrypt.cpp: encrypt.cpp:176:44: error: called from here
encrypt.cpp: acc7 = _mm_aesenc_si128(acc7, roundkey3);
encrypt.cpp: ^
encrypt.cpp: In file included from encrypt.cpp:14:0:
encrypt.cpp: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:61:1: error: inlining failed in call to always_inline '__m128i _mm_aesenc_si128(__m128i, __m128i)': target specific option mismatch
encrypt.cpp: _mm_aesenc_si128 (__m128i __X, __m128i __Y)
encrypt.cpp: ^
encrypt.cpp: encrypt.cpp:175:44: error: called from here
encrypt.cpp: acc6 = _mm_aesenc_si128(acc6, roundkey2);
encrypt.cpp: ^
encrypt.cpp: In file included from encrypt.cpp:14:0:
encrypt.cpp: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:61:1: error: inlining failed in call to always_inline '__m128i _mm_aesenc_si128(__m128i, __m128i)': target specific option mismatch
encrypt.cpp: _mm_aesenc_si128 (__m128i __X, __m128i __Y)
encrypt.cpp: ^
encrypt.cpp: encrypt.cpp:174:44: error: called from here
encrypt.cpp: acc5 = _mm_aesenc_si128(acc5, roundkey1);
encrypt.cpp: ^
encrypt.cpp: In file included from encrypt.cpp:14:0:
encrypt.cpp: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:61:1: error: inlining failed in call to always_inline '__m128i _mm_aesenc_si128(__m128i, __m128i)': target specific option mismatch
encrypt.cpp: _mm_aesenc_si128 (__m128i __X, __m128i __Y)
encrypt.cpp: ...
encrypt.cpp: In file included from encrypt.cpp:14:0:
encrypt.cpp: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h: In function 'void DoubleFPermAsm(const __m128i*, const __m128i*, __m128i*, __m128i*)':
encrypt.cpp: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:61:1: error: inlining failed in call to always_inline '__m128i _mm_aesenc_si128(__m128i, __m128i)': target specific option mismatch
encrypt.cpp: _mm_aesenc_si128 (__m128i __X, __m128i __Y)
encrypt.cpp: ^
encrypt.cpp: encrypt.cpp:176:44: error: called from here
encrypt.cpp: acc7 = _mm_aesenc_si128(acc7, roundkey3);
encrypt.cpp: ^
encrypt.cpp: In file included from encrypt.cpp:14:0:
encrypt.cpp: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:61:1: error: inlining failed in call to always_inline '__m128i _mm_aesenc_si128(__m128i, __m128i)': target specific option mismatch
encrypt.cpp: _mm_aesenc_si128 (__m128i __X, __m128i __Y)
encrypt.cpp: ^
encrypt.cpp: encrypt.cpp:175:44: error: called from here
encrypt.cpp: acc6 = _mm_aesenc_si128(acc6, roundkey2);
encrypt.cpp: ^
encrypt.cpp: In file included from encrypt.cpp:14:0:
encrypt.cpp: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:61:1: error: inlining failed in call to always_inline '__m128i _mm_aesenc_si128(__m128i, __m128i)': target specific option mismatch
encrypt.cpp: _mm_aesenc_si128 (__m128i __X, __m128i __Y)
encrypt.cpp: ^
encrypt.cpp: encrypt.cpp:174:44: error: called from here
encrypt.cpp: acc5 = _mm_aesenc_si128(acc5, roundkey1);
encrypt.cpp: ^
encrypt.cpp: In file included from encrypt.cpp:14:0:
encrypt.cpp: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:61:1: error: inlining failed in call to always_inline '__m128i _mm_aesenc_si128(__m128i, __m128i)': target specific option mismatch
encrypt.cpp: _mm_aesenc_si128 (__m128i __X, __m128i __Y)
encrypt.cpp: ...

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
g++ -m64 -march=barcelona -O2 -fomit-frame-pointer aesni
g++ -m64 -march=barcelona -O3 -fomit-frame-pointer aesni
g++ -m64 -march=barcelona -O -fomit-frame-pointer aesni
g++ -m64 -march=barcelona -Os -fomit-frame-pointer aesni