Implementation notes: amd64, comet, crypto_aead/aezv3

Computer: comet
Microarchitecture: amd64; Comet Lake (806ec)
Architecture: amd64
CPU ID: GenuineIntel-000806ec-bfebfbff
SUPERCOP version: 20240107
Operation: crypto_aead
Primitive: aezv3
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
310316535 0 039236 780 1088T:aesnigcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2024010620231222
31179803 0 030596 780 1088T:aesnigcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2024010620231222
311810876 0 034649 852 1088T:aesniclang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2024010620231222
312110844 0 034321 852 1056T:aesniclang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2024010620231222
31339539 0 030191 844 1088T:aesniclang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2024010620231222
314210436 0 029993 852 1024T:aesniclang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2024010620231222
32308428 0 026975 756 1056T:aesnigcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2024010620231222
327810222 0 030379 772 1088T:aesnigcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2024010620231222
78910533605 0 057537 868 1056T:refclang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2024010620231222
78987036127 0 060361 868 1088T:refclang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2024010620231222
81801238063 0 060649 868 1024T:refclang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2024010620231222
95588222118 0 043063 860 1088T:refclang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2024010620231222
99282626290 0 047268 796 1088T:refgcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2024010620231222
102707655140 0 078124 796 1088T:refgcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2024010620231222
106981023348 0 043385 868 1024T:refclang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2024010620231222
114752624700 0 045084 796 1088T:refgcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2024010620231222
140055022063 0 040735 772 1056T:refgcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2024010620231222

Compiler output

Implementation: T:aesni
Security model: timingleaks
Compiler: clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
aez_ni.c: aez_ni.c:146:22: error: '__builtin_ia32_vec_set_v16qi' needs target feature sse4.1
aez_ni.c: __m128i i1 = _mm_insert_epi8(zero, 1, 7);
aez_ni.c: ^
aez_ni.c: /usr/lib/llvm-14/lib/clang/14.0.6/include/smmintrin.h:930:13: note: expanded from macro '_mm_insert_epi8'
aez_ni.c: ((__m128i)__builtin_ia32_vec_set_v16qi((__v16qi)(__m128i)(X), \
aez_ni.c: ^
aez_ni.c: aez_ni.c:147:22: error: '__builtin_ia32_vec_set_v16qi' needs target feature sse4.1
aez_ni.c: __m128i i2 = _mm_insert_epi8(zero, 2, 7);
aez_ni.c: ^
aez_ni.c: /usr/lib/llvm-14/lib/clang/14.0.6/include/smmintrin.h:930:13: note: expanded from macro '_mm_insert_epi8'
aez_ni.c: ((__m128i)__builtin_ia32_vec_set_v16qi((__v16qi)(__m128i)(X), \
aez_ni.c: ^
aez_ni.c: aez_ni.c:148:22: error: '__builtin_ia32_vec_set_v16qi' needs target feature sse4.1
aez_ni.c: __m128i i3 = _mm_insert_epi8(zero, 3, 7);
aez_ni.c: ^
aez_ni.c: /usr/lib/llvm-14/lib/clang/14.0.6/include/smmintrin.h:930:13: note: expanded from macro '_mm_insert_epi8'
aez_ni.c: ((__m128i)__builtin_ia32_vec_set_v16qi((__v16qi)(__m128i)(X), \
aez_ni.c: ^
aez_ni.c: aez_ni.c:149:26: error: '__builtin_ia32_vec_set_v16qi' needs target feature sse4.1
aez_ni.c: __m128i j, one = _mm_insert_epi8(zero, 1, 15);
aez_ni.c: ^
aez_ni.c: /usr/lib/llvm-14/lib/clang/14.0.6/include/smmintrin.h:930:13: note: expanded from macro '_mm_insert_epi8'
aez_ni.c: ((__m128i)__builtin_ia32_vec_set_v16qi((__v16qi)(__m128i)(X), \
aez_ni.c: ^
aez_ni.c: 4 errors generated.

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:aesni

Compiler output

Implementation: T:aesni
Security model: timingleaks
Compiler: gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE
aez_ni.c: In function 'load_partial',
aez_ni.c: inlined from 'load_partial' at aez_ni.c:119:16,
aez_ni.c: inlined from 'cipher_aez_tiny' at aez_ni.c:498:18,
aez_ni.c: inlined from 'aez_encrypt' at aez_ni.c:588:9,
aez_ni.c: inlined from 'crypto_aead_aezv3_aesni_timingleaks_encrypt' at aez_ni.c:637:5:
aez_ni.c: aez_ni.c:123:46: warning: '__builtin_memcpy' writing between 4294967264 and 4294967279 bytes into a region of size 16 overflows the destination [-Wstringop-overflow=]
aez_ni.c: 123 | for (i=0; i<n; i++) ((char*)&tmp)[i] = ((char*)p)[i];
aez_ni.c: | ~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~
aez_ni.c: aez_ni.c: In function 'crypto_aead_aezv3_aesni_timingleaks_encrypt':
aez_ni.c: aez_ni.c:122:17: note: destination object 'tmp' of size 16
aez_ni.c: 122 | __m128i tmp; unsigned i;
aez_ni.c: | ^~~

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:aesni