Implementation notes: amd64, shoe, crypto_aead/aezv3

Computer: shoe
Microarchitecture: amd64; Broadwell+AES (306d4)
Architecture: amd64
CPU ID: GenuineIntel-000306d4-bfebfbff
SUPERCOP version: 20240716
Operation: crypto_aead
Primitive: aezv3
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
380016542 0 039072 780 1080T:aesnigcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062920240625
380810944 0 033600 812 1048T:aesniclang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024062920240625
38129762 0 030536 780 1080T:aesnigcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062920240625
382110992 0 033744 812 1048T:aesniclang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024062920240625
38289628 0 029070 804 1016T:aesniclang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024062920240625
38408352 0 027211 756 1048T:aesnigcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062920240625
394410352 0 030719 772 1080T:aesnigcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062920240625
1324815293 0 035126 804 1016T:aesniclang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024062920240625
79764231951 0 055208 828 1048T:refclang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024062920240625
80292229493 0 052632 828 1048T:refclang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024062920240625
101847722221 0 041982 820 1016T:refclang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024062920240625
105575341884 0 064624 796 1080T:refgcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062920240625
115999722748 0 042734 820 1016T:refclang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024062920240625
118555432516 0 055016 828 1016T:refclang_-mcpu=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024062920240625
118596925818 0 046776 796 1080T:refgcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062920240625
118708524954 0 045552 796 1080T:refgcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062920240625
134523022030 0 041019 772 1048T:refgcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062920240625

Compiler output


aez_ni.c: aez_ni.c:146:22: error: '__builtin_ia32_vec_set_v16qi' needs target feature sse4.1
aez_ni.c:         __m128i i1 = _mm_insert_epi8(zero, 1, 7);
aez_ni.c:                      ^
aez_ni.c: /usr/lib/llvm-11/lib/clang/11.0.1/include/smmintrin.h:928:12: note: expanded from macro '_mm_insert_epi8'
aez_ni.c:   (__m128i)__builtin_ia32_vec_set_v16qi((__v16qi)(__m128i)(X), \
aez_ni.c:            ^
aez_ni.c: aez_ni.c:147:22: error: '__builtin_ia32_vec_set_v16qi' needs target feature sse4.1
aez_ni.c:         __m128i i2 = _mm_insert_epi8(zero, 2, 7);
aez_ni.c:                      ^
aez_ni.c: /usr/lib/llvm-11/lib/clang/11.0.1/include/smmintrin.h:928:12: note: expanded from macro '_mm_insert_epi8'
aez_ni.c:   (__m128i)__builtin_ia32_vec_set_v16qi((__v16qi)(__m128i)(X), \
aez_ni.c:            ^
aez_ni.c: aez_ni.c:148:22: error: '__builtin_ia32_vec_set_v16qi' needs target feature sse4.1
aez_ni.c:         __m128i i3 = _mm_insert_epi8(zero, 3, 7);
aez_ni.c:                      ^
aez_ni.c: /usr/lib/llvm-11/lib/clang/11.0.1/include/smmintrin.h:928:12: note: expanded from macro '_mm_insert_epi8'
aez_ni.c:   (__m128i)__builtin_ia32_vec_set_v16qi((__v16qi)(__m128i)(X), \
aez_ni.c:            ^
aez_ni.c: aez_ni.c:149:26: error: '__builtin_ia32_vec_set_v16qi' needs target feature sse4.1
aez_ni.c:         __m128i j, one = _mm_insert_epi8(zero, 1, 15);
aez_ni.c:                          ^
aez_ni.c: /usr/lib/llvm-11/lib/clang/11.0.1/include/smmintrin.h:928:12: note: expanded from macro '_mm_insert_epi8'
aez_ni.c:   (__m128i)__builtin_ia32_vec_set_v16qi((__v16qi)(__m128i)(X), \
aez_ni.c:            ^
aez_ni.c: 4 errors generated.

Number of similar (implementation,compiler) pairs: 1, namely:
ImplementationCompiler
T:aesniclang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)