Implementation notes: amd64, h3neo, crypto_aead/aezv3

Computer: h3neo
Microarchitecture: amd64; K10 45nm (100f63)
Architecture: amd64
CPU ID: AuthenticAMD-00100f63-078bfbff
SUPERCOP version: 20240625
Operation: crypto_aead
Primitive: aezv3
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
138198537288 0 059195 860 1024T:refclang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024070320240625
139543838063 0 060563 860 1024T:refclang_-mcpu=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024070320240625
140599735142 0 056147 860 1024T:refclang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024070320240625
143312449461 0 073276 828 1088T:refgcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024070320240625
169276321818 0 041253 852 1024T:refclang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024070320240625
203170923252 0 043211 860 1024T:refclang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024070320240625
206970327140 0 049564 828 1088T:refgcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024070320240625
230850125554 0 046996 828 1088T:refgcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024070320240625
271957722943 0 042655 804 1056T:refgcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024070320240625

Compiler output


aez_ni.c: aez_ni.c:146:22: error: '__builtin_ia32_vec_set_v16qi' needs target feature sse4.1
aez_ni.c:         __m128i i1 = _mm_insert_epi8(zero, 1, 7);
aez_ni.c:                      ^
aez_ni.c: /usr/lib/llvm-14/lib/clang/14.0.0/include/smmintrin.h:930:13: note: expanded from macro '_mm_insert_epi8'
aez_ni.c:   ((__m128i)__builtin_ia32_vec_set_v16qi((__v16qi)(__m128i)(X), \
aez_ni.c:             ^
aez_ni.c: aez_ni.c:147:22: error: '__builtin_ia32_vec_set_v16qi' needs target feature sse4.1
aez_ni.c:         __m128i i2 = _mm_insert_epi8(zero, 2, 7);
aez_ni.c:                      ^
aez_ni.c: /usr/lib/llvm-14/lib/clang/14.0.0/include/smmintrin.h:930:13: note: expanded from macro '_mm_insert_epi8'
aez_ni.c:   ((__m128i)__builtin_ia32_vec_set_v16qi((__v16qi)(__m128i)(X), \
aez_ni.c:             ^
aez_ni.c: aez_ni.c:148:22: error: '__builtin_ia32_vec_set_v16qi' needs target feature sse4.1
aez_ni.c:         __m128i i3 = _mm_insert_epi8(zero, 3, 7);
aez_ni.c:                      ^
aez_ni.c: /usr/lib/llvm-14/lib/clang/14.0.0/include/smmintrin.h:930:13: note: expanded from macro '_mm_insert_epi8'
aez_ni.c:   ((__m128i)__builtin_ia32_vec_set_v16qi((__v16qi)(__m128i)(X), \
aez_ni.c:             ^
aez_ni.c: aez_ni.c:149:26: error: '__builtin_ia32_vec_set_v16qi' needs target feature sse4.1
aez_ni.c:         __m128i j, one = _mm_insert_epi8(zero, 1, 15);
aez_ni.c:                          ^
aez_ni.c: /usr/lib/llvm-14/lib/clang/14.0.0/include/smmintrin.h:930:13: note: expanded from macro '_mm_insert_epi8'
aez_ni.c:   ((__m128i)__builtin_ia32_vec_set_v16qi((__v16qi)(__m128i)(X), \
aez_ni.c:             ^
aez_ni.c: 4 errors generated.

Number of similar (implementation,compiler) pairs: 5, namely:
ImplementationCompiler
T:aesniclang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:aesniclang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:aesniclang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:aesniclang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:aesniclang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)

Compiler output


aez_ni.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/smmintrin.h:32,
aez_ni.c:                  from aez_ni.c:38:
aez_ni.c: aez_ni.c: In function 'bswap16':
aez_ni.c: /usr/lib/gcc/x86_64-linux-gnu/11/include/tmmintrin.h:136:1: error: inlining failed in call to 'always_inline' '_mm_shuffle_epi8': target specific option mismatch
aez_ni.c:   136 | _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
aez_ni.c:       | ^~~~~~~~~~~~~~~~
aez_ni.c: aez_ni.c:76:12: note: called from here
aez_ni.c:    76 |     return _mm_shuffle_epi8(
aez_ni.c:       |            ^~~~~~~~~~~~~~~~~
aez_ni.c:    77 |         b,_mm_set_epi8(0,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15)
aez_ni.c:       |         ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
aez_ni.c:    78 |     );
aez_ni.c:       |     ~
aez_ni.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/smmintrin.h:32,
aez_ni.c:                  from aez_ni.c:38:
aez_ni.c: /usr/lib/gcc/x86_64-linux-gnu/11/include/tmmintrin.h:136:1: error: inlining failed in call to 'always_inline' '_mm_shuffle_epi8': target specific option mismatch
aez_ni.c:   136 | _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
aez_ni.c:       | ^~~~~~~~~~~~~~~~
aez_ni.c: aez_ni.c:76:12: note: called from here
aez_ni.c:    76 |     return _mm_shuffle_epi8(
aez_ni.c:       |            ^~~~~~~~~~~~~~~~~
aez_ni.c:    77 |         b,_mm_set_epi8(0,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15)
aez_ni.c:       |         ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
aez_ni.c:    78 |     );
aez_ni.c:       |     ~

Number of similar (implementation,compiler) pairs: 4, namely:
ImplementationCompiler
T:aesnigcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:aesnigcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:aesnigcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:aesnigcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)