Implementation notes: amd64, bolero, crypto_encode/653xfreeze3

Computer: bolero
Microarchitecture: amd64; BW+AES (406f1)
Architecture: amd64
CPU ID: GenuineIntel-000406f1-bfebfbff
SUPERCOP version: 20221122
Operation: crypto_encode
Primitive: 653xfreeze3
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
140339 0 013089 776 744avxclang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020081920200816
144339 0 013089 776 744avxclang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020081920200816
144355 0 013137 776 744avxclang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020081920200816
176187 0 09135 768 728avxclang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020081920200816
180173 0 013558 776 800avxgcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020081920200816
184166 0 010293 768 800avxgcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020081920200816
196167 0 09489 752 768avxgcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020081920200816
5041655 0 015046 776 800refgcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020081920200816
544638 0 013345 776 744refclang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020081920200816
544638 0 013409 776 744refclang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020081920200816
544638 0 013345 776 744refclang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020081920200816
600173 0 010357 768 800avxgcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020081920200816
1256647 0 012689 776 728refclang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020081920200816
151696 0 09039 768 728refclang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020081920200816
179295 0 09393 752 768refgcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020081920200816
2120106 0 010261 768 800refgcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020081920200816
2268104 0 010197 768 800refgcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020081920200816

Compiler output

Implementation: avx
Security model: constbranchindex
Compiler: clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
encode.c: encode.c:15:19: error: always_inline function '_mm256_loadu_si256' requires target feature 'sse4.2', but would be inlined into function 'crypto_encode_653xfreeze3_avx_constbranchindex' that is compiled without support for 'sse4.2'
encode.c: __m256i x = _mm256_loadu_si256((__m256i *) r);
encode.c: ^
encode.c: encode.c:16:19: error: always_inline function '_mm256_mulhrs_epi16' requires target feature 'avx2', but would be inlined into function 'crypto_encode_653xfreeze3_avx_constbranchindex' that is compiled without support for 'avx2'
encode.c: __m256i y = _mm256_mulhrs_epi16(x,_mm256_set1_epi16(10923));
encode.c: ^
encode.c: encode.c:16:41: error: always_inline function '_mm256_set1_epi16' requires target feature 'sse4.2', but would be inlined into function 'crypto_encode_653xfreeze3_avx_constbranchindex' that is compiled without support for 'sse4.2'
encode.c: __m256i y = _mm256_mulhrs_epi16(x,_mm256_set1_epi16(10923));
encode.c: ^
encode.c: encode.c:17:11: error: always_inline function '_mm256_sub_epi16' requires target feature 'avx2', but would be inlined into function 'crypto_encode_653xfreeze3_avx_constbranchindex' that is compiled without support for 'avx2'
encode.c: x = _mm256_sub_epi16(x,y);
encode.c: ^
encode.c: encode.c:18:11: error: always_inline function '_mm256_add_epi16' requires target feature 'avx2', but would be inlined into function 'crypto_encode_653xfreeze3_avx_constbranchindex' that is compiled without support for 'avx2'
encode.c: y = _mm256_add_epi16(y,y);
encode.c: ^
encode.c: encode.c:19:11: error: always_inline function '_mm256_sub_epi16' requires target feature 'avx2', but would be inlined into function 'crypto_encode_653xfreeze3_avx_constbranchindex' that is compiled without support for 'avx2'
encode.c: x = _mm256_sub_epi16(x,y);
encode.c: ^
encode.c: encode.c:20:20: error: always_inline function '_mm256_undefined_si256' requires target feature 'sse4.2', but would be inlined into function 'crypto_encode_653xfreeze3_avx_constbranchindex' that is compiled without support for 'sse4.2'
encode.c: __m128i x0 = _mm256_extractf128_si256(x,0);
encode.c: ^
encode.c: /usr/lib/llvm-6.0/lib/clang/6.0.0/include/avxintrin.h:4857:14: note: expanded from macro '_mm256_extractf128_si256'
encode.c: (__v4di)(_mm256_undefined_si256()), \
encode.c: ^
encode.c: encode.c:21:20: error: always_inline function '_mm256_undefined_si256' requires target feature 'sse4.2', but would be inlined into function 'crypto_encode_653xfreeze3_avx_constbranchindex' that is compiled without support for 'sse4.2'
encode.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE avx