Implementation notes: amd64, colossus6, crypto_decode/256x2

Computer: colossus6
Architecture: amd64
CPU ID: AuthenticAMD-00830f10-178bfbff
SUPERCOP version: 20210125
Operation: crypto_decode
Primitive: 256x2
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
23512 0 012576 792 744avxclang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021031020210125
23512 0 013048 792 744avxclang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021031020210125
23512 0 012576 792 744avxclang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021031020210125
23226 0 010493 768 800avxgcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021031020210125
23478 0 012934 776 800avxgcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021031020210125
23224 0 010341 768 800avxgcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021031020210125
23213 0 09537 752 768avxgcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021031020210125
45149 0 09294 784 728avxclang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021031020210125
450130 0 012840 792 728refclang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021031020210125
540104 0 012176 792 744refclang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021031020210125
540104 0 012648 792 744refclang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021031020210125
562104 0 012176 792 744refclang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021031020210125
562104 0 010333 768 800refgcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021031020210125
562104 0 012550 776 800refgcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021031020210125
56390 0 09230 784 728refclang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021031020210125
56393 0 010181 768 800refgcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021031020210125
58590 0 09377 752 768refgcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021031020210125

Compiler output

Implementation: avx
Security model: constbranchindex
Compiler: clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
decode.c: decode.c:16:17: error: always_inline function '_mm256_set1_epi32' requires target feature 'sse4.2', but would be inlined into function 'crypto_decode_256x2_avx_constbranchindex' that is compiled without support for 'sse4.2'
decode.c: __m256i x = _mm256_set1_epi32(*(int32_t *) s);
decode.c: ^
decode.c: decode.c:18:9: error: always_inline function '_mm256_shuffle_epi8' requires target feature 'avx2', but would be inlined into function 'crypto_decode_256x2_avx_constbranchindex' that is compiled without support for 'avx2'
decode.c: x = _mm256_shuffle_epi8(x,COPY);
decode.c: ^
decode.c: decode.c:18:31: error: always_inline function '_mm256_set_epi64x' requires target feature 'sse4.2', but would be inlined into function 'crypto_decode_256x2_avx_constbranchindex' that is compiled without support for 'sse4.2'
decode.c: x = _mm256_shuffle_epi8(x,COPY);
decode.c: ^
decode.c: decode.c:5:14: note: expanded from macro 'COPY'
decode.c: #define COPY _mm256_set_epi64x(0x0303030303030303,0x0202020202020202,0x0101010101010101,0x0000000000000000)
decode.c: ^
decode.c: decode.c:20:9: error: always_inline function '_mm256_andnot_si256' requires target feature 'avx2', but would be inlined into function 'crypto_decode_256x2_avx_constbranchindex' that is compiled without support for 'avx2'
decode.c: x = _mm256_andnot_si256(x,MASK);
decode.c: ^
decode.c: decode.c:20:31: error: always_inline function '_mm256_set1_epi64x' requires target feature 'sse4.2', but would be inlined into function 'crypto_decode_256x2_avx_constbranchindex' that is compiled without support for 'sse4.2'
decode.c: x = _mm256_andnot_si256(x,MASK);
decode.c: ^
decode.c: decode.c:6:14: note: expanded from macro 'MASK'
decode.c: #define MASK _mm256_set1_epi64x(0x8040201008040201)
decode.c: ^
decode.c: decode.c:21:9: error: always_inline function '_mm256_cmpeq_epi8' requires target feature 'avx2', but would be inlined into function 'crypto_decode_256x2_avx_constbranchindex' that is compiled without support for 'avx2'
decode.c: x = _mm256_cmpeq_epi8(x,_mm256_setzero_si256());
decode.c: ^
decode.c: decode.c:21:29: error: always_inline function '_mm256_setzero_si256' requires target feature 'sse4.2', but would be inlined into function 'crypto_decode_256x2_avx_constbranchindex' that is compiled without support for 'sse4.2'
decode.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE avx