Implementation notes: amd64, intelnuci8, crypto_core/invhrss701

Computer: intelnuci8
Architecture: amd64
CPU ID: GenuineIntel-000906e9-bfebfbff
SUPERCOP version: 20191221
Operation: crypto_core
Primitive: invhrss701
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
8866210545 0 023737 776 776faster821clang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2019120620191017
8867410545 0 023737 776 776faster821clang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2019120620191017
8870210545 0 023737 776 776faster821clang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2019120620191017
8870310545 0 020183 768 760faster821clang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2019120620191017
8870410545 0 023001 776 760faster821clang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2019120620191017
9475210622 0 024806 776 832fastergcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2019120620191017
10172513087 0 026305 776 776fasterclang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2019120620191017
10184013087 0 026305 776 776fasterclang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2019120620191017
11085214379 0 027601 776 776fasterclang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2019120620191017
1140757321 0 016983 768 760fasterclang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2019120620191017
1244618189 0 019013 768 832fastergcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2019120620191017
1334597659 0 018445 768 832fastergcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2019120620191017
14532428168 0 041393 776 776avx2clang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2019120620191017
14533827387 0 041574 776 832avx2gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2019120620191017
14533927144 0 040369 776 776avx2clang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2019120620191017
14536727144 0 040369 776 776avx2clang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2019120620191017
14560727339 0 039825 776 760avx2clang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2019120620191017
14664626793 0 036447 768 760avx2clang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2019120620191017
14688126859 0 037653 768 832avx2gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2019120620191017
14689126874 0 037621 768 832avx2gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2019120620191017
14937526857 0 036673 752 800avx2gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2019120620191017
1879594921 0 014801 752 800fastergcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2019120620191017
24712119067 0 023262 776 832refgcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2019120620191017
27752202999 0 017288 784 832simplergcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2019120620191017
27793568000 0 021209 776 776refclang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2019120620191017
35429916320 0 019529 776 776refclang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2019120620191017
35519376320 0 019529 776 776refclang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2019120620191017
39257387199 0 019673 776 760refclang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2019120620191017
94261814786 0 017985 776 776simplerclang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2019120620191017
94392374786 0 017985 776 776simplerclang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2019120620191017
95086365490 0 018721 776 776simplerclang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2019120620191017
95554634656 0 017129 776 760simplerclang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2019120620191017
17842443922 0 011725 768 832simplergcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2019120620191017
19342573909 0 011661 768 832simplergcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2019120620191017
281310561428 0 012197 768 832refgcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2019120620191017
286800221360 0 012085 768 832refgcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2019120620191017
298444661459 0 011111 768 760refclang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2019120620191017
35456084718 0 010375 768 760simplerclang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2019120620191017
382276451255 0 011041 752 800refgcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2019120620191017
45498575758 0 010545 752 800simplergcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2019120620191017

Test failure

Implementation: faster821
Security model: unknown
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE
error 111
./try: Symbol `memcpy' causes overflow in R_X86_64_PC32 relocation

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE faster821
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE faster821
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE faster821
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE faster821

Compiler output

Implementation: faster
Security model: unknown
Compiler: clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
core.c: core.c:353:19: error: always_inline function '_mm256_set_epi32' requires target feature 'sse4.2', but would be inlined into function 'r3_recip' that is compiled without support for 'sse4.2'
core.c: F0[0] = F0[1] = _mm256_set_epi32(-1,-1,-1,-1,-1,-1,-1,-1);
core.c: ^
core.c: core.c:354:11: error: always_inline function '_mm256_set_epi32' requires target feature 'sse4.2', but would be inlined into function 'r3_recip' that is compiled without support for 'sse4.2'
core.c: F0[2] = _mm256_set_epi32(32767,-1,32767,-1,32767,-1,65535,-1);
core.c: ^
core.c: core.c:355:27: error: always_inline function '_mm256_set1_epi32' requires target feature 'sse4.2', but would be inlined into function 'r3_recip' that is compiled without support for 'sse4.2'
core.c: F1[0] = F1[1] = F1[2] = _mm256_set1_epi32(0);
core.c: ^
core.c: core.c:357:27: error: always_inline function '_mm256_set1_epi32' requires target feature 'sse4.2', but would be inlined into function 'r3_recip' that is compiled without support for 'sse4.2'
core.c: V0[2] = V0[1] = V0[0] = _mm256_set1_epi32(0);
core.c: ^
core.c: core.c:358:27: error: always_inline function '_mm256_set1_epi32' requires target feature 'sse4.2', but would be inlined into function 'r3_recip' that is compiled without support for 'sse4.2'
core.c: V1[2] = V1[1] = V1[0] = _mm256_set1_epi32(0);
core.c: ^
core.c: core.c:360:11: error: always_inline function '_mm256_set_epi32' requires target feature 'sse4.2', but would be inlined into function 'r3_recip' that is compiled without support for 'sse4.2'
core.c: R0[0] = _mm256_set_epi32(0,0,0,0,0,0,0,1);
core.c: ^
core.c: core.c:361:19: error: always_inline function '_mm256_set1_epi32' requires target feature 'sse4.2', but would be inlined into function 'r3_recip' that is compiled without support for 'sse4.2'
core.c: R0[2] = R0[1] = _mm256_set1_epi32(0);
core.c: ^
core.c: core.c:362:27: error: always_inline function '_mm256_set1_epi32' requires target feature 'sse4.2', but would be inlined into function 'r3_recip' that is compiled without support for 'sse4.2'
core.c: R1[2] = R1[1] = R1[0] = _mm256_set1_epi32(0);
core.c: ^
core.c: core.c:376:15: error: always_inline function '_mm256_set1_epi32' requires target feature 'sse4.2', but would be inlined into function 'r3_recip' that is compiled without support for 'sse4.2'
core.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE faster