Implementation notes: amd64, gcc67, crypto_core/invhrss701

Computer: gcc67
Architecture: amd64
CPU ID: AuthenticAMD-00810f10-178bfbff
SUPERCOP version: 20210125
Operation: crypto_core
Primitive: invhrss701
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
13407410149 0 021197 812 912faster921clang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021011520210114
13464910149 0 021197 828 944faster921gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021011520210114
13474610149 0 022181 836 944faster921gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021011520210114
13478110149 0 022317 836 944faster921gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021011520210114
13493710149 0 022779 820 944faster921clang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021011520210114
1450839117 0 023246 844 944fastergcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021011520210114
14945010149 0 023683 820 944faster921clang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021011520210114
15887211004 0 023659 820 944fasterclang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021011520210114
15916311004 0 024563 820 944fasterclang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021011520210114
15916311004 0 023659 820 944fasterclang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021011520210114
1790217605 0 018669 812 912fasterclang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021011520210114
1826907354 0 019405 836 944fastergcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021011520210114
1868577605 0 019805 836 944fastergcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021011520210114
2366415168 0 016233 820 944fastergcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021011520210114
26680210149 0 022779 820 944faster921clang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021011520210114
26754610149 0 024286 844 944faster921gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021011520210114
26841610149 0 023155 820 912faster921clang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021011520210114
27065127215 0 040747 820 944avx2clang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021011520210114
27065127215 0 039843 820 944avx2clang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021011520210114
27231226831 0 038853 836 944avx2gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021011520210114
27260426788 0 037861 812 912avx2clang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021011520210114
27279727087 0 041198 844 944avx2gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021011520210114
27320926828 0 038989 836 944avx2gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021011520210114
27515226825 0 037801 820 944avx2gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021011520210114
28392227056 0 040091 820 912avx2clang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021011520210114
41865327215 0 039843 820 944avx2clang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021011520210114
38250355944 0 020128 852 944simplergcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021011520210114
39392765066 0 019166 844 944refgcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021011520210114
132628523847 0 017387 820 944refclang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021011520210114
132721163847 0 016483 820 944refclang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021011520210114
132758923847 0 016483 820 944refclang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021011520210114
147341334818 0 017859 820 912refclang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021011520210114
155153694224 0 017251 820 912simplerclang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021011520210114
164974703647 0 016307 820 944simplerclang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021011520210114
165076703647 0 017211 820 944simplerclang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021011520210114
165568963647 0 016307 820 944simplerclang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021011520210114
19448063870 0 013029 836 944simplergcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021011520210114
22001445811 0 012829 836 944simplergcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021011520210114
336523411402 0 013549 836 944refgcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021011520210114
370743241383 0 012329 820 944refgcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021011520210114
40407897767 0 011813 812 912simplerclang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021011520210114
404672911497 0 012557 812 912refclang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021011520210114
423115491364 0 013365 836 944refgcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021011520210114
45615461806 0 011761 820 944simplergcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021011520210114

Compiler output

Implementation: faster
Security model: constbranchindex
Compiler: clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
core.c: core.c:353:19: error: always_inline function '_mm256_set_epi32' requires target feature 'sse4.2', but would be inlined into function 'r3_recip' that is compiled without support for 'sse4.2'
core.c: F0[0] = F0[1] = _mm256_set_epi32(-1,-1,-1,-1,-1,-1,-1,-1);
core.c: ^
core.c: core.c:354:11: error: always_inline function '_mm256_set_epi32' requires target feature 'sse4.2', but would be inlined into function 'r3_recip' that is compiled without support for 'sse4.2'
core.c: F0[2] = _mm256_set_epi32(32767,-1,32767,-1,32767,-1,65535,-1);
core.c: ^
core.c: core.c:355:27: error: always_inline function '_mm256_set1_epi32' requires target feature 'sse4.2', but would be inlined into function 'r3_recip' that is compiled without support for 'sse4.2'
core.c: F1[0] = F1[1] = F1[2] = _mm256_set1_epi32(0);
core.c: ^
core.c: core.c:357:27: error: always_inline function '_mm256_set1_epi32' requires target feature 'sse4.2', but would be inlined into function 'r3_recip' that is compiled without support for 'sse4.2'
core.c: V0[2] = V0[1] = V0[0] = _mm256_set1_epi32(0);
core.c: ^
core.c: core.c:358:27: error: always_inline function '_mm256_set1_epi32' requires target feature 'sse4.2', but would be inlined into function 'r3_recip' that is compiled without support for 'sse4.2'
core.c: V1[2] = V1[1] = V1[0] = _mm256_set1_epi32(0);
core.c: ^
core.c: core.c:360:11: error: always_inline function '_mm256_set_epi32' requires target feature 'sse4.2', but would be inlined into function 'r3_recip' that is compiled without support for 'sse4.2'
core.c: R0[0] = _mm256_set_epi32(0,0,0,0,0,0,0,1);
core.c: ^
core.c: core.c:361:19: error: always_inline function '_mm256_set1_epi32' requires target feature 'sse4.2', but would be inlined into function 'r3_recip' that is compiled without support for 'sse4.2'
core.c: R0[2] = R0[1] = _mm256_set1_epi32(0);
core.c: ^
core.c: core.c:362:27: error: always_inline function '_mm256_set1_epi32' requires target feature 'sse4.2', but would be inlined into function 'r3_recip' that is compiled without support for 'sse4.2'
core.c: R1[2] = R1[1] = R1[0] = _mm256_set1_epi32(0);
core.c: ^
core.c: core.c:376:15: error: always_inline function '_mm256_set1_epi32' requires target feature 'sse4.2', but would be inlined into function 'r3_recip' that is compiled without support for 'sse4.2'
core.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE faster