Implementation notes: amd64, rumba3, crypto_core/invhrss701

Computer: rumba3
Microarchitecture: amd64; Zen (800f11)
Architecture: amd64
CPU ID: AuthenticAMD-00800f11-178bfbff
SUPERCOP version: 20221122
Operation: crypto_core
Primitive: invhrss701
Time | Object size | Test size | Implementation | Compiler | Benchmark date | SUPERCOP version
134220 | 10149 0 0 | 21164 788 960 | faster921 | gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20230525 | 20221122
134235 | 10149 0 0 | 24539 844 960 | faster921 | clang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20230525 | 20221122
134318 | 10149 0 0 | 22389 836 1024 | faster921 | clang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20230525 | 20221122
134438 | 10149 0 0 | 21627 844 928 | faster921 | clang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20230525 | 20221122
134439 | 10149 0 0 | 22661 804 992 | faster921 | gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20230525 | 20221122
134450 | 10149 0 0 | 23893 804 992 | faster921 | gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20230525 | 20221122
134457 | 10149 0 0 | 24915 844 960 | faster921 | clang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20230525 | 20221122
134497 | 10149 0 0 | 22188 796 992 | faster921 | gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20230525 | 20221122
134656 | 11141 0 0 | 25317 804 992 | faster | gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20230525 | 20221122
134874 | 10149 0 0 | 23619 844 928 | faster921 | clang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20230525 | 20221122
147410 | 14264 0 0 | 29059 844 960 | faster | clang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20230525 | 20221122
154593 | 12845 0 0 | 27267 844 960 | faster | clang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20230525 | 20221122
182579 | 7004 0 0 | 18507 844 928 | faster | clang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20230525 | 20221122
183788 | 7385 0 0 | 20357 804 992 | faster | gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20230525 | 20221122
184910 | 7272 0 0 | 19740 796 992 | faster | gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20230525 | 20221122
186742 | 7067 0 0 | 19349 836 1024 | faster | clang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20230525 | 20221122
210723 | 4110 0 0 | 15504 780 960 | faster | gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20230525 | 20221122
270727 | 28431 0 0 | 43235 844 960 | avx2 | clang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20230525 | 20221122
270997 | 27359 0 0 | 41787 844 960 | avx2 | clang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20230525 | 20221122
271022 | 27309 0 0 | 40803 844 928 | avx2 | clang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20230525 | 20221122
271368 | 27029 0 0 | 40741 804 992 | avx2 | gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20230525 | 20221122
272166 | 26806 0 0 | 38315 844 928 | avx2 | clang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20230525 | 20221122
272186 | 26801 0 0 | 39069 836 1024 | avx2 | clang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20230525 | 20221122
272376 | 26902 0 0 | 39381 804 992 | avx2 | gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20230525 | 20221122
274074 | 26895 0 0 | 37800 780 960 | avx2 | gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20230525 | 20221122
274088 | 26909 0 0 | 38908 796 992 | avx2 | gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20230525 | 20221122
1846653 | 5767 0 0 | 20015 812 992 | simpler | gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20230525 | 20221122
3536417 | 3645 0 0 | 17781 804 992 | ref | gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20230525 | 20221122
7138331 | 11770 0 0 | 26555 844 960 | ref | clang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20230525 | 20221122
7146010 | 8538 0 0 | 22947 844 960 | ref | clang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20230525 | 20221122
7513606 | 6460 0 0 | 19947 844 928 | ref | clang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20230525 | 20221122
8262897 | 2561 0 0 | 16117 852 928 | simpler | clang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20230525 | 20221122
8457333 | 4619 0 0 | 19493 852 960 | simpler | clang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20230525 | 20221122
9301802 | 3255 0 0 | 17757 852 960 | simpler | clang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20230525 | 20221122
16585201 | 993 0 0 | 14015 812 992 | simpler | gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20230525 | 20221122
17957667 | 800 0 0 | 12365 852 928 | simpler | clang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20230525 | 20221122
21137146 | 878 0 0 | 13324 796 992 | simpler | gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20230525 | 20221122
26755369 | 772 0 0 | 13095 844 1024 | simpler | clang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20230525 | 20221122
30248477 | 1379 0 0 | 12867 844 928 | ref | clang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20230525 | 20221122
30713932 | 2215 0 0 | 14445 836 1024 | ref | clang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20230525 | 20221122
31812209 | 1372 0 0 | 13796 796 992 | ref | gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20230525 | 20221122
32123066 | 1391 0 0 | 14293 804 992 | ref | gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20230525 | 20221122
37948356 | 783 0 0 | 12170 788 960 | simpler | gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20230525 | 20221122
40478040 | 1157 0 0 | 12448 780 960 | ref | gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20230525 | 20221122
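
As a reading aid, the spread between implementations can be expressed as a ratio of Time values. The sketch below assumes that the Time column is a cycle count and uses the smallest Time listed above for each implementation; the names `speedup`, `print_speedups`, and the `T_*` constants are hypothetical and not part of SUPERCOP.

```c
#include <stdio.h>

/* Smallest Time value listed above for each implementation. */
enum {
  T_FASTER921 = 134220,
  T_FASTER    = 134656,
  T_AVX2      = 270727,
  T_SIMPLER   = 1846653,
  T_REF       = 3536417
};

/* Hypothetical helper: how many times faster the candidate is than the
   baseline, given both as Time values (assumed to be cycle counts). */
double speedup(double baseline, double candidate)
{
  return baseline / candidate;
}

/* Print each implementation's best Time relative to the best ref entry. */
void print_speedups(void)
{
  printf("faster921 vs ref: %.2fx\n", speedup(T_REF, T_FASTER921));
  printf("faster    vs ref: %.2fx\n", speedup(T_REF, T_FASTER));
  printf("avx2      vs ref: %.2fx\n", speedup(T_REF, T_AVX2));
  printf("simpler   vs ref: %.2fx\n", speedup(T_REF, T_SIMPLER));
}
```

Calling `print_speedups()` from a `main` function prints the four ratios. The comparison uses only the best entry per implementation, so it says nothing about how sensitive each implementation is to compiler and optimization level.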

Compiler output

Implementation: faster
Security model: constbranchindex
Compiler: clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
core.c: core.c:353:19: error: always_inline function '_mm256_set_epi32' requires target feature 'avx', but would be inlined into function 'r3_recip' that is compiled without support for 'avx'
core.c: F0[0] = F0[1] = _mm256_set_epi32(-1,-1,-1,-1,-1,-1,-1,-1);
core.c: ^
core.c: core.c:353:19: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
core.c: core.c:354:11: error: always_inline function '_mm256_set_epi32' requires target feature 'avx', but would be inlined into function 'r3_recip' that is compiled without support for 'avx'
core.c: F0[2] = _mm256_set_epi32(32767,-1,32767,-1,32767,-1,65535,-1);
core.c: ^
core.c: core.c:354:11: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
core.c: core.c:355:27: error: always_inline function '_mm256_set1_epi32' requires target feature 'avx', but would be inlined into function 'r3_recip' that is compiled without support for 'avx'
core.c: F1[0] = F1[1] = F1[2] = _mm256_set1_epi32(0);
core.c: ^
core.c: core.c:355:27: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
core.c: core.c:357:27: error: always_inline function '_mm256_set1_epi32' requires target feature 'avx', but would be inlined into function 'r3_recip' that is compiled without support for 'avx'
core.c: V0[2] = V0[1] = V0[0] = _mm256_set1_epi32(0);
core.c: ^
core.c: core.c:357:27: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
core.c: core.c:358:27: error: always_inline function '_mm256_set1_epi32' requires target feature 'avx', but would be inlined into function 'r3_recip' that is compiled without support for 'avx'
core.c: V1[2] = V1[1] = V1[0] = _mm256_set1_epi32(0);
core.c: ^
core.c: core.c:358:27: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
core.c: core.c:360:11: error: always_inline function '_mm256_set_epi32' requires target feature 'avx', but would be inlined into function 'r3_recip' that is compiled without support for 'avx'
core.c: R0[0] = _mm256_set_epi32(0,0,0,0,0,0,0,1);
core.c: ^
core.c: core.c:360:11: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
core.c: core.c:361:19: error: always_inline function '_mm256_set1_epi32' requires target feature 'avx', but would be inlined into function 'r3_recip' that is compiled without support for 'avx'
core.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler | Implementations
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE | faster
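
The diagnostics above indicate that, under this clang invocation, `r3_recip` was compiled without the `avx` target feature, so the always_inline AVX intrinsics could not be inlined into it. One general way to make such code independent of the command-line flags is to enable the feature per function with the GCC/clang `target` attribute and to dispatch at run time. The following is a minimal sketch of that technique only; it is not the fix used by SUPERCOP or by the authors of `core.c`, and the names `fill_ones`, `fill_ones_avx2`, and `fill_ones_scalar` are hypothetical.

```c
#include <stdint.h>

#if defined(__x86_64__) || defined(__i386__)
#include <immintrin.h>

/* The target attribute enables AVX2 (and therefore AVX) for this one
   function, so the intrinsics below can be inlined even when the file
   as a whole is compiled without AVX support. */
__attribute__((target("avx2")))
static void fill_ones_avx2(int32_t out[8])
{
  __m256i v = _mm256_set1_epi32(-1);         /* eight 32-bit lanes of -1 */
  _mm256_storeu_si256((__m256i *) out, v);   /* unaligned 256-bit store */
}
#endif

/* Portable fallback for CPUs or architectures without AVX2. */
static void fill_ones_scalar(int32_t out[8])
{
  for (int i = 0; i < 8; ++i) out[i] = -1;
}

/* Hypothetical entry point: use the AVX2 path only if the CPU reports it. */
void fill_ones(int32_t out[8])
{
#if defined(__x86_64__) || defined(__i386__)
  if (__builtin_cpu_supports("avx2")) {
    fill_ones_avx2(out);
    return;
  }
#endif
  fill_ones_scalar(out);
}
```

Callers use only `fill_ones`; the AVX2 variant is never called directly from a function compiled without AVX, which is the situation the compiler rejects in the log above.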