Implementation notes: aarch64, pi3aplus, crypto_core/invhrss701

Computer: pi3aplus
Microarchitecture: aarch64; Cortex-A53 (410fd034)
Architecture: aarch64
CPU ID: 410fd034
SUPERCOP version: 20240107
Operation: crypto_core
Primitive: invhrss701
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
76269244576 0 015858 840 768simplergcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2023121320231212
98354953900 0 015933 768 760simplerclang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2023121420231212
158777004264 0 015472 832 768refgcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2023121320231212
2293308512420 0 024437 768 760refclang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2023121320231212
59514094772 0 010695 824 752simplergcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2023121320231212
59550410672 0 09815 808 744simplergcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2023121320231212
81319170816 0 010823 824 752simplergcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2023121420231212
875134741108 0 010231 808 744refgcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2023121320231212
895317351192 0 011127 824 752refgcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2023121320231212
915648491260 0 011271 824 752refgcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2023121320231212

Compiler output

Implementation: faster
Security model: constbranchindex
Compiler: clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
core.c: In file included from core.c:4:
core.c: In file included from /usr/lib/llvm-7/lib/clang/7.0.1/include/immintrin.h:28:
core.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:64:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
core.c: return (__m64)__builtin_ia32_vec_init_v2si(__i, 0);
core.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
core.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:143:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
core.c: return (__m64)__builtin_ia32_packsswb((__v4hi)__m1, (__v4hi)__m2);
core.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
core.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:173:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
core.c: return (__m64)__builtin_ia32_packssdw((__v2si)__m1, (__v2si)__m2);
core.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
core.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:203:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
core.c: return (__m64)__builtin_ia32_packuswb((__v4hi)__m1, (__v4hi)__m2);
core.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
core.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:230:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
core.c: return (__m64)__builtin_ia32_punpckhbw((__v8qi)__m1, (__v8qi)__m2);
core.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
core.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:253:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
core.c: return (__m64)__builtin_ia32_punpckhwd((__v4hi)__m1, (__v4hi)__m2);
core.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
core.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:274:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
core.c: return (__m64)__builtin_ia32_punpckhdq((__v2si)__m1, (__v2si)__m2);
core.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
core.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:301:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
core.c: return (__m64)__builtin_ia32_punpcklbw((__v8qi)__m1, (__v8qi)__m2);
core.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE faster

Compiler output

Implementation: faster
Security model: constbranchindex
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE
core.c: core.c:4:10: fatal error: immintrin.h: No such file or directory
core.c: #include <immintrin.h>
core.c: ^~~~~~~~~~~~~
core.c: compilation terminated.

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE faster
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE faster
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE faster
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE faster

Compiler output

Implementation: faster921
Security model: constbranchindex
Compiler: clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
core.S: core.S:9:8: error: unknown token in expression
core.S: pushq %rbp
core.S: ^
core.S: core.S:9:8: error: invalid operand
core.S: pushq %rbp
core.S: ^
core.S: core.S:12:7: error: unknown token in expression
core.S: xorl %ecx, %ecx
core.S: ^
core.S: core.S:12:7: error: invalid operand
core.S: xorl %ecx, %ecx
core.S: ^
core.S: core.S:13:11: error: unknown token in expression
core.S: vpcmpeqd %ymm4, %ymm4, %ymm4
core.S: ^
core.S: core.S:13:11: error: invalid operand
core.S: vpcmpeqd %ymm4, %ymm4, %ymm4
core.S: ^
core.S: core.S:14:7: error: unknown token in expression
core.S: movq %rsp, %rbp
core.S: ^
core.S: core.S:14:7: error: invalid operand
core.S: movq %rsp, %rbp
core.S: ^
core.S: core.S:16:8: error: unknown token in expression
core.S: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE faster921

Compiler output

Implementation: faster921
Security model: constbranchindex
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE
core.S: core.c: Assembler messages:
core.S: core.c:9: Error: unknown mnemonic `pushq' -- `pushq %rbp'
core.S: core.c:12: Error: unknown mnemonic `xorl' -- `xorl %ecx,%ecx'
core.S: core.c:13: Error: unknown mnemonic `vpcmpeqd' -- `vpcmpeqd %ymm4,%ymm4,%ymm4'
core.S: core.c:14: Error: unknown mnemonic `movq' -- `movq %rsp,%rbp'
core.S: core.c:16: Error: unknown mnemonic `pushq' -- `pushq %r13'
core.S: core.c:17: Error: unknown mnemonic `pushq' -- `pushq %r12'
core.S: core.c:18: Error: unknown mnemonic `pushq' -- `pushq %rbx'
core.S: core.c:22: Error: unknown mnemonic `movq' -- `movq %rdi,%rbx'
core.S: core.c:23: Error: unknown mnemonic `andq' -- `andq $-32,%rsp'
core.S: core.c:24: Error: unknown mnemonic `subq' -- `subq $5568,%rsp'
core.S: core.c:25: Error: unknown mnemonic `movzbl' -- `movzbl 1400(%rsi),%eax'
core.S: core.c:26: Error: unknown mnemonic `vmovdqa' -- `vmovdqa .LC2(%rip),%ymm1'
core.S: core.c:27: Error: unknown mnemonic `vmovdqa' -- `vmovdqa .LC3(%rip),%ymm3'
core.S: core.c:28: Error: unknown mnemonic `vmovdqa' -- `vmovdqa .LC4(%rip),%ymm5'
core.S: core.c:29: Error: unknown mnemonic `leaq' -- `leaq 864(%rsp),%rdi'
core.S: core.c:30: Error: unknown mnemonic `notl' -- `notl %eax'
core.S: core.c:31: Error: unknown mnemonic `movl' -- `movl %eax,%r8d'
core.S: core.c:32: Error: unknown mnemonic `andl' -- `andl $3,%eax'
core.S: core.c:33: Error: unknown mnemonic `leal' -- `leal -3(%rax),%edx'
core.S: core.c:34: Error: unknown mnemonic `andl' -- `andl $3,%r8d'
core.S: core.c:35: Error: unknown mnemonic `movl' -- `movl %edx,%eax'
core.S: core.c:36: Error: unknown mnemonic `sarl' -- `sarl $5,%eax'
core.S: core.c:37: Error: unknown mnemonic `andl' -- `andl %r8d,%eax'
core.S: core.c:38: Error: unknown mnemonic `leal' -- `leal 1(%rax),%r8d'
core.S: ...

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE faster921
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE faster921
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE faster921
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE faster921