Implementation notes: aarch64, pi4b, crypto_core/invhrss701

Computer: pi4b
Microarchitecture: aarch64; Cortex-A72 (410fd083)
Architecture: aarch64
CPU ID: 410fd083
SUPERCOP version: 20240107
Operation: crypto_core
Primitive: invhrss701
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
23426752592 0 014110 824 768simplergcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2023122320231222
37030382092 0 014692 848 760simplerclang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2023122320231222
125761486772 0 019290 840 760refclang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2023122320231222
133905852572 0 014028 816 768refgcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2023122320231222
24148946760 0 010374 808 744simplergcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2023122320231222
26049029856 0 011406 824 752simplergcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2023122320231222
32081421824 0 011251 808 752simplergcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2023122320231222
407508821232 0 011716 816 752refgcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2023122320231222
420855021296 0 011747 808 752refgcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2023122320231222
435960451112 0 010652 800 744refgcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2023122320231222

Compiler output

Implementation: faster
Security model: constbranchindex
Compiler: clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
core.c: In file included from core.c:4:
core.c: /usr/lib/llvm-14/lib/clang/14.0.0/include/immintrin.h:14:2: error: "This header is only meant to be used on x86 and x64 architecture"
core.c: #error "This header is only meant to be used on x86 and x64 architecture"
core.c: ^
core.c: In file included from core.c:4:
core.c: In file included from /usr/lib/llvm-14/lib/clang/14.0.0/include/immintrin.h:17:
core.c: In file included from /usr/lib/llvm-14/lib/clang/14.0.0/include/x86gprintrin.h:15:
core.c: /usr/lib/llvm-14/lib/clang/14.0.0/include/hresetintrin.h:42:27: error: invalid input constraint 'a' in asm
core.c: __asm__ ("hreset $0" :: "a"(__eax));
core.c: ^
core.c: In file included from core.c:4:
core.c: In file included from /usr/lib/llvm-14/lib/clang/14.0.0/include/immintrin.h:21:
core.c: /usr/lib/llvm-14/lib/clang/14.0.0/include/mmintrin.h:14:2: error: "This header is only meant to be used on x86 and x64 architecture"
core.c: #error "This header is only meant to be used on x86 and x64 architecture"
core.c: ^
core.c: /usr/lib/llvm-14/lib/clang/14.0.0/include/mmintrin.h:54:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
core.c: return (__m64)__builtin_ia32_vec_init_v2si(__i, 0);
core.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
core.c: /usr/lib/llvm-14/lib/clang/14.0.0/include/mmintrin.h:133:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
core.c: return (__m64)__builtin_ia32_packsswb((__v4hi)__m1, (__v4hi)__m2);
core.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
core.c: /usr/lib/llvm-14/lib/clang/14.0.0/include/mmintrin.h:163:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
core.c: return (__m64)__builtin_ia32_packssdw((__v2si)__m1, (__v2si)__m2);
core.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
core.c: /usr/lib/llvm-14/lib/clang/14.0.0/include/mmintrin.h:193:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
core.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE faster

Compiler output

Implementation: faster
Security model: constbranchindex
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE
core.c: core.c:4:10: fatal error: immintrin.h: No such file or directory
core.c: 4 | #include <immintrin.h>
core.c: | ^~~~~~~~~~~~~
core.c: compilation terminated.

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE faster
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE faster
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE faster
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE faster

Compiler output

Implementation: faster921
Security model: constbranchindex
Compiler: clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
core.S: core.S:9:8: error: unknown token in expression
core.S: pushq %rbp
core.S: ^
core.S: core.S:9:8: error: invalid operand
core.S: pushq %rbp
core.S: ^
core.S: core.S:12:7: error: unknown token in expression
core.S: xorl %ecx, %ecx
core.S: ^
core.S: core.S:12:7: error: invalid operand
core.S: xorl %ecx, %ecx
core.S: ^
core.S: core.S:13:11: error: unknown token in expression
core.S: vpcmpeqd %ymm4, %ymm4, %ymm4
core.S: ^
core.S: core.S:13:11: error: invalid operand
core.S: vpcmpeqd %ymm4, %ymm4, %ymm4
core.S: ^
core.S: core.S:14:7: error: unknown token in expression
core.S: movq %rsp, %rbp
core.S: ^
core.S: core.S:14:7: error: invalid operand
core.S: movq %rsp, %rbp
core.S: ^
core.S: core.S:16:8: error: unknown token in expression
core.S: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE faster921

Compiler output

Implementation: faster921
Security model: constbranchindex
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE
core.S: core.c: Assembler messages:
core.S: core.c:9: Error: unknown mnemonic `pushq' -- `pushq %rbp'
core.S: core.c:12: Error: unknown mnemonic `xorl' -- `xorl %ecx,%ecx'
core.S: core.c:13: Error: unknown mnemonic `vpcmpeqd' -- `vpcmpeqd %ymm4,%ymm4,%ymm4'
core.S: core.c:14: Error: unknown mnemonic `movq' -- `movq %rsp,%rbp'
core.S: core.c:16: Error: unknown mnemonic `pushq' -- `pushq %r13'
core.S: core.c:17: Error: unknown mnemonic `pushq' -- `pushq %r12'
core.S: core.c:18: Error: unknown mnemonic `pushq' -- `pushq %rbx'
core.S: core.c:22: Error: unknown mnemonic `movq' -- `movq %rdi,%rbx'
core.S: core.c:23: Error: unknown mnemonic `andq' -- `andq $-32,%rsp'
core.S: core.c:24: Error: unknown mnemonic `subq' -- `subq $5568,%rsp'
core.S: core.c:25: Error: unknown mnemonic `movzbl' -- `movzbl 1400(%rsi),%eax'
core.S: core.c:26: Error: unknown mnemonic `vmovdqa' -- `vmovdqa .LC2(%rip),%ymm1'
core.S: core.c:27: Error: unknown mnemonic `vmovdqa' -- `vmovdqa .LC3(%rip),%ymm3'
core.S: core.c:28: Error: unknown mnemonic `vmovdqa' -- `vmovdqa .LC4(%rip),%ymm5'
core.S: core.c:29: Error: unknown mnemonic `leaq' -- `leaq 864(%rsp),%rdi'
core.S: core.c:30: Error: unknown mnemonic `notl' -- `notl %eax'
core.S: core.c:31: Error: unknown mnemonic `movl' -- `movl %eax,%r8d'
core.S: core.c:32: Error: unknown mnemonic `andl' -- `andl $3,%eax'
core.S: core.c:33: Error: unknown mnemonic `leal' -- `leal -3(%rax),%edx'
core.S: core.c:34: Error: unknown mnemonic `andl' -- `andl $3,%r8d'
core.S: core.c:35: Error: unknown mnemonic `movl' -- `movl %edx,%eax'
core.S: core.c:36: Error: unknown mnemonic `sarl' -- `sarl $5,%eax'
core.S: core.c:37: Error: unknown mnemonic `andl' -- `andl %r8d,%eax'
core.S: core.c:38: Error: unknown mnemonic `leal' -- `leal 1(%rax),%r8d'
core.S: ...

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE faster921
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE faster921
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE faster921
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE faster921