Implementation notes: aarch64, pi3aplus, crypto_kem/hila5
Computer: pi3aplus
Microarchitecture: aarch64; Cortex-A53 (410fd034)
Architecture: aarch64
CPU ID: 410fd034
SUPERCOP version: 20240425
Operation: crypto_kem
Primitive: hila5
Time | Object size | Test size | Implementation | Compiler | Benchmark date | SUPERCOP version |
930724 | 18781 0 0 | 36200 856 1568 | T:opt | gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20240503 | 20240425 |
990860 | 15525 0 0 | 31744 856 1552 | T:opt | gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20240503 | 20240425 |
1013193 | 21109 0 0 | 38941 792 1552 | T:opt | clang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20240503 | 20240425 |
1255597 | 15165 0 0 | 30352 840 1536 | T:opt | gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20240503 | 20240425 |
1280246 | 16113 0 0 | 32216 856 1552 | T:opt | gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20240503 | 20240425 |
64445142 | 5597 0 8196 | 20736 824 9728 | T:ref | gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20240503 | 20240425 |
64970795 | 10461 0 8196 | 28285 776 9760 | T:ref | clang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20240503 | 20240425 |
73877962 | 7981 0 8208 | 25336 840 9776 | T:ref | gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20240503 | 20240425 |
75222472 | 6345 0 8200 | 22496 840 9744 | T:ref | gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20240503 | 20240425 |
79530340 | 6277 0 8200 | 22336 840 9744 | T:ref | gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20240503 | 20240425 |
Compiler output
ms_x64_ntt.S: ms_x64_ntt.S:18:1: error: unknown directive
ms_x64_ntt.S: .intel_syntax noprefix
ms_x64_ntt.S: ^
ms_x64_ntt.S: ms_x64_ntt.S:37:12: error: invalid alignment value
ms_x64_ntt.S: .align 32
ms_x64_ntt.S: ^
ms_x64_ntt.S: ms_x64_ntt.S:68:3: error: unrecognized instruction mnemonic, did you mean: ushl, ushr?
ms_x64_ntt.S: push r12
ms_x64_ntt.S: ^
ms_x64_ntt.S: ms_x64_ntt.S:69:3: error: unrecognized instruction mnemonic, did you mean: ushl, ushr?
ms_x64_ntt.S: push r13
ms_x64_ntt.S: ^
ms_x64_ntt.S: ms_x64_ntt.S:70:3: error: unrecognized instruction mnemonic, did you mean: ushl, ushr?
ms_x64_ntt.S: push r14
ms_x64_ntt.S: ^
ms_x64_ntt.S: ms_x64_ntt.S:73:7: error: invalid operand for instruction
ms_x64_ntt.S: mov r9, 1
ms_x64_ntt.S: ^
ms_x64_ntt.S: ms_x64_ntt.S:74:7: error: invalid operand for instruction
ms_x64_ntt.S: mov rax, rdx
ms_x64_ntt.S: ^
ms_x64_ntt.S: ms_x64_ntt.S:75:7: error: invalid operand for instruction
ms_x64_ntt.S: mov r12, rdx
ms_x64_ntt.S: ^
ms_x64_ntt.S: ms_x64_ntt.S:76:3: error: unrecognized instruction mnemonic, did you mean: asr, lsr, msr, rshrn, shl, shrn, shrn2, sri, srshr, sshr, str, urshr, ushr?
ms_x64_ntt.S: ...
Number of similar (implementation,compiler) pairs: 1, namely:
Implementation | Compiler |
T:avx2 | clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE (4.2.1_Compatible_Clang_7.0.1_(tags/RELEASE_701/final)) |
Compiler output
ms_x64_ntt.S: ms_x64_ntt.S: Assembler messages:
ms_x64_ntt.S: ms_x64_ntt.S:18: Error: unknown pseudo-op: `.intel_syntax'
ms_x64_ntt.S: ms_x64_ntt.S:68: Error: unknown mnemonic `push' -- `push r12'
ms_x64_ntt.S: ms_x64_ntt.S:69: Error: unknown mnemonic `push' -- `push r13'
ms_x64_ntt.S: ms_x64_ntt.S:70: Error: unknown mnemonic `push' -- `push r14'
ms_x64_ntt.S: ms_x64_ntt.S:73: Error: operand 1 must be an integer register -- `mov r9,1'
ms_x64_ntt.S: ms_x64_ntt.S:74: Error: operand 1 must be an integer register -- `mov rax,rdx'
ms_x64_ntt.S: ms_x64_ntt.S:75: Error: operand 1 must be an integer register -- `mov r12,rdx'
ms_x64_ntt.S: ms_x64_ntt.S:76: Error: unknown mnemonic `shr' -- `shr r12,4'
ms_x64_ntt.S: ms_x64_ntt.S:77: Error: unknown mnemonic `vmovdqu' -- `vmovdqu ymm14,[rip+MASK12x8]'
ms_x64_ntt.S: ms_x64_ntt.S:78: Error: unknown mnemonic `vmovdqu' -- `vmovdqu ymm12,[rip+PERM0246]'
ms_x64_ntt.S: ms_x64_ntt.S:79: Error: operand 1 must be an integer register -- `mov r14,16'
ms_x64_ntt.S: ms_x64_ntt.S:80: Error: operand 1 must be an integer register -- `mov rcx,11'
ms_x64_ntt.S: ms_x64_ntt.S:82: Error: unknown mnemonic `shr' -- `shr rax,1'
ms_x64_ntt.S: ms_x64_ntt.S:83: Error: unknown mnemonic `dec' -- `dec rcx'
ms_x64_ntt.S: ms_x64_ntt.S:84: Error: unknown mnemonic `xor' -- `xor rdx,rdx'
ms_x64_ntt.S: ms_x64_ntt.S:86: Error: operand 1 must be an integer register -- `mov r10,rdx'
ms_x64_ntt.S: ms_x64_ntt.S:87: Error: operand 1 must be an integer register -- `mov r11,rax'
ms_x64_ntt.S: ms_x64_ntt.S:88: Error: unknown mnemonic `dec' -- `dec r11'
ms_x64_ntt.S: ms_x64_ntt.S:89: Error: operand 1 must be a SIMD scalar register -- `shl r10,cl'
ms_x64_ntt.S: ms_x64_ntt.S:90: Error: operand 1 must be an integer or stack pointer register -- `add r11,r10'
ms_x64_ntt.S: ms_x64_ntt.S:91: Error: operand 1 must be an integer register -- `mov r13,r9'
ms_x64_ntt.S: ms_x64_ntt.S:92: Error: operand 1 must be an integer or stack pointer register -- `add r13,rdx'
ms_x64_ntt.S: ms_x64_ntt.S:93: Error: unknown mnemonic `vbroadcastss' -- `vbroadcastss ymm11,DWORD PTR[rsi+4*r13]'
ms_x64_ntt.S: ms_x64_ntt.S:96: Error: operand 1 must be an integer register -- `mov r13,r10'
ms_x64_ntt.S: ...
Number of similar (implementation,compiler) pairs: 4, namely:
Implementation | Compiler |
T:avx2 | gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE (8.3.0) |
T:avx2 | gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE (8.3.0) |
T:avx2 | gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE (8.3.0) |
T:avx2 | gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE (8.3.0) |
Namespace violations
hila5_sha3_opt.o hila5_sha3 T
hila5_sha3_opt.o hila5_sha3_final T
hila5_sha3_opt.o hila5_sha3_init T
hila5_sha3_opt.o hila5_sha3_update T
hila5_sha3_opt.o hila5_shake_out T
hila5_sha3_opt.o hila5_shake_xof T
ms_ntt.o mslc_correction T
ms_ntt.o mslc_intt T
ms_ntt.o mslc_ntt T
ms_ntt.o mslc_pmul T
ms_ntt.o mslc_pmuladd T
ms_ntt.o mslc_smul T
ms_ntt.o mslc_two_reduce12289 T
ms_ntt_const.o mslc_inv_rev_ntt1024 R
ms_ntt_const.o mslc_psi_rev_ntt1024 R
Number of similar (implementation,compiler) pairs: 5, namely:
Implementation | Compiler |
T:opt | clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE (4.2.1_Compatible_Clang_7.0.1_(tags/RELEASE_701/final)) |
T:opt | gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE (8.3.0) |
T:opt | gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE (8.3.0) |
T:opt | gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE (8.3.0) |
T:opt | gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE (8.3.0) |
Namespace violations
hila5_sha3_ref.o hila5_sha3 T
hila5_sha3_ref.o hila5_sha3_final T
hila5_sha3_ref.o hila5_sha3_init T
hila5_sha3_ref.o hila5_sha3_keccakf T
hila5_sha3_ref.o hila5_sha3_update T
hila5_sha3_ref.o hila5_shake_out T
hila5_sha3_ref.o hila5_shake_xof T
kem.o slow_rmul T
Number of similar (implementation,compiler) pairs: 5, namely:
Implementation | Compiler |
T:ref | clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE (4.2.1_Compatible_Clang_7.0.1_(tags/RELEASE_701/final)) |
T:ref | gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE (8.3.0) |
T:ref | gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE (8.3.0) |
T:ref | gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE (8.3.0) |
T:ref | gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE (8.3.0) |