Implementation notes: aarch64, rockpi4, crypto_hash/blake2s

Computer: rockpi4
Architecture: aarch64
CPU ID: unknown CPU ID
SUPERCOP version: 20190816
Operation: crypto_hash
Primitive: blake2s
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
1445510224 0 022415 896 760regsgcc_-O3_-fomit-frame-pointer2019091120190816
1445510224 0 021246 888 744regsgcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv2019091120190816
1445510224 0 022511 896 760regsgcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv2019091120190816
1486810316 0 023684 800 768regsclang_-O3_-fwrapv_-mavx_-fomit-frame-pointer_-Qunused-arguments2019091120190816
1486810316 0 023684 800 768regsclang_-O3_-fwrapv_-mavx_-maes_-mpclmul_-fomit-frame-pointer_-Qunused-arguments2019091120190816
1486810796 0 022975 896 760regsgcc_-fno-schedule-insns_-O3_-fomit-frame-pointer2019091120190816
1486810836 0 021054 872 736regsgcc_-funroll-loops_-Os_-fomit-frame-pointer2019091120190816
1486810796 0 024335 896 760regsgcc_-funroll-loops_-fno-schedule-insns_-O3_-fomit-frame-pointer2019091120190816
1528110224 0 021134 888 744regsgcc_-O2_-fomit-frame-pointer2019091120190816
1528110224 0 023791 896 760regsgcc_-funroll-loops_-O3_-fomit-frame-pointer2019091120190816
156947280 0 019535 896 760refgcc_-O3_-fomit-frame-pointer2019091120190816
1569410324 0 023700 800 768regsclang_-mcpu=native_-mfpu=neon_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments2019091120190816
161077560 0 019903 896 760refgcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv2019091120190816
1610710316 0 023636 800 768regsclang_-O3_-fomit-frame-pointer_-Qunused-arguments2019091120190816
1610710316 0 023684 800 768regsclang_-O3_-fwrapv_-mavx2_-fomit-frame-pointer_-Qunused-arguments2019091120190816
165207496 0 019727 896 760refgcc_-fno-schedule-insns_-O3_-fomit-frame-pointer2019091120190816
169338096 0 021412 800 768refclang_-O3_-fomit-frame-pointer_-Qunused-arguments2019091120190816
173468096 0 021460 800 768refclang_-O3_-fwrapv_-mavx_-fomit-frame-pointer_-Qunused-arguments2019091120190816
173466580 0 017615 896 744refgcc_-O2_-fomit-frame-pointer2019091120190816
1734610224 0 023214 888 744regsgcc_-funroll-loops_-O2_-fomit-frame-pointer2019091120190816
1775910832 0 020990 872 736regsgcc_-Os_-fomit-frame-pointer2019091120190816
1775910796 0 021654 888 744regsgcc_-fno-schedule-insns_-O2_-fomit-frame-pointer2019091120190816
1775910832 0 020990 872 736regsgcc_-fno-schedule-insns_-Os_-fomit-frame-pointer2019091120190816
1775910796 0 023878 888 744regsgcc_-funroll-loops_-fno-schedule-insns_-O2_-fomit-frame-pointer2019091120190816
181726636 0 016911 880 736refgcc_-Os_-fomit-frame-pointer2019091120190816
1817210836 0 021054 872 736regsgcc_-funroll-loops_-fno-schedule-insns_-Os_-fomit-frame-pointer2019091120190816
1817210832 0 020990 872 736regsgcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv2019091120190816
185856636 0 016911 880 736refgcc_-fno-schedule-insns_-Os_-fomit-frame-pointer2019091120190816
198246764 0 019887 896 744refgcc_-funroll-loops_-O2_-fomit-frame-pointer2019091120190816
202377368 0 020991 896 760refgcc_-funroll-loops_-O3_-fomit-frame-pointer2019091120190816
210636596 0 017743 896 744refgcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv2019091120190816
214768096 0 021460 800 768refclang_-O3_-fwrapv_-mavx2_-fomit-frame-pointer_-Qunused-arguments2019091120190816
218898096 0 021460 800 768refclang_-O3_-fwrapv_-mavx_-maes_-mpclmul_-fomit-frame-pointer_-Qunused-arguments2019091120190816
227156756 0 017735 896 744refgcc_-fno-schedule-insns_-O2_-fomit-frame-pointer2019091120190816
239546636 0 016911 880 736refgcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv2019091120190816
243677108 0 020303 896 744refgcc_-funroll-loops_-fno-schedule-insns_-O2_-fomit-frame-pointer2019091120190816
256069276 0 022644 800 768refclang_-mcpu=native_-mfpu=neon_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments2019091120190816
268457584 0 021183 896 760refgcc_-funroll-loops_-fno-schedule-insns_-O3_-fomit-frame-pointer2019091120190816
305626636 0 016975 880 736refgcc_-funroll-loops_-Os_-fomit-frame-pointer2019091120190816
305626636 0 016975 880 736refgcc_-funroll-loops_-fno-schedule-insns_-Os_-fomit-frame-pointer2019091120190816
330407920 0 019039 896 744refgcc_-O_-fomit-frame-pointer2019091120190816
330407920 0 019039 896 744refgcc_-fno-schedule-insns_-O_-fomit-frame-pointer2019091120190816
355188876 0 023223 896 744refgcc_-funroll-loops_-fno-schedule-insns_-O_-fomit-frame-pointer2019091120190816
4419113652 0 024654 888 744regsgcc_-fno-schedule-insns_-O_-fomit-frame-pointer2019091120190816
4419113644 0 027862 888 744regsgcc_-funroll-loops_-fno-schedule-insns_-O_-fomit-frame-pointer2019091120190816
4419113652 0 024654 888 744regsgcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv2019091120190816
524518876 0 023223 896 744refgcc_-funroll-loops_-O_-fomit-frame-pointer2019091120190816
545167912 0 019039 896 744refgcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv2019091120190816
5616813652 0 024654 888 744regsgcc_-O_-fomit-frame-pointer2019091120190816
5616813644 0 027862 888 744regsgcc_-funroll-loops_-O_-fomit-frame-pointer2019091120190816
8796927376 0 040558 872 744regsgcc2019091120190816
8796927376 0 040558 872 744regsgcc_-funroll-loops2019091120190816
9209927376 0 040558 872 744regscc2019091120190816
17056920280 0 037743 880 744refgcc_-funroll-loops2019091120190816
17098220280 0 037743 880 744refgcc2019091120190816
22549820280 0 037743 880 744refcc2019091120190816

Compiler output

Implementation: avxicc
Security model: unknown
Compiler: cc
blake2s.s: blake2s.s: Assembler messages:
blake2s.s: blake2s.s:13: Error: unknown mnemonic `xorl' -- `xorl %r11d,%r11d'
blake2s.s: blake2s.s:14: Error: unknown mnemonic `xorl' -- `xorl %ecx,%ecx'
blake2s.s: blake2s.s:15: Error: unknown mnemonic `movq' -- `movq %rsi,%r9'
blake2s.s: blake2s.s:16: Error: unknown mnemonic `vmovdqu' -- `vmovdqu .L_2il0floatpacket.10(%rip),%xmm4'
blake2s.s: blake2s.s:17: Error: unknown mnemonic `xorl' -- `xorl %eax,%eax'
blake2s.s: blake2s.s:18: Error: unknown mnemonic `vmovdqu' -- `vmovdqu .L_2il0floatpacket.7(%rip),%xmm0'
blake2s.s: blake2s.s:19: Error: unknown mnemonic `vmovdqa' -- `vmovdqa %xmm4,%xmm2'
blake2s.s: blake2s.s:20: Error: unknown mnemonic `vmovdqu' -- `vmovdqu .L_2il0floatpacket.8(%rip),%xmm1'
blake2s.s: blake2s.s:21: Error: unknown mnemonic `movq' -- `movq %rdx,%r8'
blake2s.s: blake2s.s:22: Error: unknown mnemonic `vmovdqu' -- `vmovdqu .L_2il0floatpacket.9(%rip),%xmm3'
blake2s.s: blake2s.s:23: Error: unknown mnemonic `xorl' -- `xorl %r10d,%r10d'
blake2s.s: blake2s.s:24: Error: unknown mnemonic `vmovdqu' -- `vmovdqu .L_2il0floatpacket.11(%rip),%xmm5'
blake2s.s: blake2s.s:25: Error: unknown mnemonic `cmpq' -- `cmpq $64,%rdx'
blake2s.s: blake2s.s:26: Error: unknown mnemonic `jbe' -- `jbe ..B1.5'
blake2s.s: blake2s.s:28: Error: unknown mnemonic `movq' -- `movq %rbp,-104(%rsp)'
blake2s.s: blake2s.s:29: Error: unknown mnemonic `lea' -- `lea -1(%rdx),%rsi'
blake2s.s: blake2s.s:30: Error: unknown mnemonic `sarq' -- `sarq $5,%rsi'
blake2s.s: blake2s.s:32: Error: unknown mnemonic `movq' -- `movq %rcx,%rbp'
blake2s.s: blake2s.s:33: Error: unknown mnemonic `shrq' -- `shrq $58,%rsi'
blake2s.s: blake2s.s:34: Error: unknown mnemonic `movq' -- `movq %rax,%rcx'
blake2s.s: blake2s.s:35: Error: unknown mnemonic `lea' -- `lea -1(%rsi,%rdx),%rdx'
blake2s.s: blake2s.s:36: Error: unknown mnemonic `sarq' -- `sarq $6,%rdx'
blake2s.s: blake2s.s:37: Error: unknown mnemonic `movq' -- `movq %rdx,%rax'
blake2s.s: blake2s.s:39: Error: unknown mnemonic `vmovdqu' -- `vmovdqu (%rcx,%r9),%xmm9'
blake2s.s: ...

Number of similar (compiler,implementation) pairs: 23, namely:
CompilerImplementations
cc avxicc
gcc avxicc
gcc -O2 -fomit-frame-pointer avxicc
gcc -O3 -fomit-frame-pointer avxicc
gcc -O -fomit-frame-pointer avxicc
gcc -Os -fomit-frame-pointer avxicc
gcc -fno-schedule-insns -O2 -fomit-frame-pointer avxicc
gcc -fno-schedule-insns -O3 -fomit-frame-pointer avxicc
gcc -fno-schedule-insns -O -fomit-frame-pointer avxicc
gcc -fno-schedule-insns -Os -fomit-frame-pointer avxicc
gcc -funroll-loops avxicc
gcc -funroll-loops -O2 -fomit-frame-pointer avxicc
gcc -funroll-loops -O3 -fomit-frame-pointer avxicc
gcc -funroll-loops -O -fomit-frame-pointer avxicc
gcc -funroll-loops -Os -fomit-frame-pointer avxicc
gcc -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer avxicc
gcc -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer avxicc
gcc -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer avxicc
gcc -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer avxicc
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv avxicc
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv avxicc
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv avxicc
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv avxicc

Compiler output

Implementation: avxicc
Security model: unknown
Compiler: clang -O3 -fomit-frame-pointer -Qunused-arguments
blake2s.s: blake2s.s:13:19: error: unknown token in expression
blake2s.s: xorl %r11d, %r11d
blake2s.s: ^
blake2s.s: blake2s.s:13:19: error: invalid operand
blake2s.s: xorl %r11d, %r11d
blake2s.s: ^
blake2s.s: blake2s.s:14:19: error: unknown token in expression
blake2s.s: xorl %ecx, %ecx
blake2s.s: ^
blake2s.s: blake2s.s:14:19: error: invalid operand
blake2s.s: xorl %ecx, %ecx
blake2s.s: ^
blake2s.s: blake2s.s:15:19: error: unknown token in expression
blake2s.s: movq %rsi, %r9
blake2s.s: ^
blake2s.s: blake2s.s:15:19: error: invalid operand
blake2s.s: movq %rsi, %r9
blake2s.s: ^
blake2s.s: blake2s.s:16:40: error: unexpected token in argument list
blake2s.s: vmovdqu .L_2il0floatpacket.10(%rip), %xmm4
blake2s.s: ^
blake2s.s: blake2s.s:17:19: error: unknown token in expression
blake2s.s: xorl %eax, %eax
blake2s.s: ^
blake2s.s: blake2s.s:17:19: error: invalid operand
blake2s.s: ...

Number of similar (compiler,implementation) pairs: 5, namely:
CompilerImplementations
clang -O3 -fomit-frame-pointer -Qunused-arguments avxicc
clang -O3 -fwrapv -mavx2 -fomit-frame-pointer -Qunused-arguments avxicc
clang -O3 -fwrapv -mavx -fomit-frame-pointer -Qunused-arguments avxicc
clang -O3 -fwrapv -mavx -maes -mpclmul -fomit-frame-pointer -Qunused-arguments avxicc
clang -mcpu=native -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments avxicc

Compiler output

Implementation: regs
Security model: unknown
Compiler: cc
blake2s.c: blake2s.c: In function 'crypto_hash_blake2s_regs':
blake2s.c: blake2s.c:796:11: warning: conversion from 'long unsigned int' to 'uint32_t' {aka 'unsigned int'} changes value from '18446744073180816980' to '3766232660' [-Woverflow]
blake2s.c: v14 = ~0x1F83D9ABUL;
blake2s.c: ^

Number of similar (compiler,implementation) pairs: 23, namely:
CompilerImplementations
cc regs
gcc regs
gcc -O2 -fomit-frame-pointer regs
gcc -O3 -fomit-frame-pointer regs
gcc -O -fomit-frame-pointer regs
gcc -Os -fomit-frame-pointer regs
gcc -fno-schedule-insns -O2 -fomit-frame-pointer regs
gcc -fno-schedule-insns -O3 -fomit-frame-pointer regs
gcc -fno-schedule-insns -O -fomit-frame-pointer regs
gcc -fno-schedule-insns -Os -fomit-frame-pointer regs
gcc -funroll-loops regs
gcc -funroll-loops -O2 -fomit-frame-pointer regs
gcc -funroll-loops -O3 -fomit-frame-pointer regs
gcc -funroll-loops -O -fomit-frame-pointer regs
gcc -funroll-loops -Os -fomit-frame-pointer regs
gcc -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer regs
gcc -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer regs
gcc -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer regs
gcc -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer regs
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv regs
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv regs
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv regs
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv regs

Compiler output

Implementation: regs
Security model: unknown
Compiler: clang -O3 -fomit-frame-pointer -Qunused-arguments
blake2s.c: blake2s.c:796:11: warning: implicit conversion from 'unsigned long' to 'uint32_t' (aka 'unsigned int') changes value from 18446744073180816980 to 3766232660 [-Wconstant-conversion]
blake2s.c: v14 = ~0x1F83D9ABUL;
blake2s.c: ~ ^~~~~~~~~~~~~
blake2s.c: 1 warning generated.

Number of similar (compiler,implementation) pairs: 5, namely:
CompilerImplementations
clang -O3 -fomit-frame-pointer -Qunused-arguments regs
clang -O3 -fwrapv -mavx2 -fomit-frame-pointer -Qunused-arguments regs
clang -O3 -fwrapv -mavx -fomit-frame-pointer -Qunused-arguments regs
clang -O3 -fwrapv -mavx -maes -mpclmul -fomit-frame-pointer -Qunused-arguments regs
clang -mcpu=native -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments regs

Compiler output

Implementation: xmm
Security model: unknown
Compiler: cc
blake2s.c: In file included from blake2s.c:21:
blake2s.c: blake2-config.h:68:2: error: #error "This code requires at least SSE2."
blake2s.c: #error "This code requires at least SSE2."
blake2s.c: ^~~~~
blake2s.c: blake2s.c:24:10: fatal error: emmintrin.h: No such file or directory
blake2s.c: #include <emmintrin.h>
blake2s.c: ^~~~~~~~~~~~~
blake2s.c: compilation terminated.

Number of similar (compiler,implementation) pairs: 23, namely:
CompilerImplementations
cc xmm
gcc xmm
gcc -O2 -fomit-frame-pointer xmm
gcc -O3 -fomit-frame-pointer xmm
gcc -O -fomit-frame-pointer xmm
gcc -Os -fomit-frame-pointer xmm
gcc -fno-schedule-insns -O2 -fomit-frame-pointer xmm
gcc -fno-schedule-insns -O3 -fomit-frame-pointer xmm
gcc -fno-schedule-insns -O -fomit-frame-pointer xmm
gcc -fno-schedule-insns -Os -fomit-frame-pointer xmm
gcc -funroll-loops xmm
gcc -funroll-loops -O2 -fomit-frame-pointer xmm
gcc -funroll-loops -O3 -fomit-frame-pointer xmm
gcc -funroll-loops -O -fomit-frame-pointer xmm
gcc -funroll-loops -Os -fomit-frame-pointer xmm
gcc -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer xmm
gcc -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer xmm
gcc -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer xmm
gcc -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer xmm
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv xmm
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv xmm
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv xmm
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv xmm

Compiler output

Implementation: xmm
Security model: unknown
Compiler: clang -O3 -fomit-frame-pointer -Qunused-arguments
blake2s.c: In file included from blake2s.c:21:
blake2s.c: ./blake2-config.h:68:2: error: "This code requires at least SSE2."
blake2s.c: #error "This code requires at least SSE2."
blake2s.c: ^
blake2s.c: In file included from blake2s.c:24:
blake2s.c: In file included from /usr/lib/llvm-7/lib/clang/7.0.1/include/emmintrin.h:27:
blake2s.c: In file included from /usr/lib/llvm-7/lib/clang/7.0.1/include/xmmintrin.h:27:
blake2s.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:64:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
blake2s.c: return (__m64)__builtin_ia32_vec_init_v2si(__i, 0);
blake2s.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
blake2s.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:143:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
blake2s.c: return (__m64)__builtin_ia32_packsswb((__v4hi)__m1, (__v4hi)__m2);
blake2s.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
blake2s.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:173:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
blake2s.c: return (__m64)__builtin_ia32_packssdw((__v2si)__m1, (__v2si)__m2);
blake2s.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
blake2s.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:203:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
blake2s.c: return (__m64)__builtin_ia32_packuswb((__v4hi)__m1, (__v4hi)__m2);
blake2s.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
blake2s.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:230:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
blake2s.c: return (__m64)__builtin_ia32_punpckhbw((__v8qi)__m1, (__v8qi)__m2);
blake2s.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
blake2s.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:253:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
blake2s.c: return (__m64)__builtin_ia32_punpckhwd((__v4hi)__m1, (__v4hi)__m2);
blake2s.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
blake2s.c: ...

Number of similar (compiler,implementation) pairs: 5, namely:
CompilerImplementations
clang -O3 -fomit-frame-pointer -Qunused-arguments xmm
clang -O3 -fwrapv -mavx2 -fomit-frame-pointer -Qunused-arguments xmm
clang -O3 -fwrapv -mavx -fomit-frame-pointer -Qunused-arguments xmm
clang -O3 -fwrapv -mavx -maes -mpclmul -fomit-frame-pointer -Qunused-arguments xmm
clang -mcpu=native -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments xmm