Implementation notes: aarch64, par3, crypto_stream/chacha20

Computer: par3
Architecture: aarch64
CPU ID: unknown CPU ID
SUPERCOP version: 20170718
Operation: crypto_stream
Primitive: chacha20
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
8576? ? ?? ? ?dolbeau/arm-neongcc_-funroll-loops_-march=native_-mtune=native_-O22017071920170718
8587? ? ?? ? ?dolbeau/arm-neongcc_-funroll-loops_-march=native_-mtune=native_-O32017071920170718
8652? ? ?? ? ?dolbeau/arm-neongcc_-march=native_-mtune=native_-O22017071920170718
8782? ? ?? ? ?dolbeau/arm-neongcc_-march=native_-mtune=native_-O32017071920170718
9081? ? ?? ? ?dolbeau/arm-neongcc_-funroll-loops_-march=native_-mtune=native_-Os2017071920170718
9286? ? ?? ? ?dolbeau/arm-neongcc_-march=native_-mtune=native_-Os2017071920170718
15011? ? ?? ? ?e/mergedgcc_-funroll-loops_-march=native_-mtune=native_-O32017071920170718
15291? ? ?? ? ?dolbeau/mipsel-msagcc_-march=native_-mtune=native_-O32017071920170718
15310? ? ?? ? ?e/regsgcc_-funroll-loops_-march=native_-mtune=native_-O32017071920170718
15311? ? ?? ? ?e/regsgcc_-march=native_-mtune=native_-O32017071920170718
15319? ? ?? ? ?e/mergedgcc_-march=native_-mtune=native_-O32017071920170718
15329? ? ?? ? ?e/refgcc_-march=native_-mtune=native_-O32017071920170718
15337? ? ?? ? ?e/mergedgcc_-march=native_-mtune=native_-O22017071920170718
15420? ? ?? ? ?e/mergedgcc_-funroll-loops_-march=native_-mtune=native_-O22017071920170718
15526? ? ?? ? ?e/refgcc_-funroll-loops_-march=native_-mtune=native_-O32017071920170718
15813? ? ?? ? ?dolbeau/mipsel-msagcc_-funroll-loops_-march=native_-mtune=native_-O32017071920170718
15880? ? ?? ? ?e/mergedgcc_-funroll-loops_-march=native_-mtune=native_-Os2017071920170718
15892? ? ?? ? ?e/mergedgcc_-march=native_-mtune=native_-Os2017071920170718
20490? ? ?? ? ?dolbeau/mipsel-msagcc_-funroll-loops_-march=native_-mtune=native_-O22017071920170718
21068? ? ?? ? ?e/regsgcc_-funroll-loops_-march=native_-mtune=native_-O22017071920170718
21234? ? ?? ? ?e/refgcc_-funroll-loops_-march=native_-mtune=native_-O22017071920170718
25590? ? ?? ? ?e/regsgcc_-march=native_-mtune=native_-O22017071920170718
28227? ? ?? ? ?e/regsgcc_-funroll-loops_-march=native_-mtune=native_-Os2017071920170718
29631? ? ?? ? ?e/refgcc_-march=native_-mtune=native_-O22017071920170718
29894? ? ?? ? ?dolbeau/mipsel-msagcc_-march=native_-mtune=native_-O22017071920170718
31180? ? ?? ? ?e/regsgcc_-march=native_-mtune=native_-Os2017071920170718
31253? ? ?? ? ?e/refgcc_-funroll-loops_-march=native_-mtune=native_-Os2017071920170718
31255? ? ?? ? ?dolbeau/mipsel-msagcc_-funroll-loops_-march=native_-mtune=native_-Os2017071920170718
31255? ? ?? ? ?dolbeau/mipsel-msagcc_-march=native_-mtune=native_-Os2017071920170718
31276? ? ?? ? ?e/refgcc_-march=native_-mtune=native_-Os2017071920170718

Compiler output

Implementation: crypto_stream/chacha20/dolbeau/ppc-altivec
Compiler: gcc -funroll-loops -march=native -mtune=native -O2
chacha.c: chacha.c:11:10: fatal error: altivec.h: No such file or directory
chacha.c: #include <altivec.h>
chacha.c: ^~~~~~~~~~~
chacha.c: compilation terminated.

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
gcc -funroll-loops -march=native -mtune=native -O2 dolbeau/ppc-altivec
gcc -funroll-loops -march=native -mtune=native -O3 dolbeau/ppc-altivec
gcc -funroll-loops -march=native -mtune=native -Os dolbeau/ppc-altivec
gcc -march=native -mtune=native -O2 dolbeau/ppc-altivec
gcc -march=native -mtune=native -O3 dolbeau/ppc-altivec
gcc -march=native -mtune=native -Os dolbeau/ppc-altivec

Compiler output

Implementation: crypto_stream/chacha20/amd64-ssse3
Compiler: gcc -funroll-loops -march=native -mtune=native -O2
chacha.s: chacha.s: Assembler messages:
chacha.s: chacha.s:22: Error: operand 1 must be an integer register -- `mov %rsp,%r11'
chacha.s: chacha.s:23: Error: operand 1 must be an integer or stack pointer register -- `and $31,%r11'
chacha.s: chacha.s:24: Error: operand 1 must be an integer or stack pointer register -- `add $384,%r11'
chacha.s: chacha.s:25: Error: operand 1 must be an integer or stack pointer register -- `sub %r11,%rsp'
chacha.s: chacha.s:26: Error: operand 1 must be an integer register -- `mov %rdi,%r8'
chacha.s: chacha.s:27: Error: operand 1 must be an integer register -- `mov %rsi,%rsi'
chacha.s: chacha.s:28: Error: operand 1 must be an integer register -- `mov %rsi,%rdi'
chacha.s: chacha.s:29: Error: operand 1 must be an integer register -- `mov %rdx,%rdx'
chacha.s: chacha.s:30: Error: operand 1 must be an integer or stack pointer register -- `cmp $0,%rdx'
chacha.s: chacha.s:32: Error: unknown mnemonic `jbe' -- `jbe ._done'
chacha.s: chacha.s:34: Error: operand 1 must be an integer register -- `mov $0,%rax'
chacha.s: chacha.s:36: Error: operand 1 must be an integer register -- `mov %rdx,%rcx'
chacha.s: chacha.s:38: Error: unknown mnemonic `rep' -- `rep stosb'
chacha.s: chacha.s:40: Error: operand 1 must be an integer or stack pointer register -- `sub %rdx,%rdi'
chacha.s: chacha.s:42: Error: unknown mnemonic `jmp' -- `jmp ._start'
chacha.s: chacha.s:50: Error: operand 1 must be an integer register -- `mov %rsp,%r11'
chacha.s: chacha.s:51: Error: operand 1 must be an integer or stack pointer register -- `and $31,%r11'
chacha.s: chacha.s:52: Error: operand 1 must be an integer or stack pointer register -- `add $384,%r11'
chacha.s: chacha.s:53: Error: operand 1 must be an integer or stack pointer register -- `sub %r11,%rsp'
chacha.s: chacha.s:55: Error: operand 1 must be an integer register -- `mov %rdi,%r8'
chacha.s: chacha.s:57: Error: operand 1 must be an integer register -- `mov %rsi,%rsi'
chacha.s: chacha.s:59: Error: operand 1 must be an integer register -- `mov %rdx,%rdi'
chacha.s: chacha.s:61: Error: operand 1 must be an integer register -- `mov %rcx,%rdx'
chacha.s: chacha.s:63: Error: operand 1 must be an integer or stack pointer register -- `cmp $0,%rdx'
chacha.s: ...

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
gcc -funroll-loops -march=native -mtune=native -O2 amd64-ssse3
gcc -funroll-loops -march=native -mtune=native -O3 amd64-ssse3
gcc -funroll-loops -march=native -mtune=native -Os amd64-ssse3
gcc -march=native -mtune=native -O2 amd64-ssse3
gcc -march=native -mtune=native -O3 amd64-ssse3
gcc -march=native -mtune=native -Os amd64-ssse3

Compiler output

Implementation: crypto_stream/chacha20/goll_gueron
Compiler: gcc -funroll-loops -march=native -mtune=native -O2
stream.c: stream.c:11:10: fatal error: immintrin.h: No such file or directory
stream.c: #include <immintrin.h>
stream.c: ^~~~~~~~~~~~~
stream.c: compilation terminated.

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
gcc -funroll-loops -march=native -mtune=native -O2 goll_gueron
gcc -funroll-loops -march=native -mtune=native -O3 goll_gueron
gcc -funroll-loops -march=native -mtune=native -Os goll_gueron
gcc -march=native -mtune=native -O2 goll_gueron
gcc -march=native -mtune=native -O3 goll_gueron
gcc -march=native -mtune=native -Os goll_gueron

Compiler output

Implementation: crypto_stream/chacha20/krovetz/vec128
Compiler: gcc -funroll-loops -march=native -mtune=native -O2
stream.c: stream.c:80:2: error: #error -- Implementation supports only machines with neon, altivec or SSE2
stream.c: #error -- Implementation supports only machines with neon, altivec or SSE2
stream.c: ^~~~~
stream.c: stream.c: In function 'crypto_stream_chacha20_krovetz_vec128_xor':
stream.c: stream.c:151:14: warning: implicit declaration of function 'NONCE' [-Wimplicit-function-declaration]
stream.c: vec s3 = NONCE(np);
stream.c: ^~~~~
stream.c: stream.c:151:14: error: incompatible types when initializing type 'vec {aka __vector(4) unsigned int}' using type 'int'
stream.c: stream.c:91:19: error: 'VBPI' undeclared (first use in this function); did you mean 'BPI'?
stream.c: #define BPI (VBPI + GPR_TOO) /* Blocks computed per loop iteration */
stream.c: ^
stream.c: stream.c:152:36: note: in expansion of macro 'BPI'
stream.c: for (iters = 0; iters < inlen/(BPI*64); iters++) {
stream.c: ^~~
stream.c: stream.c:91:19: note: each undeclared identifier is reported only once for each function it appears in
stream.c: #define BPI (VBPI + GPR_TOO) /* Blocks computed per loop iteration */
stream.c: ^
stream.c: stream.c:152:36: note: in expansion of macro 'BPI'
stream.c: for (iters = 0; iters < inlen/(BPI*64); iters++) {
stream.c: ^~~
stream.c: stream.c:91:26: error: 'GPR_TOO' undeclared (first use in this function)
stream.c: #define BPI (VBPI + GPR_TOO) /* Blocks computed per loop iteration */
stream.c: ^
stream.c: stream.c:152:36: note: in expansion of macro 'BPI'
stream.c: for (iters = 0; iters < inlen/(BPI*64); iters++) {
stream.c: ...

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
gcc -funroll-loops -march=native -mtune=native -O2 krovetz/vec128
gcc -funroll-loops -march=native -mtune=native -O3 krovetz/vec128
gcc -funroll-loops -march=native -mtune=native -Os krovetz/vec128
gcc -march=native -mtune=native -O2 krovetz/vec128
gcc -march=native -mtune=native -O3 krovetz/vec128
gcc -march=native -mtune=native -Os krovetz/vec128

Compiler output

Implementation: crypto_stream/chacha20/krovetz/avx2
Compiler: gcc -funroll-loops -march=native -mtune=native -O2
stream.c: stream.c:8:10: fatal error: immintrin.h: No such file or directory
stream.c: #include <immintrin.h>
stream.c: ^~~~~~~~~~~~~
stream.c: compilation terminated.

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
gcc -funroll-loops -march=native -mtune=native -O2 krovetz/avx2
gcc -funroll-loops -march=native -mtune=native -O3 krovetz/avx2
gcc -funroll-loops -march=native -mtune=native -Os krovetz/avx2
gcc -march=native -mtune=native -O2 krovetz/avx2
gcc -march=native -mtune=native -O3 krovetz/avx2
gcc -march=native -mtune=native -Os krovetz/avx2