Implementation notes: armeabi, c1, crypto_stream/chacha8

Computer: c1
Architecture: armeabi
CPU ID: unknown CPU ID
SUPERCOP version: 20161026
Operation: crypto_stream
Primitive: chacha8
TimeImplementationCompilerBenchmark dateSUPERCOP version
10941moon/neon/32gcc -mcpu=native -mfpu=neon-vfpv4 -O22016121020161026
10945moon/neon/32gcc -mcpu=native -mfpu=neon-vfpv4 -O32016121020161026
10955moon/neon/32gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os2016121020161026
10956moon/neon/32gcc -mcpu=native -mfpu=neon-vfpv4 -Os2016121020161026
11005moon/neon/32gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O32016121020161026
12939dolbeau/arm-neongcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O32016121020161026
12992dolbeau/arm-neongcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O22016121020161026
13507moon/neon/32gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O22016121020161026
13813dolbeau/arm-neongcc -mcpu=native -mfpu=neon-vfpv4 -O32016121020161026
13930dolbeau/arm-neongcc -mcpu=native -mfpu=neon-vfpv4 -O22016121020161026
13985dolbeau/arm-neongcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os2016121020161026
14781moon/armv6/32gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O32016121020161026
14790moon/armv6/32gcc -mcpu=native -mfpu=neon-vfpv4 -O22016121020161026
14790moon/armv6/32gcc -mcpu=native -mfpu=neon-vfpv4 -O32016121020161026
14805moon/armv6/32gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os2016121020161026
14816moon/armv6/32gcc -mcpu=native -mfpu=neon-vfpv4 -Os2016121020161026
14855dolbeau/arm-neongcc -mcpu=native -mfpu=neon-vfpv4 -Os2016121020161026
16674moon/armv6/32gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O22016121020161026
19225e/mergedgcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os2016121020161026
20007e/mergedgcc -mcpu=native -mfpu=neon-vfpv4 -Os2016121020161026
21418e/mergedgcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O32016121020161026
21419e/mergedgcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O22016121020161026
21705e/mergedgcc -mcpu=native -mfpu=neon-vfpv4 -O22016121020161026
21720e/mergedgcc -mcpu=native -mfpu=neon-vfpv4 -O32016121020161026
22821dolbeau/mipsel-msagcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O32016121020161026
22941e/refgcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O32016121020161026
22941e/regsgcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O32016121020161026
23113dolbeau/mipsel-msagcc -mcpu=native -mfpu=neon-vfpv4 -O32016121020161026
23227e/refgcc -mcpu=native -mfpu=neon-vfpv4 -O32016121020161026
23370e/regsgcc -mcpu=native -mfpu=neon-vfpv4 -O32016121020161026
27536e/refgcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O22016121020161026
28197dolbeau/mipsel-msagcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O22016121020161026
31670e/regsgcc -mcpu=native -mfpu=neon-vfpv4 -O22016121020161026
32336e/regsgcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O22016121020161026
35545e/regsgcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os2016121020161026
36966dolbeau/mipsel-msagcc -mcpu=native -mfpu=neon-vfpv4 -O22016121020161026
37505e/refgcc -mcpu=native -mfpu=neon-vfpv4 -O22016121020161026
42762e/regsgcc -mcpu=native -mfpu=neon-vfpv4 -Os2016121020161026
43065e/refgcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os2016121020161026
43085dolbeau/mipsel-msagcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os2016121020161026
48450e/refgcc -mcpu=native -mfpu=neon-vfpv4 -Os2016121020161026
52270dolbeau/mipsel-msagcc -mcpu=native -mfpu=neon-vfpv4 -Os2016121020161026

Test failure

Implementation: crypto_stream/chacha8/krovetz/vec128
Compiler: gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2
error 111
crypto_stream_xor does not handle n=c overlap

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 krovetz/vec128
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 krovetz/vec128
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os krovetz/vec128
gcc -mcpu=native -mfpu=neon-vfpv4 -O2 krovetz/vec128
gcc -mcpu=native -mfpu=neon-vfpv4 -O3 krovetz/vec128
gcc -mcpu=native -mfpu=neon-vfpv4 -Os krovetz/vec128

Compiler output

Implementation: crypto_stream/chacha8/dolbeau/ppc-altivec
Compiler: gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2
chacha.c: chacha.c:11:21: fatal error: altivec.h: No such file or directory
chacha.c: #include gt;
chacha.c: ^
chacha.c: compilation terminated.

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 dolbeau/ppc-altivec
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 dolbeau/ppc-altivec
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os dolbeau/ppc-altivec
gcc -mcpu=native -mfpu=neon-vfpv4 -O2 dolbeau/ppc-altivec
gcc -mcpu=native -mfpu=neon-vfpv4 -O3 dolbeau/ppc-altivec
gcc -mcpu=native -mfpu=neon-vfpv4 -Os dolbeau/ppc-altivec

Compiler output

Implementation: crypto_stream/chacha8/amd64-ssse3
Compiler: gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2
chacha.s: chacha.s: Assembler messages:
chacha.s: chacha.s:22: Error: ARM register expected -- `mov %rsp,%r11'
chacha.s: chacha.s:23: Error: ARM register expected -- `and $31,%r11'
chacha.s: chacha.s:24: Error: ARM register expected -- `add $384,%r11'
chacha.s: chacha.s:25: Error: immediate expression requires a # prefix -- `sub %r11,%rsp'
chacha.s: chacha.s:26: Error: ARM register expected -- `mov %rdi,%r8'
chacha.s: chacha.s:27: Error: ARM register expected -- `mov %rsi,%rsi'
chacha.s: chacha.s:28: Error: ARM register expected -- `mov %rsi,%rdi'
chacha.s: chacha.s:29: Error: ARM register expected -- `mov %rdx,%rdx'
chacha.s: chacha.s:30: Error: ARM register expected -- `cmp $0,%rdx'
chacha.s: chacha.s:32: Error: bad instruction `jbe ._done'
chacha.s: chacha.s:34: Error: ARM register expected -- `mov $0,%rax'
chacha.s: chacha.s:36: Error: ARM register expected -- `mov %rdx,%rcx'
chacha.s: chacha.s:38: Error: bad instruction `rep stosb'
chacha.s: chacha.s:40: Error: ARM register expected -- `sub %rdx,%rdi'
chacha.s: chacha.s:42: Error: bad instruction `jmp ._start'
chacha.s: chacha.s:50: Error: ARM register expected -- `mov %rsp,%r11'
chacha.s: chacha.s:51: Error: ARM register expected -- `and $31,%r11'
chacha.s: chacha.s:52: Error: ARM register expected -- `add $384,%r11'
chacha.s: chacha.s:53: Error: immediate expression requires a # prefix -- `sub %r11,%rsp'
chacha.s: chacha.s:55: Error: ARM register expected -- `mov %rdi,%r8'
chacha.s: chacha.s:57: Error: ARM register expected -- `mov %rsi,%rsi'
chacha.s: chacha.s:59: Error: ARM register expected -- `mov %rdx,%rdi'
chacha.s: chacha.s:61: Error: ARM register expected -- `mov %rcx,%rdx'
chacha.s: chacha.s:63: Error: ARM register expected -- `cmp $0,%rdx'
chacha.s: ...

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 amd64-ssse3
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 amd64-ssse3
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os amd64-ssse3
gcc -mcpu=native -mfpu=neon-vfpv4 -O2 amd64-ssse3
gcc -mcpu=native -mfpu=neon-vfpv4 -O3 amd64-ssse3
gcc -mcpu=native -mfpu=neon-vfpv4 -Os amd64-ssse3

Compiler output

Implementation: crypto_stream/chacha8/goll_gueron
Compiler: gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2
stream.c: stream.c:11:23: fatal error: immintrin.h: No such file or directory
stream.c: #include gt;
stream.c: ^
stream.c: compilation terminated.

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 goll_gueron
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 goll_gueron
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os goll_gueron
gcc -mcpu=native -mfpu=neon-vfpv4 -O2 goll_gueron
gcc -mcpu=native -mfpu=neon-vfpv4 -O3 goll_gueron
gcc -mcpu=native -mfpu=neon-vfpv4 -Os goll_gueron

Compiler output

Implementation: crypto_stream/chacha8/krovetz/avx2
Compiler: gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2
stream.c: stream.c:8:23: fatal error: immintrin.h: No such file or directory
stream.c: #include gt;
stream.c: ^
stream.c: compilation terminated.

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 krovetz/avx2
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 krovetz/avx2
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os krovetz/avx2
gcc -mcpu=native -mfpu=neon-vfpv4 -O2 krovetz/avx2
gcc -mcpu=native -mfpu=neon-vfpv4 -O3 krovetz/avx2
gcc -mcpu=native -mfpu=neon-vfpv4 -Os krovetz/avx2