Implementation notes: armeabi, cubie2, crypto_stream/chacha12

Computer: cubie2
Architecture: armeabi
CPU ID: unknown CPU ID
SUPERCOP version: 20161026
Operation: crypto_stream
Primitive: chacha12
TimeImplementationCompilerBenchmark dateSUPERCOP version
13358dolbeau/arm-neongcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O22016121620161026
13361dolbeau/arm-neongcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O32016121620161026
13514dolbeau/arm-neongcc -mcpu=native -mfpu=neon-vfpv4 -O22016121620161026
13533dolbeau/arm-neongcc -mcpu=native -mfpu=neon-vfpv4 -O32016121620161026
13935moon/neon/32gcc -mcpu=native -mfpu=neon-vfpv4 -Os2016121620161026
13937moon/neon/32gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os2016121620161026
13938moon/neon/32gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O22016121620161026
13938moon/neon/32gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O32016121620161026
13971moon/neon/32gcc -mcpu=native -mfpu=neon-vfpv4 -O32016121620161026
14139moon/neon/32gcc -mcpu=native -mfpu=neon-vfpv4 -O22016121620161026
16080dolbeau/arm-neongcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os2016121620161026
17101dolbeau/arm-neongcc -mcpu=native -mfpu=neon-vfpv4 -Os2016121620161026
18897moon/armv6/32gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O32016121620161026
18900moon/armv6/32gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O22016121620161026
18909moon/armv6/32gcc -mcpu=native -mfpu=neon-vfpv4 -O22016121620161026
18909moon/armv6/32gcc -mcpu=native -mfpu=neon-vfpv4 -O32016121620161026
18909moon/armv6/32gcc -mcpu=native -mfpu=neon-vfpv4 -Os2016121620161026
18914moon/armv6/32gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os2016121620161026
24006e/mergedgcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os2016121620161026
24771e/mergedgcc -mcpu=native -mfpu=neon-vfpv4 -Os2016121620161026
25761e/mergedgcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O32016121620161026
25782e/mergedgcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O22016121620161026
25941e/mergedgcc -mcpu=native -mfpu=neon-vfpv4 -O22016121620161026
25965e/mergedgcc -mcpu=native -mfpu=neon-vfpv4 -O32016121620161026
27768e/regsgcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O32016121620161026
27783dolbeau/mipsel-msagcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O32016121620161026
27792e/refgcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O32016121620161026
27897dolbeau/mipsel-msagcc -mcpu=native -mfpu=neon-vfpv4 -O32016121620161026
27996e/refgcc -mcpu=native -mfpu=neon-vfpv4 -O32016121620161026
28068e/regsgcc -mcpu=native -mfpu=neon-vfpv4 -O32016121620161026
32652e/refgcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O22016121620161026
33285dolbeau/mipsel-msagcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O22016121620161026
33840e/regsgcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O22016121620161026
36438e/regsgcc -mcpu=native -mfpu=neon-vfpv4 -O22016121620161026
37524e/regsgcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os2016121620161026
39537dolbeau/mipsel-msagcc -mcpu=native -mfpu=neon-vfpv4 -O22016121620161026
39645e/refgcc -mcpu=native -mfpu=neon-vfpv4 -O22016121620161026
42588e/regsgcc -mcpu=native -mfpu=neon-vfpv4 -Os2016121620161026
43401dolbeau/mipsel-msagcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os2016121620161026
43401e/refgcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os2016121620161026
49188dolbeau/mipsel-msagcc -mcpu=native -mfpu=neon-vfpv4 -Os2016121620161026
49209e/refgcc -mcpu=native -mfpu=neon-vfpv4 -Os2016121620161026

Test failure

Implementation: crypto_stream/chacha12/krovetz/vec128
Compiler: gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2
error 111
crypto_stream_xor does not handle n=c overlap

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 krovetz/vec128
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 krovetz/vec128
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os krovetz/vec128
gcc -mcpu=native -mfpu=neon-vfpv4 -O2 krovetz/vec128
gcc -mcpu=native -mfpu=neon-vfpv4 -O3 krovetz/vec128
gcc -mcpu=native -mfpu=neon-vfpv4 -Os krovetz/vec128

Compiler output

Implementation: crypto_stream/chacha12/dolbeau/ppc-altivec
Compiler: gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2
chacha.c: chacha.c:11:21: fatal error: altivec.h: No such file or directory
chacha.c: #include gt;
chacha.c: ^
chacha.c: compilation terminated.

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 dolbeau/ppc-altivec
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 dolbeau/ppc-altivec
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os dolbeau/ppc-altivec
gcc -mcpu=native -mfpu=neon-vfpv4 -O2 dolbeau/ppc-altivec
gcc -mcpu=native -mfpu=neon-vfpv4 -O3 dolbeau/ppc-altivec
gcc -mcpu=native -mfpu=neon-vfpv4 -Os dolbeau/ppc-altivec

Compiler output

Implementation: crypto_stream/chacha12/amd64-ssse3
Compiler: gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2
chacha.s: chacha.s: Assembler messages:
chacha.s: chacha.s:22: Error: ARM register expected -- `mov %rsp,%r11'
chacha.s: chacha.s:23: Error: ARM register expected -- `and $31,%r11'
chacha.s: chacha.s:24: Error: ARM register expected -- `add $384,%r11'
chacha.s: chacha.s:25: Error: immediate expression requires a # prefix -- `sub %r11,%rsp'
chacha.s: chacha.s:26: Error: ARM register expected -- `mov %rdi,%r8'
chacha.s: chacha.s:27: Error: ARM register expected -- `mov %rsi,%rsi'
chacha.s: chacha.s:28: Error: ARM register expected -- `mov %rsi,%rdi'
chacha.s: chacha.s:29: Error: ARM register expected -- `mov %rdx,%rdx'
chacha.s: chacha.s:30: Error: ARM register expected -- `cmp $0,%rdx'
chacha.s: ...
chacha.s: chacha.s:1516: Error: bad instruction `movl 0(%rsi),%eax'
chacha.s: chacha.s:1518: Error: bad instruction `movl 4(%rsi),%esi'
chacha.s: chacha.s:1520: Error: bad instruction `movl %r8d,48(%rdi)'
chacha.s: chacha.s:1522: Error: bad instruction `movl %r9d,52(%rdi)'
chacha.s: chacha.s:1524: Error: bad instruction `movl %eax,56(%rdi)'
chacha.s: chacha.s:1526: Error: bad instruction `movl %esi,60(%rdi)'
chacha.s: chacha.s:1528: Error: immediate expression requires a # prefix -- `add %r11,%rsp'
chacha.s: chacha.s:1529: Error: ARM register expected -- `mov %rdi,%rax'
chacha.s: chacha.s:1530: Error: ARM register expected -- `mov %rsi,%rdx'
chacha.s: chacha.s:1531: Error: bad instruction `ret'

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 amd64-ssse3
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 amd64-ssse3
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os amd64-ssse3
gcc -mcpu=native -mfpu=neon-vfpv4 -O2 amd64-ssse3
gcc -mcpu=native -mfpu=neon-vfpv4 -O3 amd64-ssse3
gcc -mcpu=native -mfpu=neon-vfpv4 -Os amd64-ssse3

Compiler output

Implementation: crypto_stream/chacha12/goll_gueron
Compiler: gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2
stream.c: stream.c:11:23: fatal error: immintrin.h: No such file or directory
stream.c: #include gt;
stream.c: ^
stream.c: compilation terminated.

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 goll_gueron
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 goll_gueron
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os goll_gueron
gcc -mcpu=native -mfpu=neon-vfpv4 -O2 goll_gueron
gcc -mcpu=native -mfpu=neon-vfpv4 -O3 goll_gueron
gcc -mcpu=native -mfpu=neon-vfpv4 -Os goll_gueron

Compiler output

Implementation: crypto_stream/chacha12/krovetz/avx2
Compiler: gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2
stream.c: stream.c:8:23: fatal error: immintrin.h: No such file or directory
stream.c: #include gt;
stream.c: ^
stream.c: compilation terminated.

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 krovetz/avx2
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 krovetz/avx2
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os krovetz/avx2
gcc -mcpu=native -mfpu=neon-vfpv4 -O2 krovetz/avx2
gcc -mcpu=native -mfpu=neon-vfpv4 -O3 krovetz/avx2
gcc -mcpu=native -mfpu=neon-vfpv4 -Os krovetz/avx2