Implementation notes: armeabi, c1, crypto_stream/chacha20

Computer: c1
Architecture: armeabi
CPU ID: unknown CPU ID
SUPERCOP version: 20161026
Operation: crypto_stream
Primitive: chacha20
TimeImplementationCompilerBenchmark dateSUPERCOP version
22641moon/neon/32gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O32016121020161026
22655moon/neon/32gcc -mcpu=native -mfpu=neon-vfpv4 -O32016121020161026
22660moon/neon/32gcc -mcpu=native -mfpu=neon-vfpv4 -O22016121020161026
22670moon/neon/32gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os2016121020161026
22742moon/neon/32gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O22016121020161026
25889moon/neon/32gcc -mcpu=native -mfpu=neon-vfpv4 -Os2016121020161026
26085dolbeau/arm-neongcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O32016121020161026
26196dolbeau/arm-neongcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O22016121020161026
27554dolbeau/arm-neongcc -mcpu=native -mfpu=neon-vfpv4 -O22016121020161026
29138dolbeau/arm-neongcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os2016121020161026
30635moon/armv6/32gcc -mcpu=native -mfpu=neon-vfpv4 -O22016121020161026
30635moon/armv6/32gcc -mcpu=native -mfpu=neon-vfpv4 -O32016121020161026
30643moon/armv6/32gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O22016121020161026
30645moon/armv6/32gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os2016121020161026
30645moon/armv6/32gcc -mcpu=native -mfpu=neon-vfpv4 -Os2016121020161026
30654moon/armv6/32gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O32016121020161026
31065dolbeau/arm-neongcc -mcpu=native -mfpu=neon-vfpv4 -Os2016121020161026
31501dolbeau/arm-neongcc -mcpu=native -mfpu=neon-vfpv4 -O32016121020161026
35805e/mergedgcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os2016121020161026
36570e/mergedgcc -mcpu=native -mfpu=neon-vfpv4 -Os2016121020161026
41117e/regsgcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O32016121020161026
41875e/regsgcc -mcpu=native -mfpu=neon-vfpv4 -O32016121020161026
42199e/mergedgcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O32016121020161026
42210e/mergedgcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O22016121020161026
42716e/mergedgcc -mcpu=native -mfpu=neon-vfpv4 -O22016121020161026
42790e/mergedgcc -mcpu=native -mfpu=neon-vfpv4 -O32016121020161026
43148dolbeau/mipsel-msagcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O32016121020161026
43264e/refgcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O32016121020161026
43575dolbeau/mipsel-msagcc -mcpu=native -mfpu=neon-vfpv4 -O32016121020161026
43899e/refgcc -mcpu=native -mfpu=neon-vfpv4 -O32016121020161026
46607dolbeau/mipsel-msagcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O22016121020161026
50064e/regsgcc -mcpu=native -mfpu=neon-vfpv4 -O22016121020161026
51705e/regsgcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O22016121020161026
52408e/regsgcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os2016121020161026
52695e/refgcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O22016121020161026
57990dolbeau/mipsel-msagcc -mcpu=native -mfpu=neon-vfpv4 -O22016121020161026
59191e/refgcc -mcpu=native -mfpu=neon-vfpv4 -O22016121020161026
59764e/refgcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os2016121020161026
59836dolbeau/mipsel-msagcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os2016121020161026
61298e/regsgcc -mcpu=native -mfpu=neon-vfpv4 -Os2016121020161026
65394dolbeau/mipsel-msagcc -mcpu=native -mfpu=neon-vfpv4 -Os2016121020161026
65966e/refgcc -mcpu=native -mfpu=neon-vfpv4 -Os2016121020161026

Test failure

Implementation: crypto_stream/chacha20/krovetz/vec128
Compiler: gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2
error 111
crypto_stream_xor does not handle n=c overlap

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 krovetz/vec128
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 krovetz/vec128
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os krovetz/vec128
gcc -mcpu=native -mfpu=neon-vfpv4 -O2 krovetz/vec128
gcc -mcpu=native -mfpu=neon-vfpv4 -O3 krovetz/vec128
gcc -mcpu=native -mfpu=neon-vfpv4 -Os krovetz/vec128

Compiler output

Implementation: crypto_stream/chacha20/dolbeau/ppc-altivec
Compiler: gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2
chacha.c: chacha.c:11:21: fatal error: altivec.h: No such file or directory
chacha.c: #include gt;
chacha.c: ^
chacha.c: compilation terminated.

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 dolbeau/ppc-altivec
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 dolbeau/ppc-altivec
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os dolbeau/ppc-altivec
gcc -mcpu=native -mfpu=neon-vfpv4 -O2 dolbeau/ppc-altivec
gcc -mcpu=native -mfpu=neon-vfpv4 -O3 dolbeau/ppc-altivec
gcc -mcpu=native -mfpu=neon-vfpv4 -Os dolbeau/ppc-altivec

Compiler output

Implementation: crypto_stream/chacha20/amd64-ssse3
Compiler: gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2
chacha.s: chacha.s: Assembler messages:
chacha.s: chacha.s:22: Error: ARM register expected -- `mov %rsp,%r11'
chacha.s: chacha.s:23: Error: ARM register expected -- `and $31,%r11'
chacha.s: chacha.s:24: Error: ARM register expected -- `add $384,%r11'
chacha.s: chacha.s:25: Error: immediate expression requires a # prefix -- `sub %r11,%rsp'
chacha.s: chacha.s:26: Error: ARM register expected -- `mov %rdi,%r8'
chacha.s: chacha.s:27: Error: ARM register expected -- `mov %rsi,%rsi'
chacha.s: chacha.s:28: Error: ARM register expected -- `mov %rsi,%rdi'
chacha.s: chacha.s:29: Error: ARM register expected -- `mov %rdx,%rdx'
chacha.s: chacha.s:30: Error: ARM register expected -- `cmp $0,%rdx'
chacha.s: chacha.s:32: Error: bad instruction `jbe ._done'
chacha.s: chacha.s:34: Error: ARM register expected -- `mov $0,%rax'
chacha.s: chacha.s:36: Error: ARM register expected -- `mov %rdx,%rcx'
chacha.s: chacha.s:38: Error: bad instruction `rep stosb'
chacha.s: chacha.s:40: Error: ARM register expected -- `sub %rdx,%rdi'
chacha.s: chacha.s:42: Error: bad instruction `jmp ._start'
chacha.s: chacha.s:50: Error: ARM register expected -- `mov %rsp,%r11'
chacha.s: chacha.s:51: Error: ARM register expected -- `and $31,%r11'
chacha.s: chacha.s:52: Error: ARM register expected -- `add $384,%r11'
chacha.s: chacha.s:53: Error: immediate expression requires a # prefix -- `sub %r11,%rsp'
chacha.s: chacha.s:55: Error: ARM register expected -- `mov %rdi,%r8'
chacha.s: chacha.s:57: Error: ARM register expected -- `mov %rsi,%rsi'
chacha.s: chacha.s:59: Error: ARM register expected -- `mov %rdx,%rdi'
chacha.s: chacha.s:61: Error: ARM register expected -- `mov %rcx,%rdx'
chacha.s: chacha.s:63: Error: ARM register expected -- `cmp $0,%rdx'
chacha.s: ...

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 amd64-ssse3
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 amd64-ssse3
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os amd64-ssse3
gcc -mcpu=native -mfpu=neon-vfpv4 -O2 amd64-ssse3
gcc -mcpu=native -mfpu=neon-vfpv4 -O3 amd64-ssse3
gcc -mcpu=native -mfpu=neon-vfpv4 -Os amd64-ssse3

Compiler output

Implementation: crypto_stream/chacha20/goll_gueron
Compiler: gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2
stream.c: stream.c:11:23: fatal error: immintrin.h: No such file or directory
stream.c: #include gt;
stream.c: ^
stream.c: compilation terminated.

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 goll_gueron
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 goll_gueron
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os goll_gueron
gcc -mcpu=native -mfpu=neon-vfpv4 -O2 goll_gueron
gcc -mcpu=native -mfpu=neon-vfpv4 -O3 goll_gueron
gcc -mcpu=native -mfpu=neon-vfpv4 -Os goll_gueron

Compiler output

Implementation: crypto_stream/chacha20/krovetz/avx2
Compiler: gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2
stream.c: stream.c:8:23: fatal error: immintrin.h: No such file or directory
stream.c: #include gt;
stream.c: ^
stream.c: compilation terminated.

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 krovetz/avx2
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 krovetz/avx2
gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os krovetz/avx2
gcc -mcpu=native -mfpu=neon-vfpv4 -O2 krovetz/avx2
gcc -mcpu=native -mfpu=neon-vfpv4 -O3 krovetz/avx2
gcc -mcpu=native -mfpu=neon-vfpv4 -Os krovetz/avx2