Implementation notes: armeabi, rpi3b, crypto_stream/chacha20

Computer: rpi3b
Architecture: armeabi
CPU ID: unknown CPU ID
SUPERCOP version: 20191221
Operation: crypto_stream
Primitive: chacha20
Time    Object size  Test size      Implementation              Compiler                                                                            Benchmark date  SUPERCOP version
46064   2480 0 4     13831 416 744  e/merged                    gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE        20200107        20191221
48626   3128 0 4     17684 428 752  e/regs                      gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE        20200107        20191221
48751   3112 0 4     17668 428 752  e/ref                       gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE        20200107        20191221
52190   3008 0 4     14999 424 752  e/merged                    gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE        20200107        20191221
52439   3960 0 4     18516 428 752  e/merged                    gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE        20200107        20191221
59565   3976 0 1     19406 428 776  e/merged                    clang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE  20200107        20191221
60626   2844 0 4     14831 424 752  e/regs                      gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE        20200107        20191221
64877   3560 0 4     15631 424 752  e/merged                    gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE         20200107        20191221
68062   2808 0 1     18234 428 776  e/ref                       clang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE  20200107        20191221
73250   3076 0 1     18498 428 776  e/regs                      clang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE  20200107        20191221
77126   2112 0 4     14103 424 752  e/ref                       gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE        20200107        20191221
78375   2808 0 4     14879 424 752  e/regs                      gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE         20200107        20191221
80939   2272 0 4     13619 416 744  e/regs                      gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE        20200107        20191221
82564   2308 0 4     14383 424 752  e/ref                       gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE         20200107        20191221
88752   1672 0 4     13019 416 744  e/ref                       gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE        20200107        20191221
100252  8724 0 4     20819 424 752  dolbeau/generic-gccsimd128  gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE         20200107        20191221
110252  7812 0 4     19175 416 744  dolbeau/generic-gccsimd128  gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE        20200107        20191221
124440  10236 0 4    22243 424 752  dolbeau/generic-gccsimd128  gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE        20200107        20191221
131003  17100 0 4    29195 424 752  dolbeau/generic-gccsimd256  gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE         20200107        20191221
133440  11292 0 4    25872 428 752  dolbeau/generic-gccsimd128  gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE        20200107        20191221
144253  16480 0 4    27847 416 744  dolbeau/generic-gccsimd256  gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE        20200107        20191221
144440  20092 0 4    32099 424 752  dolbeau/generic-gccsimd256  gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE        20200107        20191221
146065  21280 0 4    35856 428 752  dolbeau/generic-gccsimd256  gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE        20200107        20191221

Compiler output

Implementation: dolbeau/arm-neon
Security model: unknown
Compiler: clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
chacha.c: In file included from chacha.c:12:
chacha.c: /usr/lib/llvm-3.8/bin/../lib/clang/3.8.1/include/arm_neon.h:28:2: error: "NEON support not enabled"
chacha.c: #error "NEON support not enabled"
chacha.c: ^
chacha.c: 1 error generated.

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler Implementations
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE dolbeau/arm-neon
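
The "NEON support not enabled" error above is raised by arm_neon.h itself when the compiler flags in effect do not turn NEON on. A minimal sketch of guarding the include behind the compiler's predefined NEON macros, with a scalar fallback, might look like the following. The names HAVE_NEON and broadcast4 are hypothetical and do not come from dolbeau/arm-neon; this is an illustration of the guard, not the implementation's own code.

```c
#include <stdint.h>

/* __ARM_NEON (ACLE) or the older __ARM_NEON__ is predefined by GCC and
 * clang only when NEON code generation is enabled for this build. */
#if defined(__ARM_NEON) || defined(__ARM_NEON__)
#include <arm_neon.h>
#define HAVE_NEON 1
#else
#define HAVE_NEON 0
#endif

/* Hypothetical helper: store four copies of w into out[0..3]. */
static void broadcast4(uint32_t out[4], uint32_t w)
{
#if HAVE_NEON
    /* Same intrinsic that appears in the log, stored back to memory. */
    vst1q_u32(out, vdupq_n_u32(w));
#else
    /* Scalar fallback so the file still compiles without NEON. */
    for (int i = 0; i < 4; i++)
        out[i] = w;
#endif
}
```

With this shape, a build without NEON compiles the scalar branch instead of failing inside the header.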

Compiler output

Implementation: dolbeau/arm-neon
Security model: unknown
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE
chacha.c: In file included from chacha.c:12:0:
chacha.c: u4.h: In function 'crypto_stream_chacha20_dolbeau_arm_neon_ECRYPT_encrypt_bytes':
chacha.c: /usr/lib/gcc/arm-linux-gnueabihf/6/include/arm_neon.h:5816:1: error: inlining failed in call to always_inline 'vdupq_n_u32': target specific option mismatch
chacha.c: vdupq_n_u32 (uint32_t __a)
chacha.c: ^~~~~~~~~~~
chacha.c: In file included from chacha.c:95:0:
chacha.c: u4.h:45:14: note: called from here
chacha.c: uint32x4_t x_15 = vdupq_n_u32(x[15]);
chacha.c: ^~~~
chacha.c: In file included from chacha.c:12:0:
chacha.c: /usr/lib/gcc/arm-linux-gnueabihf/6/include/arm_neon.h:5816:1: error: inlining failed in call to always_inline 'vdupq_n_u32': target specific option mismatch
chacha.c: vdupq_n_u32 (uint32_t __a)
chacha.c: ^~~~~~~~~~~
chacha.c: In file included from chacha.c:95:0:
chacha.c: u4.h:44:14: note: called from here
chacha.c: uint32x4_t x_14 = vdupq_n_u32(x[14]);
chacha.c: ^~~~
chacha.c: In file included from chacha.c:12:0:
chacha.c: /usr/lib/gcc/arm-linux-gnueabihf/6/include/arm_neon.h:5816:1: error: inlining failed in call to always_inline 'vdupq_n_u32': target specific option mismatch
chacha.c: vdupq_n_u32 (uint32_t __a)
chacha.c: ^~~~~~~~~~~
chacha.c: In file included from chacha.c:95:0:
chacha.c: u4.h:41:14: note: called from here
chacha.c: uint32x4_t x_11 = vdupq_n_u32(x[11]);
chacha.c: ^~~~
chacha.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
Compiler Implementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE dolbeau/arm-neon
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE dolbeau/arm-neon
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE dolbeau/arm-neon
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE dolbeau/arm-neon

Compiler output

Implementation: dolbeau/generic-gccsimd128
Security model: unknown
Compiler: clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
chacha.c: In file included from chacha.c:94:
chacha.c: ./u4.h:127:7: error: cannot convert between vector values of different size ('V' (vector of 4 'unsigned int' values) and 'int')
chacha.c: VEC4_QUARTERROUND( 0, 4, 8,12);
chacha.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
chacha.c: ./u4.h:29:56: note: expanded from macro 'VEC4_QUARTERROUND'
chacha.c: x_##a = x_##a + x_##b; t_##a = x_##d ^ x_##a; x_##d = VEC4_ROT(t_##a, 16); \
chacha.c: ^ ~~~~~ ~~
chacha.c: ./u4.h:23:29: note: expanded from macro 'VEC4_ROT'
chacha.c: #define VEC4_ROT(a,imm) ((a << imm) | (a >> (32-imm)))
chacha.c: ~ ^ ~~~
chacha.c: ./u4.h:127:7: error: cannot convert between vector values of different size ('V' (vector of 4 'unsigned int' values) and 'int')
chacha.c: VEC4_QUARTERROUND( 0, 4, 8,12);
chacha.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
chacha.c: ./u4.h:29:56: note: expanded from macro 'VEC4_QUARTERROUND'
chacha.c: x_##a = x_##a + x_##b; t_##a = x_##d ^ x_##a; x_##d = VEC4_ROT(t_##a, 16); \
chacha.c: ^~~~~~~~~~~~~~~~~~~
chacha.c: ./u4.h:23:42: note: expanded from macro 'VEC4_ROT'
chacha.c: #define VEC4_ROT(a,imm) ((a << imm) | (a >> (32-imm)))
chacha.c: ~ ^ ~~~~~~~~
chacha.c: ./u4.h:127:7: error: cannot convert between vector values of different size ('V' (vector of 4 'unsigned int' values) and 'int')
chacha.c: VEC4_QUARTERROUND( 0, 4, 8,12);
chacha.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
chacha.c: ./u4.h:30:56: note: expanded from macro 'VEC4_QUARTERROUND'
chacha.c: x_##c = x_##c + x_##d; t_##c = x_##b ^ x_##c; x_##b = VEC4_ROT(t_##c, 12); \
chacha.c: ^ ~~~~~ ~~
chacha.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler Implementations
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE dolbeau/generic-gccsimd128
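
The clang diagnostics above point at VEC4_ROT, where a vector is shifted by a plain int (`a << imm`). A minimal sketch of a rotation in which both shift operands are vectors, assuming the GCC-style vector_size extension that both GCC and clang provide, is shown below. The names v4u32, rot_left32 and rot_left32_array are hypothetical, and this is not presented as the fix used by dolbeau/generic-gccsimd128.

```c
#include <stdint.h>
#include <string.h>

/* Four 32-bit lanes, using the GCC-style vector extension. */
typedef uint32_t v4u32 __attribute__((vector_size(16)));

/* Rotate every lane left by n bits; n must be in 1..31. The shift
 * counts are themselves vectors, so no vector/scalar mixing occurs. */
static v4u32 rot_left32(v4u32 a, uint32_t n)
{
    v4u32 left  = { n, n, n, n };
    v4u32 right = { 32 - n, 32 - n, 32 - n, 32 - n };
    return (a << left) | (a >> right);
}

/* Array wrapper so callers need not touch the vector type directly. */
static void rot_left32_array(uint32_t out[4], const uint32_t in[4], uint32_t n)
{
    v4u32 v;
    memcpy(&v, in, sizeof v);
    v = rot_left32(v, n);
    memcpy(out, &v, sizeof v);
}
```

The same pattern would extend to an eight-lane type for the VEC8_ROT case by changing the vector size and initializers.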

Compiler output

Implementation: dolbeau/generic-gccsimd256
Security model: unknown
Compiler: clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
chacha.c: In file included from chacha.c:94:
chacha.c: ./u8.h:138:7: error: cannot convert between vector values of different size ('V' (vector of 8 'unsigned int' values) and 'int')
chacha.c: VEC8_QUARTERROUND( 0, 4, 8,12);
chacha.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
chacha.c: ./u8.h:32:56: note: expanded from macro 'VEC8_QUARTERROUND'
chacha.c: x_##a = x_##a + x_##b; t_##a = x_##d ^ x_##a; x_##d = VEC8_ROT(t_##a, 16); \
chacha.c: ^ ~~~~~ ~~
chacha.c: ./u8.h:26:29: note: expanded from macro 'VEC8_ROT'
chacha.c: #define VEC8_ROT(a,imm) ((a << imm) | (a >> (32-imm)))
chacha.c: ~ ^ ~~~
chacha.c: ./u8.h:138:7: error: cannot convert between vector values of different size ('V' (vector of 8 'unsigned int' values) and 'int')
chacha.c: VEC8_QUARTERROUND( 0, 4, 8,12);
chacha.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
chacha.c: ./u8.h:32:56: note: expanded from macro 'VEC8_QUARTERROUND'
chacha.c: x_##a = x_##a + x_##b; t_##a = x_##d ^ x_##a; x_##d = VEC8_ROT(t_##a, 16); \
chacha.c: ^~~~~~~~~~~~~~~~~~~
chacha.c: ./u8.h:26:42: note: expanded from macro 'VEC8_ROT'
chacha.c: #define VEC8_ROT(a,imm) ((a << imm) | (a >> (32-imm)))
chacha.c: ~ ^ ~~~~~~~~
chacha.c: ./u8.h:138:7: error: cannot convert between vector values of different size ('V' (vector of 8 'unsigned int' values) and 'int')
chacha.c: VEC8_QUARTERROUND( 0, 4, 8,12);
chacha.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
chacha.c: ./u8.h:33:56: note: expanded from macro 'VEC8_QUARTERROUND'
chacha.c: x_##c = x_##c + x_##d; t_##c = x_##b ^ x_##c; x_##b = VEC8_ROT(t_##c, 12); \
chacha.c: ^ ~~~~~ ~~
chacha.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler Implementations
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE dolbeau/generic-gccsimd256

Compiler output

Implementation: krovetz/vec128
Security model: unknown
Compiler: clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
stream.c: stream.c:80:2: error: -- Implementation supports only machines with neon, altivec or SSE2
stream.c: #error -- Implementation supports only machines with neon, altivec or SSE2
stream.c: ^
stream.c: stream.c:151:14: warning: implicit declaration of function 'NONCE' is invalid in C99 [-Wimplicit-function-declaration]
stream.c: vec s3 = NONCE(np);
stream.c: ^
stream.c: stream.c:151:9: error: initializing 'vec' (vector of 4 'unsigned int' values) with an expression of incompatible type 'int'
stream.c: vec s3 = NONCE(np);
stream.c: ^ ~~~~~~~~~
stream.c: stream.c:152:36: error: use of undeclared identifier 'VBPI'
stream.c: for (iters = 0; iters < inlen/(BPI*64); iters++) {
stream.c: ^
stream.c: stream.c:91:19: note: expanded from macro 'BPI'
stream.c: #define BPI (VBPI + GPR_TOO) /* Blocks computed per loop iteration */
stream.c: ^
stream.c: stream.c:152:36: error: use of undeclared identifier 'GPR_TOO'
stream.c: stream.c:91:26: note: expanded from macro 'BPI'
stream.c: #define BPI (VBPI + GPR_TOO) /* Blocks computed per loop iteration */
stream.c: ^
stream.c: stream.c:155:19: error: use of undeclared identifier 'ONE'
stream.c: v7 = v3 + ONE;
stream.c: ^
stream.c: stream.c:176:13: warning: implicit declaration of function 'ROTW16' is invalid in C99 [-Wimplicit-function-declaration]
stream.c: DQROUND_VECTORS(v0,v1,v2,v3)
stream.c: ^
stream.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler Implementations
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE krovetz/vec128
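
In the krovetz/vec128 log, the `#error` at stream.c:80 fires first, and the later complaints about NONCE, VBPI, GPR_TOO and ONE appear to follow from no vector backend having been selected. As a rough sketch of that kind of compile-time selection, the function below reports which of the three instruction sets named in the error message the compiler says is enabled. The exact macros tested in stream.c are not shown in the log, so the ones used here (standard predefined macros) and the function names are assumptions.

```c
#include <string.h>

/* Report the vector backend the current compiler flags enable.
 * Returns "none" where stream.c would stop with #error. */
static const char *vector_backend(void)
{
#if defined(__ARM_NEON) || defined(__ARM_NEON__)
    return "neon";
#elif defined(__ALTIVEC__)
    return "altivec";
#elif defined(__SSE2__)
    return "sse2";
#else
    return "none";
#endif
}

/* Nonzero when one of the three backends is available. */
static int has_vector_backend(void)
{
    return strcmp(vector_backend(), "none") != 0;
}
```

Compiling this with the same flags as the failing build is one way to check whether the compiler considers NEON enabled on the machine.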

Compiler output

Implementation: krovetz/vec128
Security model: unknown
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE
stream.c: stream.c:80:2: error: #error -- Implementation supports only machines with neon, altivec or SSE2
stream.c: #error -- Implementation supports only machines with neon, altivec or SSE2
stream.c: ^~~~~
stream.c: stream.c: In function 'crypto_stream_chacha20_krovetz_vec128_xor':
stream.c: stream.c:151:14: warning: implicit declaration of function 'NONCE' [-Wimplicit-function-declaration]
stream.c: vec s3 = NONCE(np);
stream.c: ^~~~~
stream.c: stream.c:151:14: error: incompatible types when initializing type 'vec {aka __vector(4) unsigned int}' using type 'int'
stream.c: stream.c:91:19: error: 'VBPI' undeclared (first use in this function)
stream.c: #define BPI (VBPI + GPR_TOO) /* Blocks computed per loop iteration */
stream.c: ^
stream.c: stream.c:152:36: note: in expansion of macro 'BPI'
stream.c: for (iters = 0; iters < inlen/(BPI*64); iters++) {
stream.c: ^~~
stream.c: stream.c:91:19: note: each undeclared identifier is reported only once for each function it appears in
stream.c: #define BPI (VBPI + GPR_TOO) /* Blocks computed per loop iteration */
stream.c: ^
stream.c: stream.c:152:36: note: in expansion of macro 'BPI'
stream.c: for (iters = 0; iters < inlen/(BPI*64); iters++) {
stream.c: ^~~
stream.c: stream.c:91:26: error: 'GPR_TOO' undeclared (first use in this function)
stream.c: #define BPI (VBPI + GPR_TOO) /* Blocks computed per loop iteration */
stream.c: ^
stream.c: stream.c:152:36: note: in expansion of macro 'BPI'
stream.c: for (iters = 0; iters < inlen/(BPI*64); iters++) {
stream.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
Compiler Implementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE krovetz/vec128
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE krovetz/vec128
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE krovetz/vec128
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE krovetz/vec128

Compiler output

Implementation: moon/armv6/32
Security model: unknown
Compiler: clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
try.c: crypto_stream_chacha20.a(crypto_stream.o): In function `crypto_stream_chacha20_moon_armv6_32':
try.c: crypto_stream.c:(.text+0x...): multiple definition of `crypto_stream_chacha20_moon_armv6_32'
try.c: crypto_stream_chacha20.a(chacha.o):(.text+0x...): first defined here
try.c: crypto_stream_chacha20.a(crypto_stream.o): In function `crypto_stream_chacha20_moon_armv6_32_xor':
try.c: crypto_stream.c:(.text+0x...): undefined reference to `crypto_stream_chacha12_moon_armv6_32'
try.c: crypto_stream_chacha20.a(crypto_stream.o): In function `crypto_stream_chacha20_moon_armv6_32':
try.c: crypto_stream.c:(.text+0x...): undefined reference to `crypto_stream_chacha12_moon_armv6_32'
try.c: clang: error: linker command failed with exit code 1 (use -v to see invocation)

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler Implementations
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE moon/armv6/32

Compiler output

Implementation: moon/armv6/32
Security model: unknown
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE
try.c: crypto_stream_chacha20.a(crypto_stream.o): In function `crypto_stream_chacha20_moon_armv6_32':
try.c: crypto_stream.c:(.text+0x...): multiple definition of `crypto_stream_chacha20_moon_armv6_32'
try.c: crypto_stream_chacha20.a(chacha.o):(.text+0x...): first defined here
try.c: crypto_stream_chacha20.a(crypto_stream.o): In function `crypto_stream_chacha20_moon_armv6_32_xor':
try.c: crypto_stream.c:(.text+0x...): undefined reference to `crypto_stream_chacha12_moon_armv6_32'
try.c: crypto_stream_chacha20.a(crypto_stream.o): In function `crypto_stream_chacha20_moon_armv6_32':
try.c: crypto_stream.c:(.text+0x...): undefined reference to `crypto_stream_chacha12_moon_armv6_32'
try.c: collect2: error: ld returned 1 exit status

Number of similar (compiler,implementation) pairs: 4, namely:
Compiler Implementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE moon/armv6/32
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE moon/armv6/32
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE moon/armv6/32
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE moon/armv6/32
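
The moon/armv6/32 link failures show two things: crypto_stream.o and chacha.o both define crypto_stream_chacha20_moon_armv6_32, and crypto_stream.o refers to a chacha12-named symbol that nothing in the chacha20 archive defines. One possible cause, not confirmed by the log, is a wrapper that spells a primitive name out by hand. The sketch below shows the alternative of deriving every exported name from a single primitive/implementation pair, so a name for a different primitive cannot slip in. All macro and function names here (PRIMITIVE, IMPL, NS, stream_symbol_name) are hypothetical.

```c
#include <stddef.h>
#include <string.h>

/* Single source of truth for the namespace of this build. */
#define PRIMITIVE chacha20
#define IMPL      moon_armv6_32

/* Two-level macros so PRIMITIVE and IMPL expand before pasting. */
#define CAT3_(a, b, c) a##_##b##_##c
#define CAT3(a, b, c)  CAT3_(a, b, c)
#define NS(name)       CAT3(name, PRIMITIVE, IMPL)

#define STR_(x) #x
#define STR(x)  STR_(x)

/* Defined exactly once as crypto_stream_chacha20_moon_armv6_32.
 * Placeholder body: it only zero-fills the output buffer. */
int NS(crypto_stream)(unsigned char *out, unsigned long long len)
{
    memset(out, 0, (size_t)len);
    return 0;
}

/* Returns the generated symbol name as a string, for inspection. */
const char *stream_symbol_name(void)
{
    return STR(NS(crypto_stream));
}
```

Because every definition and every call goes through NS(), the definition and its references always agree on the primitive name.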

Compiler output

Implementation: moon/neon/32
Security model: unknown
Compiler: clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
try.c: crypto_stream_chacha20.a(crypto_stream.o): In function `crypto_stream_chacha20_moon_neon_32':
try.c: crypto_stream.c:(.text+0x...): multiple definition of `crypto_stream_chacha20_moon_neon_32'
try.c: crypto_stream_chacha20.a(chacha.o):(.text+0x...): first defined here
try.c: crypto_stream_chacha20.a(crypto_stream.o): In function `crypto_stream_chacha20_moon_neon_32_xor':
try.c: crypto_stream.c:(.text+0x...): undefined reference to `crypto_stream_chacha12_moon_neon_32'
try.c: crypto_stream_chacha20.a(crypto_stream.o): In function `crypto_stream_chacha20_moon_neon_32':
try.c: crypto_stream.c:(.text+0x...): undefined reference to `crypto_stream_chacha12_moon_neon_32'
try.c: clang: error: linker command failed with exit code 1 (use -v to see invocation)

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler Implementations
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE moon/neon/32

Compiler output

Implementation: moon/neon/32
Security model: unknown
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE
try.c: crypto_stream_chacha20.a(crypto_stream.o): In function `crypto_stream_chacha20_moon_neon_32':
try.c: crypto_stream.c:(.text+0x...): multiple definition of `crypto_stream_chacha20_moon_neon_32'
try.c: crypto_stream_chacha20.a(chacha.o):(.text+0x...): first defined here
try.c: crypto_stream_chacha20.a(crypto_stream.o): In function `crypto_stream_chacha20_moon_neon_32_xor':
try.c: crypto_stream.c:(.text+0x...): undefined reference to `crypto_stream_chacha12_moon_neon_32'
try.c: crypto_stream_chacha20.a(crypto_stream.o): In function `crypto_stream_chacha20_moon_neon_32':
try.c: crypto_stream.c:(.text+0x...): undefined reference to `crypto_stream_chacha12_moon_neon_32'
try.c: collect2: error: ld returned 1 exit status

Number of similar (compiler,implementation) pairs: 4, namely:
Compiler Implementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE moon/neon/32
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE moon/neon/32
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE moon/neon/32
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE moon/neon/32