Implementation notes: armeabi, novena, crypto_hashblocks/sha256

Computer: novena
Architecture: armeabi
CPU ID: unknown CPU ID
SUPERCOP version: 20220506
Operation: crypto_hashblocks
Primitive: sha256
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
408047428 0 015099 420 704inplacegcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2022060420220506
412047188 0 014355 412 704refgcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2022060420220506
416037400 0 016327 420 704refgcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2022060420220506
416047432 0 016363 420 704inplacegcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2022060420220506
416047384 0 015055 420 704refgcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2022060420220506
420037224 0 014399 412 704inplacegcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2022060420220506
460038704 0 019597 424 704arm11clang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2022060420220506
460038688 0 016359 420 704arm11gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2022060420220506
460038688 0 015859 412 704arm11gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2022060420220506
460048704 0 019845 424 712arm11clang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2022060420220506
460048688 0 017615 420 704arm11gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2022060420220506
460048688 0 016238 416 704arm11gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2022060420220506
460058704 0 017859 420 704arm11clang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2022060420220506
460058704 0 028365 424 712arm11clang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2022060420220506
480048704 0 018003 420 704arm11clang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2022060420220506
5080410576 0 021721 424 712inplaceclang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2022060420220506
508049988 0 019279 420 704inplaceclang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2022060420220506
508047304 0 014858 416 704inplacegcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2022060420220506
5080410576 0 021717 424 712refclang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2022060420220506
508047350 0 014902 416 704refgcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2022060420220506
508059988 0 019275 420 704refclang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2022060420220506
5160410576 0 021473 424 704inplaceclang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2022060420220506
5160510576 0 019735 420 704inplaceclang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2022060420220506
5160510576 0 021469 424 704refclang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2022060420220506
5160510576 0 019731 420 704refclang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2022060420220506
5280510620 0 030281 424 712inplaceclang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2022060420220506
5320510620 0 030277 424 712refclang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2022060420220506

Compiler output

Implementation: dolbeau/amd64-sha
Security model: constbranchindex
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
blocks.c: In file included from blocks.c:37:
blocks.c: In file included from /usr/lib/llvm-11/lib/clang/11.0.1/include/immintrin.h:15:
blocks.c: /usr/lib/llvm-11/lib/clang/11.0.1/include/mmintrin.h:50:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
blocks.c: return (__m64)__builtin_ia32_vec_init_v2si(__i, 0);
blocks.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
blocks.c: /usr/lib/llvm-11/lib/clang/11.0.1/include/mmintrin.h:129:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
blocks.c: return (__m64)__builtin_ia32_packsswb((__v4hi)__m1, (__v4hi)__m2);
blocks.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
blocks.c: /usr/lib/llvm-11/lib/clang/11.0.1/include/mmintrin.h:159:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
blocks.c: return (__m64)__builtin_ia32_packssdw((__v2si)__m1, (__v2si)__m2);
blocks.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
blocks.c: /usr/lib/llvm-11/lib/clang/11.0.1/include/mmintrin.h:189:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
blocks.c: return (__m64)__builtin_ia32_packuswb((__v4hi)__m1, (__v4hi)__m2);
blocks.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
blocks.c: /usr/lib/llvm-11/lib/clang/11.0.1/include/mmintrin.h:216:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
blocks.c: return (__m64)__builtin_ia32_punpckhbw((__v8qi)__m1, (__v8qi)__m2);
blocks.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
blocks.c: /usr/lib/llvm-11/lib/clang/11.0.1/include/mmintrin.h:239:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
blocks.c: return (__m64)__builtin_ia32_punpckhwd((__v4hi)__m1, (__v4hi)__m2);
blocks.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
blocks.c: /usr/lib/llvm-11/lib/clang/11.0.1/include/mmintrin.h:260:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
blocks.c: return (__m64)__builtin_ia32_punpckhdq((__v2si)__m1, (__v2si)__m2);
blocks.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
blocks.c: /usr/lib/llvm-11/lib/clang/11.0.1/include/mmintrin.h:287:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
blocks.c: return (__m64)__builtin_ia32_punpcklbw((__v8qi)__m1, (__v8qi)__m2);
blocks.c: ...

Number of similar (compiler,implementation) pairs: 5, namely:
CompilerImplementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE dolbeau/amd64-sha
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE dolbeau/amd64-sha
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE dolbeau/amd64-sha
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE dolbeau/amd64-sha
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE dolbeau/amd64-sha

Compiler output

Implementation: dolbeau/amd64-sha
Security model: constbranchindex
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE
blocks.c: blocks.c:37:10: fatal error: immintrin.h: No such file or directory
blocks.c: 37 | #include <immintrin.h>
blocks.c: | ^~~~~~~~~~~~~
blocks.c: compilation terminated.

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE dolbeau/amd64-sha
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE dolbeau/amd64-sha
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE dolbeau/amd64-sha
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE dolbeau/amd64-sha

Compiler output

Implementation: dolbeau/armv8crypto
Security model: constbranchindex
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
blocks.c: In file included from blocks.c:37:
blocks.c: /usr/lib/llvm-11/lib/clang/11.0.1/include/arm_neon.h:32:2: error: "NEON support not enabled"
blocks.c: #error "NEON support not enabled"
blocks.c: ^
blocks.c: blocks.c:59:2: error: use of undeclared identifier 'uint32x4_t'
blocks.c: uint32x4_t c0 = vld1q_u32(s256cst + 0);
blocks.c: ^
blocks.c: blocks.c:60:2: error: use of undeclared identifier 'uint32x4_t'
blocks.c: uint32x4_t c1 = vld1q_u32(s256cst + 4);
blocks.c: ^
blocks.c: blocks.c:61:2: error: use of undeclared identifier 'uint32x4_t'
blocks.c: uint32x4_t c2 = vld1q_u32(s256cst + 8);
blocks.c: ^
blocks.c: blocks.c:62:2: error: use of undeclared identifier 'uint32x4_t'
blocks.c: uint32x4_t c3 = vld1q_u32(s256cst + 12);
blocks.c: ^
blocks.c: blocks.c:63:2: error: use of undeclared identifier 'uint32x4_t'
blocks.c: uint32x4_t c4 = vld1q_u32(s256cst + 16);
blocks.c: ^
blocks.c: blocks.c:64:2: error: use of undeclared identifier 'uint32x4_t'
blocks.c: uint32x4_t c5 = vld1q_u32(s256cst + 20);
blocks.c: ^
blocks.c: blocks.c:65:2: error: use of undeclared identifier 'uint32x4_t'
blocks.c: uint32x4_t c6 = vld1q_u32(s256cst + 24);
blocks.c: ^
blocks.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE dolbeau/armv8crypto
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE dolbeau/armv8crypto
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE dolbeau/armv8crypto
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE dolbeau/armv8crypto

Compiler output

Implementation: dolbeau/armv8crypto
Security model: constbranchindex
Compiler: clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
blocks.c: blocks.c:134:3: warning: implicit declaration of function 'vsha256hq_u32' is invalid in C99 [-Wimplicit-function-declaration]
blocks.c: DO16ROUNDS(i0, i1, i2, i3, c0, c1, c2, c3);
blocks.c: ^
blocks.c: blocks.c:108:8: note: expanded from macro 'DO16ROUNDS'
blocks.c: x0 = vsha256hq_u32(s0, s1, h0); \
blocks.c: ^
blocks.c: blocks.c:134:3: error: assigning to 'uint32x4_t' (vector of 4 'uint32_t' values) from incompatible type 'int'
blocks.c: DO16ROUNDS(i0, i1, i2, i3, c0, c1, c2, c3);
blocks.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
blocks.c: blocks.c:108:6: note: expanded from macro 'DO16ROUNDS'
blocks.c: x0 = vsha256hq_u32(s0, s1, h0); \
blocks.c: ^ ~~~~~~~~~~~~~~~~~~~~~~~~~
blocks.c: blocks.c:134:3: warning: implicit declaration of function 'vsha256h2q_u32' is invalid in C99 [-Wimplicit-function-declaration]
blocks.c: blocks.c:109:8: note: expanded from macro 'DO16ROUNDS'
blocks.c: x1 = vsha256h2q_u32(s1, s0, h0); \
blocks.c: ^
blocks.c: blocks.c:134:3: error: assigning to 'uint32x4_t' (vector of 4 'uint32_t' values) from incompatible type 'int'
blocks.c: DO16ROUNDS(i0, i1, i2, i3, c0, c1, c2, c3);
blocks.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
blocks.c: blocks.c:109:6: note: expanded from macro 'DO16ROUNDS'
blocks.c: x1 = vsha256h2q_u32(s1, s0, h0); \
blocks.c: ^ ~~~~~~~~~~~~~~~~~~~~~~~~~~
blocks.c: blocks.c:134:3: error: assigning to 'uint32x4_t' (vector of 4 'uint32_t' values) from incompatible type 'int'
blocks.c: DO16ROUNDS(i0, i1, i2, i3, c0, c1, c2, c3);
blocks.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
blocks.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE dolbeau/armv8crypto

Compiler output

Implementation: dolbeau/armv8crypto
Security model: constbranchindex
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE
blocks.c: In file included from blocks.c:37:
blocks.c: blocks.c: In function ‘crypto_hashblocks_sha256_dolbeau_armv8crypto_constbranchindex’:
blocks.c: /usr/lib/gcc/arm-linux-gnueabihf/10/include/arm_neon.h:9080:1: error: inlining failed in call to ‘always_inline’ ‘vrev32q_u8’: target specific option mismatch
blocks.c: 9080 | vrev32q_u8 (uint8x16_t __a)
blocks.c: | ^~~~~~~~~~
blocks.c: blocks.c:81:19: note: called from here
blocks.c: 81 | d1 = (uint32x4_t)vrev32q_u8((uint8x16_t)d1);
blocks.c: | ^~~~~~~~~~~~~~~~~~~~~~~~~~
blocks.c: In file included from blocks.c:37:
blocks.c: /usr/lib/gcc/arm-linux-gnueabihf/10/include/arm_neon.h:9080:1: error: inlining failed in call to ‘always_inline’ ‘vrev32q_u8’: target specific option mismatch
blocks.c: 9080 | vrev32q_u8 (uint8x16_t __a)
blocks.c: | ^~~~~~~~~~
blocks.c: blocks.c:80:19: note: called from here
blocks.c: 80 | d0 = (uint32x4_t)vrev32q_u8((uint8x16_t)d0);
blocks.c: | ^~~~~~~~~~~~~~~~~~~~~~~~~~
blocks.c: In file included from blocks.c:37:
blocks.c: /usr/lib/gcc/arm-linux-gnueabihf/10/include/arm_neon.h:10417:1: error: inlining failed in call to ‘always_inline’ ‘vld1q_u32’: target specific option mismatch
blocks.c: 10417 | vld1q_u32 (const uint32_t * __a)
blocks.c: | ^~~~~~~~~
blocks.c: blocks.c:77:18: note: called from here
blocks.c: 77 | uint32x4_t d1 = vld1q_u32((uint32_t*)(statebytes + 16));
blocks.c: | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
blocks.c: In file included from blocks.c:37:
blocks.c: /usr/lib/gcc/arm-linux-gnueabihf/10/include/arm_neon.h:10417:1: error: inlining failed in call to ‘always_inline’ ‘vld1q_u32’: target specific option mismatch
blocks.c: 10417 | vld1q_u32 (const uint32_t * __a)
blocks.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE dolbeau/armv8crypto
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE dolbeau/armv8crypto
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE dolbeau/armv8crypto
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE dolbeau/armv8crypto