Implementation notes: armeabi, cubox, crypto_hash/cubehash512

Computer: cubox
Architecture: armeabi
CPU ID: unknown CPU ID
SUPERCOP version: 20161026
Operation: crypto_hash
Primitive: cubehash512
TimeImplementationCompilerBenchmark dateSUPERCOP version
106520unrolled3gcc -funroll-loops -mcpu=marvell-pj4 -Os2016120820161026
110492armgcc -funroll-loops -mcpu=marvell-pj4 -O32016120820161026
110496armgcc -mcpu=marvell-pj4 -O32016120820161026
110537armgcc -funroll-loops -mcpu=marvell-pj4 -O22016120820161026
110744armgcc -funroll-loops -mcpu=marvell-pj4 -Os2016120820161026
110748armgcc -mcpu=marvell-pj4 -Os2016120820161026
110756armgcc -mcpu=marvell-pj4 -O22016120820161026
118004unrolled5gcc -funroll-loops -mcpu=marvell-pj4 -Os2016120820161026
120640unrolled3gcc -mcpu=marvell-pj4 -Os2016120820161026
136223unrolled5gcc -funroll-loops -mcpu=marvell-pj4 -O22016120820161026
139508unrolled5gcc -mcpu=marvell-pj4 -Os2016120820161026
142436unrolled4gcc -funroll-loops -mcpu=marvell-pj4 -Os2016120820161026
142898unrolled4gcc -funroll-loops -mcpu=marvell-pj4 -O22016120820161026
142987unrolled3gcc -funroll-loops -mcpu=marvell-pj4 -O32016120820161026
143212unrolled3gcc -funroll-loops -mcpu=marvell-pj4 -O22016120820161026
146181unrolled5gcc -funroll-loops -mcpu=marvell-pj4 -O32016120820161026
149692unrolled5gcc -mcpu=marvell-pj4 -O22016120820161026
151345unrolled4gcc -funroll-loops -mcpu=marvell-pj4 -O32016120820161026
151436unrolled3gcc -mcpu=marvell-pj4 -O22016120820161026
154476unrolled4gcc -mcpu=marvell-pj4 -Os2016120820161026
155144unrolled5gcc -mcpu=marvell-pj4 -O32016120820161026
159418unrolled4gcc -mcpu=marvell-pj4 -O32016120820161026
159992unrolled3gcc -mcpu=marvell-pj4 -O32016120820161026
160436unrolled4gcc -mcpu=marvell-pj4 -O22016120820161026
164685unrolledgcc -funroll-loops -mcpu=marvell-pj4 -O32016120820161026
171781simplegcc -funroll-loops -mcpu=marvell-pj4 -O22016120820161026
175604unrolledgcc -mcpu=marvell-pj4 -O32016120820161026
175979unrolledgcc -funroll-loops -mcpu=marvell-pj4 -O22016120820161026
176455unrolled2gcc -funroll-loops -mcpu=marvell-pj4 -O32016120820161026
179593simplegcc -funroll-loops -mcpu=marvell-pj4 -O32016120820161026
187448unrolled2gcc -mcpu=marvell-pj4 -O32016120820161026
187787unrolled2gcc -funroll-loops -mcpu=marvell-pj4 -O22016120820161026
188316unrolledgcc -mcpu=marvell-pj4 -O22016120820161026
188844unrolled2gcc -funroll-loops -mcpu=marvell-pj4 -Os2016120820161026
190992simplegcc -mcpu=marvell-pj4 -O32016120820161026
192056unrolledgcc -funroll-loops -mcpu=marvell-pj4 -Os2016120820161026
198656unrolled2gcc -mcpu=marvell-pj4 -O22016120820161026
201636unrolledgcc -mcpu=marvell-pj4 -Os2016120820161026
206692unrolled2gcc -mcpu=marvell-pj4 -Os2016120820161026
959185simplegcc -mcpu=marvell-pj4 -O22016120820161026
979175simplegcc -funroll-loops -mcpu=marvell-pj4 -Os2016120820161026
983752simplegcc -mcpu=marvell-pj4 -Os2016120820161026

Compiler output

Implementation: crypto_hash/cubehash512/armneon
Compiler: gcc -funroll-loops -mcpu=marvell-pj4 -O2
hash.c: In file included from hash.c:2:0:
hash.c: hash.c: In function 'crypto_hash_cubehash512_armneon':
hash.c: /usr/lib/gcc/armv7l-unknown-linux-gnueabihf/6.2.1/include/arm_neon.h:9035:1: error: inlining failed in call to always_inline 'vld1q_u32': target specific option mismatch
hash.c: vld1q_u32 (const uint32_t * __a)
hash.c: ^~~~~~~~~
hash.c: hash.c:40:6: note: called from here
hash.c: x7 = vld1q_u32(constants + 28);
hash.c: ~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~
hash.c: In file included from hash.c:2:0:
hash.c: /usr/lib/gcc/armv7l-unknown-linux-gnueabihf/6.2.1/include/arm_neon.h:9035:1: error: inlining failed in call to always_inline 'vld1q_u32': target specific option mismatch
hash.c: ...
hash.c: hash.c:152:3: note: called from here
hash.c: vst1q_u32((uint32_t *) (out + 16),x1);
hash.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
hash.c: In file included from hash.c:2:0:
hash.c: /usr/lib/gcc/armv7l-unknown-linux-gnueabihf/6.2.1/include/arm_neon.h:9535:1: error: inlining failed in call to always_inline 'vst1q_u32': target specific option mismatch
hash.c: vst1q_u32 (uint32_t * __a, uint32x4_t __b)
hash.c: ^~~~~~~~~
hash.c: hash.c:151:3: note: called from here
hash.c: vst1q_u32((uint32_t *) out,x0);
hash.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
gcc -funroll-loops -mcpu=marvell-pj4 -O2 armneon
gcc -funroll-loops -mcpu=marvell-pj4 -O3 armneon
gcc -funroll-loops -mcpu=marvell-pj4 -Os armneon
gcc -mcpu=marvell-pj4 -O2 armneon
gcc -mcpu=marvell-pj4 -O3 armneon
gcc -mcpu=marvell-pj4 -Os armneon