Implementation notes: armeabi, cubox, crypto_hash/round3jh256

Computer: cubox
Architecture: armeabi
CPU ID: unknown CPU ID
SUPERCOP version: 20161026
Operation: crypto_hash
Primitive: round3jh256
TimeImplementationCompilerBenchmark dateSUPERCOP version
173676arm11gcc -mcpu=marvell-pj4 -O22016121020161026
173931arm11gcc -funroll-loops -mcpu=marvell-pj4 -O22016121020161026
174241arm11gcc -funroll-loops -mcpu=marvell-pj4 -O32016121020161026
174359arm11gcc -mcpu=marvell-pj4 -O32016121020161026
174672arm11gcc -funroll-loops -mcpu=marvell-pj4 -Os2016121020161026
174672arm11gcc -mcpu=marvell-pj4 -Os2016121020161026
339099sphlibgcc -funroll-loops -mcpu=marvell-pj4 -Os2016121020161026
373488sphlib-smallgcc -funroll-loops -mcpu=marvell-pj4 -O32016121020161026
392401sphlib-smallgcc -mcpu=marvell-pj4 -O32016121020161026
397188sphlib-smallgcc -funroll-loops -mcpu=marvell-pj4 -Os2016121020161026
417420sphlibgcc -mcpu=marvell-pj4 -Os2016121020161026
453591sphlib-smallgcc -funroll-loops -mcpu=marvell-pj4 -O22016121020161026
461577sphlib-smallgcc -mcpu=marvell-pj4 -Os2016121020161026
474527sphlib-smallgcc -mcpu=marvell-pj4 -O22016121020161026
534115sphlibgcc -funroll-loops -mcpu=marvell-pj4 -O32016121020161026
534202simplegcc -funroll-loops -mcpu=marvell-pj4 -O22016121020161026
534725simplegcc -funroll-loops -mcpu=marvell-pj4 -O32016121020161026
555881sphlibgcc -funroll-loops -mcpu=marvell-pj4 -O22016121020161026
577991simplegcc -mcpu=marvell-pj4 -O22016121020161026
580380simplegcc -mcpu=marvell-pj4 -O32016121020161026
585284sphlibgcc -mcpu=marvell-pj4 -O32016121020161026
603038sphlibgcc -mcpu=marvell-pj4 -O22016121020161026
3500761simplegcc -funroll-loops -mcpu=marvell-pj4 -Os2016121020161026
3518051simplegcc -mcpu=marvell-pj4 -Os2016121020161026

Test failure

Implementation: crypto_hash/round3jh256/neon2
Compiler: gcc -funroll-loops -mcpu=marvell-pj4 -O2
error 111

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
gcc -funroll-loops -mcpu=marvell-pj4 -O2 neon2
gcc -funroll-loops -mcpu=marvell-pj4 -O3 neon2
gcc -funroll-loops -mcpu=marvell-pj4 -Os neon2
gcc -mcpu=marvell-pj4 -O2 neon2
gcc -mcpu=marvell-pj4 -O3 neon2
gcc -mcpu=marvell-pj4 -Os neon2

Compiler output

Implementation: crypto_hash/round3jh256/neon
Compiler: gcc -funroll-loops -mcpu=marvell-pj4 -O2
hash.c: In file included from hash.c:3:0:
hash.c: hash.c: In function 'F8':
hash.c: /usr/lib/gcc/armv7l-unknown-linux-gnueabihf/6.2.1/include/arm_neon.h:9035:1: error: inlining failed in call to always_inline 'vld1q_u32': target specific option mismatch
hash.c: vld1q_u32 (const uint32_t * __a)
hash.c: ^~~~~~~~~
hash.c: hash.c:72:31: note: called from here
hash.c: #define LOAD(p) vld1q_u32((uint32_t *)(p))
hash.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~
hash.c: hash.c:160:9: note: in expansion of macro 'LOAD'
hash.c: x3 ^= LOAD(3+(uint32x4_t *) buf);
hash.c: ...
hash.c: In file included from hash.c:3:0:
hash.c: /usr/lib/gcc/armv7l-unknown-linux-gnueabihf/6.2.1/include/arm_neon.h:9035:1: error: inlining failed in call to always_inline 'vld1q_u32': target specific option mismatch
hash.c: vld1q_u32 (const uint32_t * __a)
hash.c: ^~~~~~~~~
hash.c: hash.c:72:31: note: called from here
hash.c: #define LOAD(p) vld1q_u32((uint32_t *)(p))
hash.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~
hash.c: hash.c:186:9: note: in expansion of macro 'LOAD'
hash.c: x4 ^= LOAD(0+(uint32x4_t *) buf);
hash.c: ^~~~

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
gcc -funroll-loops -mcpu=marvell-pj4 -O2 neon
gcc -funroll-loops -mcpu=marvell-pj4 -O3 neon
gcc -funroll-loops -mcpu=marvell-pj4 -Os neon
gcc -mcpu=marvell-pj4 -O2 neon
gcc -mcpu=marvell-pj4 -O3 neon
gcc -mcpu=marvell-pj4 -Os neon