Implementation notes: armeabi, cubox, crypto_aead/iscream12v2

Computer: cubox
Architecture: armeabi
CPU ID: unknown CPU ID
SUPERCOP version: 20161026
Operation: crypto_aead
Primitive: iscream12v2
TimeImplementationCompilerBenchmark dateSUPERCOP version
886100refgcc -funroll-loops -mcpu=marvell-pj4 -O32016121120161026
909962refgcc -mcpu=marvell-pj4 -O32016121120161026
2177064refgcc -funroll-loops -mcpu=marvell-pj4 -O22016121120161026
2894807refgcc -mcpu=marvell-pj4 -O22016121120161026
2997199refgcc -funroll-loops -mcpu=marvell-pj4 -Os2016121120161026
3033490refgcc -mcpu=marvell-pj4 -Os2016121120161026

Compiler output

Implementation: crypto_aead/iscream12v2/neon
Compiler: gcc -funroll-loops -mcpu=marvell-pj4 -O2
iscream.c: In file included from tae.h:7:0,
iscream.c: from iscream.c:10:
iscream.c: helper.h: In function 'write128':
iscream.c: /usr/lib/gcc/armv7l-unknown-linux-gnueabihf/6.2.1/include/arm_neon.h:8519:1: error: inlining failed in call to always_inline 'vzipq_u8': target specific option mismatch
iscream.c: vzipq_u8 (uint8x16_t __a, uint8x16_t __b)
iscream.c: ^~~~~~~~
iscream.c: In file included from iscream.c:13:0:
iscream.c: helper.h:10:18: note: called from here
iscream.c: uint8x16x2_t c__ = vzipq_u8 (X(i), X(j)); ^
iscream.c: ...
iscream.c: /usr/lib/gcc/armv7l-unknown-linux-gnueabihf/6.2.1/include/arm_neon.h:9523:1: error: inlining failed in call to always_inline 'vst1q_u8': target specific option mismatch
iscream.c: vst1q_u8 (uint8_t * __a, uint8x16_t __b)
iscream.c: ^~~~~~~~
iscream.c: In file included from iscream.c:10:0:
iscream.c: tae.h:11:25: note: called from here
iscream.c: #define neon_store(p,x) vst1q_u8((uint8_t*)(p), x)
iscream.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~
iscream.c: helper.h:85:5: note: in expansion of macro 'neon_store'
iscream.c: neon_store(out8+16*0 , X(0 ));
iscream.c: ^~~~~~~~~~

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
gcc -funroll-loops -mcpu=marvell-pj4 -O2 neon
gcc -funroll-loops -mcpu=marvell-pj4 -O3 neon
gcc -funroll-loops -mcpu=marvell-pj4 -Os neon
gcc -mcpu=marvell-pj4 -O2 neon
gcc -mcpu=marvell-pj4 -O3 neon
gcc -mcpu=marvell-pj4 -Os neon

Compiler output

Implementation: crypto_aead/iscream12v2/sse
Compiler: gcc -funroll-loops -mcpu=marvell-pj4 -O2
iscream.c: iscream.c: In function 'LBox16P':
iscream.c: iscream.c:16:32: warning: implicit declaration of function '__builtin_ia32_psrldi128' [-Wimplicit-function-declaration]
iscream.c: #define shift_right(x) ((v16qi)__builtin_ia32_psrldi128((v4si)x, 4))
iscream.c: ^
iscream.c: iscream.c:181:10: note: in expansion of macro 'shift_right'
iscream.c: t0 = shift_right(in[0]) & V(0xf);
iscream.c: ^~~~~~~~~~~
iscream.c: iscream.c:181:5: error: can't convert a value of type 'int' to vector type '__vector(16) char' which has different size
iscream.c: t0 = shift_right(in[0]) & V(0xf);
iscream.c: ^~
iscream.c: ...
iscream.c: iscream.c:301:2: error: can't convert a value of type 'int' to vector type '__vector(16) char' which has different size
iscream.c: iscream.c:301:2: error: can't convert a value of type 'int' to vector type '__vector(16) char' which has different size
iscream.c: iscream.c:301:2: error: can't convert a value of type 'int' to vector type '__vector(16) char' which has different size
iscream.c: iscream.c: In function 'tweakey_set':
iscream.c: iscream.c:312:3: error: can't convert a value of type 'int' to vector type '__vector(16) char' which has different size
iscream.c: UPDATE(j);
iscream.c: ^~~~~~
iscream.c: iscream.c:312:3: error: can't convert a value of type 'int' to vector type '__vector(16) char' which has different size
iscream.c: iscream.c:312:3: error: can't convert a value of type 'int' to vector type '__vector(16) char' which has different size
iscream.c: iscream.c:312:3: error: can't convert a value of type 'int' to vector type '__vector(16) char' which has different size

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
gcc -funroll-loops -mcpu=marvell-pj4 -O2 sse
gcc -funroll-loops -mcpu=marvell-pj4 -O3 sse
gcc -funroll-loops -mcpu=marvell-pj4 -Os sse
gcc -mcpu=marvell-pj4 -O2 sse
gcc -mcpu=marvell-pj4 -O3 sse
gcc -mcpu=marvell-pj4 -Os sse