Implementation notes: x86, thoth, crypto_aead/pi16cipher128v2

Computer: thoth
Architecture: x86
CPU ID: AuthenticAMD-00000622-0183f9ff
SUPERCOP version: 20160806
Operation: crypto_aead
Primitive: pi16cipher128v2
TimeImplementationCompilerBenchmark dateSUPERCOP version
604088ref2gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv2016072720160724
847692ref3clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments2016072720160724
875742ref3gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv2016072720160724
984567ref3gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv2016072720160724
1023918ref3gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv2016072720160724
1176510ref3gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv2016072720160724
1351782ref2clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments2016072720160724
4131807ref2gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv2016072720160724
4301658ref2gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv2016072720160724
5963541ref2gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv2016072720160724
9821978goptvgcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv2016080320160731
17640231goptvgcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv2016080320160731
24157500goptvgcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv2016080320160731
29047248goptvgcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv2016080320160731

Test failure

Implementation: crypto_aead/pi16cipher128v2/optimized_nonSSE
Compiler: clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments
error 111
crypto_aead_encrypt returns more than crypto_aead_ABYTES extra bytes

Number of similar (compiler,implementation) pairs: 10, namely:
CompilerImplementations
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments optimized_nonSSE ref
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv optimized_nonSSE ref
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv optimized_nonSSE ref
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv optimized_nonSSE ref
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv optimized_nonSSE ref

Compiler output

Implementation: crypto_aead/pi16cipher128v2/goptv
Compiler: clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments
pi-cipher.c: pi-cipher.c:273:15: error: cannot convert between vector values of different size ('qword_t' (vector of 4 'word_t' values) and 'uint8_t' (aka 'unsigned char'))
pi-cipher.c: return (x gt;>gt; ((PI_WORD_SIZE) - n));
pi-cipher.c: ~ ^ ~
pi-cipher.c: pi-cipher.c:273:26: error: cannot convert between vector values of different size ('qword_t' (vector of 4 'word_t' values) and 'int')
pi-cipher.c: return (x gt;>gt; ((PI_WORD_SIZE) - n));
pi-cipher.c: ~ ^ ~~~~~~~~~~~~~~~~~~~~
pi-cipher.c: pi-cipher.c:286:9: error: use of unknown builtin '__builtin_shuffle' [-Wimplicit-function-declaration]
pi-cipher.c: n_t += __builtin_shuffle(y, g_mask);
pi-cipher.c: ^
pi-cipher.c: pi-cipher.c:286:6: error: cannot convert between vector values of different size ('vchunk_t' (vector of 4 'word_t' values) and 'int')
pi-cipher.c: n_t += __builtin_shuffle(y, g_mask);
pi-cipher.c: ~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~
pi-cipher.c: pi-cipher.c:287:6: error: cannot convert between vector values of different size ('vchunk_t' (vector of 4 'word_t' values) and 'int')
pi-cipher.c: n_t += __builtin_shuffle(y, n_mask);
pi-cipher.c: ~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~
pi-cipher.c: pi-cipher.c:289:8: error: cannot convert between vector values of different size ('unsigned int' and 'vchunk_t' (vector of 4 'word_t' values))
pi-cipher.c: n_t = ROTL(n_t, n_rot);
pi-cipher.c: ^~~~~~~~~~~~~~~~
pi-cipher.c: pi-cipher.c:267:64: note: expanded from macro 'ROTL'
pi-cipher.c: #define ROTL(x, n) (((x) gt;>gt; (sizeof(word_t) * 8 - (n))))
pi-cipher.c: ~~~~~~~~~~~~~~~~~~ ^ ~~~
pi-cipher.c: pi-cipher.c:290:6: error: cannot convert between vector values of different size ('vchunk_t' (vector of 4 'word_t' values) and 'int')
pi-cipher.c: n_t ^= __builtin_shuffle(n_t, n_x_1) ^ __builtin_shuffle(n_t, n_x_2);
pi-cipher.c: ~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
pi-cipher.c: pi-cipher.c:305:6: error: cannot convert between vector values of different size ('vchunk_t' (vector of 4 'word_t' values) and 'int')
pi-cipher.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments goptv