Implementation notes: amd64, genji239, crypto_aead/pi16cipher096v2

Computer: genji239
Architecture: amd64
CPU ID: GenuineIntel-00050654-bfebfbff
SUPERCOP version: 20191017
Operation: crypto_aead
Primitive: pi16cipher096v2
Time     Object size  Test size      Implementation  Compiler  Benchmark date  SUPERCOP version
345974   47219 8 0    62921 824 896  goptv           gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE  20191020  20191017
351922   34459 8 0    47412 816 896  goptv           gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE  20191020  20191017
356548   32631 8 0    45524 816 896  goptv           gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE  20191020  20191017
380622   44100 8 0    62405 864 864  ref3            icc_-march=corei7-avx_-mtune=corei7-avx_-O2_-fomit-frame-pointer  20191020  20191017
380734   43580 8 0    60589 864 864  ref3            icc_-march=corei7_-mtune=corei7_-O2_-fomit-frame-pointer  20191020  20191017
380816   45140 8 0    63797 864 864  ref3            icc_-march=broadwell_-mtune=broadwell_-O2_-fomit-frame-pointer  20191020  20191017
380852   44100 8 0    62405 864 864  ref3            icc_-march=ivybridge_-mtune=ivybridge_-O2_-fomit-frame-pointer  20191020  20191017
380854   44100 8 0    62405 864 864  ref3            icc_-march=core-avx-i_-mtune=core-avx-i_-O2_-fomit-frame-pointer  20191020  20191017
381032   45140 8 0    63797 864 864  ref3            icc_-march=haswell_-mtune=haswell_-O2_-fomit-frame-pointer  20191020  20191017
381266   45140 8 0    63797 864 864  ref3            icc_-march=skylake_-mtune=skylake_-O2_-fomit-frame-pointer  20191020  20191017
381730   44100 8 0    62405 864 864  ref3            icc_-march=sandybridge_-mtune=sandybridge_-O2_-fomit-frame-pointer  20191020  20191017
382240   45140 8 0    63797 864 864  ref3            icc_-march=core-avx2_-mtune=core-avx2_-O2_-fomit-frame-pointer  20191020  20191017
399940   6766 8 0     22417 824 896  ref3            gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE  20191020  20191017
408082   5670 8 0     18620 816 896  ref3            gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE  20191020  20191017
408298   4662 8 0     17564 816 896  ref3            gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE  20191020  20191017
490248   21524 8 0    38541 864 864  ref2            icc_-march=corei7_-mtune=corei7_-O2_-fomit-frame-pointer  20191020  20191017
490938   21588 8 0    39901 864 864  ref2            icc_-march=sandybridge_-mtune=sandybridge_-O2_-fomit-frame-pointer  20191020  20191017
491790   21588 8 0    39901 864 864  ref2            icc_-march=corei7-avx_-mtune=corei7-avx_-O2_-fomit-frame-pointer  20191020  20191017
492778   21588 8 0    39901 864 864  ref2            icc_-march=core-avx-i_-mtune=core-avx-i_-O2_-fomit-frame-pointer  20191020  20191017
493696   21588 8 0    39901 864 864  ref2            icc_-march=ivybridge_-mtune=ivybridge_-O2_-fomit-frame-pointer  20191020  20191017
501566   22788 8 0    41749 864 928  ref2            icc_-march=skylake-avx512_-mtune=skylake-avx512_-O2_-fomit-frame-pointer  20191020  20191017
537768   4160 8 0     16020 800 896  ref3            gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE  20191020  20191017
549398   10399 8 0    22372 800 896  goptv           gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE  20191020  20191017
683834   23380 8 0    42061 864 864  ref2            icc_-march=haswell_-mtune=haswell_-O2_-fomit-frame-pointer  20191020  20191017
684992   23380 8 0    42061 864 864  ref2            icc_-march=skylake_-mtune=skylake_-O2_-fomit-frame-pointer  20191020  20191017
685536   23380 8 0    42061 864 864  ref2            icc_-march=broadwell_-mtune=broadwell_-O2_-fomit-frame-pointer  20191020  20191017
697988   23380 8 0    42061 864 864  ref2            icc_-march=core-avx2_-mtune=core-avx2_-O2_-fomit-frame-pointer  20191020  20191017
739250   51628 8 0    70565 864 928  ref3            icc_-march=skylake-avx512_-mtune=skylake-avx512_-O2_-fomit-frame-pointer  20191020  20191017
828482   53804 8 0    73269 864 928  ref3            icc_-march=icelake-client_-mtune=icelake-client_-O2_-fomit-frame-pointer  20191020  20191017
842792   24452 8 0    43941 864 928  ref2            icc_-march=icelake-client_-mtune=icelake-client_-O2_-fomit-frame-pointer  20191020  20191017
848952   12318 8 0    27977 824 896  ref2            gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE  20191020  20191017
1110148  5123 8 0     18044 816 896  ref2            gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE  20191020  20191017
1117076  5846 8 0     18828 816 896  ref2            gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE  20191020  20191017
1540092  4501 8 0     16380 800 896  ref2            gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE  20191020  20191017

Checksum failure

Implementation: ref2
Security model: unknown
Compiler: icc -march=broadwell -mtune=broadwell -O3 -fomit-frame-pointer
546bbefde2a3ae42c5d7a4f76b3194ed3047fa6c10fdedd32adc665f5f2f264f
Number of similar (compiler,implementation) pairs: 22, namely:
Compiler  Implementations
icc -march=broadwell -mtune=broadwell -O3 -fomit-frame-pointer ref2
icc -march=core-avx-i -mtune=core-avx-i -O3 -fomit-frame-pointer ref2
icc -march=core-avx2 -mtune=core-avx2 -O3 -fomit-frame-pointer ref2
icc -march=corei7-avx -mtune=corei7-avx -O3 -fomit-frame-pointer ref2
icc -march=corei7 -mtune=corei7 -O3 -fomit-frame-pointer ref2
icc -march=haswell -mtune=haswell -O3 -fomit-frame-pointer ref2
icc -march=icelake-client -mtune=icelake-client -O3 -fomit-frame-pointer ref2
icc -march=ivybridge -mtune=ivybridge -O3 -fomit-frame-pointer ref2
icc -march=sandybridge -mtune=sandybridge -O3 -fomit-frame-pointer ref2
icc -march=skylake-avx512 -mtune=skylake-avx512 -O3 -fomit-frame-pointer ref2
icc -march=skylake -mtune=skylake -O3 -fomit-frame-pointer ref2
icc -march=broadwell -mtune=broadwell -O3 -fomit-frame-pointer ref3
icc -march=core-avx-i -mtune=core-avx-i -O3 -fomit-frame-pointer ref3
icc -march=core-avx2 -mtune=core-avx2 -O3 -fomit-frame-pointer ref3
icc -march=corei7-avx -mtune=corei7-avx -O3 -fomit-frame-pointer ref3
icc -march=corei7 -mtune=corei7 -O3 -fomit-frame-pointer ref3
icc -march=haswell -mtune=haswell -O3 -fomit-frame-pointer ref3
icc -march=icelake-client -mtune=icelake-client -O3 -fomit-frame-pointer ref3
icc -march=ivybridge -mtune=ivybridge -O3 -fomit-frame-pointer ref3
icc -march=sandybridge -mtune=sandybridge -O3 -fomit-frame-pointer ref3
icc -march=skylake-avx512 -mtune=skylake-avx512 -O3 -fomit-frame-pointer ref3
icc -march=skylake -mtune=skylake -O3 -fomit-frame-pointer ref3

Test failure

Implementation: optimized_nonSSE
Security model: unknown
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE
error 111
crypto_aead_encrypt returns more than crypto_aead_ABYTES extra bytes

Number of similar (compiler,implementation) pairs: 56, namely:
Compiler  Implementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE optimized_nonSSE
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE optimized_nonSSE
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE optimized_nonSSE
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE optimized_nonSSE
icc -march=broadwell -mtune=broadwell -O2 -fomit-frame-pointer optimized_nonSSE
icc -march=broadwell -mtune=broadwell -O3 -fomit-frame-pointer optimized_nonSSE
icc -march=cannonlake -mtune=cannonlake -O2 -fomit-frame-pointer optimized_nonSSE
icc -march=cannonlake -mtune=cannonlake -O3 -fomit-frame-pointer optimized_nonSSE
icc -march=core-avx-i -mtune=core-avx-i -O2 -fomit-frame-pointer optimized_nonSSE
icc -march=core-avx-i -mtune=core-avx-i -O3 -fomit-frame-pointer optimized_nonSSE
icc -march=core-avx2 -mtune=core-avx2 -O2 -fomit-frame-pointer optimized_nonSSE
icc -march=core-avx2 -mtune=core-avx2 -O3 -fomit-frame-pointer optimized_nonSSE
icc -march=corei7-avx -mtune=corei7-avx -O2 -fomit-frame-pointer optimized_nonSSE
icc -march=corei7-avx -mtune=corei7-avx -O3 -fomit-frame-pointer optimized_nonSSE
icc -march=corei7 -mtune=corei7 -O2 -fomit-frame-pointer optimized_nonSSE
icc -march=corei7 -mtune=corei7 -O3 -fomit-frame-pointer optimized_nonSSE
icc -march=haswell -mtune=haswell -O2 -fomit-frame-pointer optimized_nonSSE
icc -march=haswell -mtune=haswell -O3 -fomit-frame-pointer optimized_nonSSE
icc -march=icelake-client -mtune=icelake-client -O2 -fomit-frame-pointer optimized_nonSSE
icc -march=icelake-client -mtune=icelake-client -O3 -fomit-frame-pointer optimized_nonSSE
icc -march=ivybridge -mtune=ivybridge -O2 -fomit-frame-pointer optimized_nonSSE
icc -march=ivybridge -mtune=ivybridge -O3 -fomit-frame-pointer optimized_nonSSE
icc -march=sandybridge -mtune=sandybridge -O2 -fomit-frame-pointer optimized_nonSSE
icc -march=sandybridge -mtune=sandybridge -O3 -fomit-frame-pointer optimized_nonSSE
icc -march=skylake-avx512 -mtune=skylake-avx512 -O2 -fomit-frame-pointer optimized_nonSSE
icc -march=skylake-avx512 -mtune=skylake-avx512 -O3 -fomit-frame-pointer optimized_nonSSE
icc -march=skylake -mtune=skylake -O2 -fomit-frame-pointer optimized_nonSSE
icc -march=skylake -mtune=skylake -O3 -fomit-frame-pointer optimized_nonSSE
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE ref
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE ref
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE ref
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE ref
icc -march=broadwell -mtune=broadwell -O2 -fomit-frame-pointer ref
icc -march=broadwell -mtune=broadwell -O3 -fomit-frame-pointer ref
icc -march=cannonlake -mtune=cannonlake -O2 -fomit-frame-pointer ref
icc -march=cannonlake -mtune=cannonlake -O3 -fomit-frame-pointer ref
icc -march=core-avx-i -mtune=core-avx-i -O2 -fomit-frame-pointer ref
icc -march=core-avx-i -mtune=core-avx-i -O3 -fomit-frame-pointer ref
icc -march=core-avx2 -mtune=core-avx2 -O2 -fomit-frame-pointer ref
icc -march=core-avx2 -mtune=core-avx2 -O3 -fomit-frame-pointer ref
icc -march=corei7-avx -mtune=corei7-avx -O2 -fomit-frame-pointer ref
icc -march=corei7-avx -mtune=corei7-avx -O3 -fomit-frame-pointer ref
icc -march=corei7 -mtune=corei7 -O2 -fomit-frame-pointer ref
icc -march=corei7 -mtune=corei7 -O3 -fomit-frame-pointer ref
icc -march=haswell -mtune=haswell -O2 -fomit-frame-pointer ref
icc -march=haswell -mtune=haswell -O3 -fomit-frame-pointer ref
icc -march=icelake-client -mtune=icelake-client -O2 -fomit-frame-pointer ref
icc -march=icelake-client -mtune=icelake-client -O3 -fomit-frame-pointer ref
icc -march=ivybridge -mtune=ivybridge -O2 -fomit-frame-pointer ref
icc -march=ivybridge -mtune=ivybridge -O3 -fomit-frame-pointer ref
icc -march=sandybridge -mtune=sandybridge -O2 -fomit-frame-pointer ref
icc -march=sandybridge -mtune=sandybridge -O3 -fomit-frame-pointer ref
icc -march=skylake-avx512 -mtune=skylake-avx512 -O2 -fomit-frame-pointer ref
icc -march=skylake-avx512 -mtune=skylake-avx512 -O3 -fomit-frame-pointer ref
icc -march=skylake -mtune=skylake -O2 -fomit-frame-pointer ref
icc -march=skylake -mtune=skylake -O3 -fomit-frame-pointer ref
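
The message reported above for ref and optimized_nonSSE, "crypto_aead_encrypt returns more than crypto_aead_ABYTES extra bytes", indicates that the ciphertext came out longer than the plaintext by more than the bound the implementation declares. The following is a minimal sketch of a length check of that kind, not SUPERCOP's actual test code. The function name check_ciphertext_overhead and the value EXAMPLE_ABYTES = 16 are hypothetical; the real bound would come from the implementation's api.h, which is not shown in this report.

```c
#include <stddef.h>

/* Hypothetical stand-in for the per-primitive crypto_aead_ABYTES constant.
   The real value is declared by the implementation and is not shown here. */
#define EXAMPLE_ABYTES 16ULL

/* Returns 0 if the ciphertext is at most `abytes` longer than the message,
   -1 if the encryption routine produced more extra bytes than declared. */
int check_ciphertext_overhead(unsigned long long mlen,
                              unsigned long long clen,
                              unsigned long long abytes)
{
    /* Compare via subtraction so that mlen + abytes cannot overflow. */
    if (clen > mlen && clen - mlen > abytes) {
        return -1;
    }
    return 0;
}
```

Under these assumptions, a 32-byte message with a 48-byte ciphertext passes the check, while a 49-byte ciphertext would be rejected.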

Test failure

Implementation: ref2
Security model: unknown
Compiler: icc -march=cannonlake -mtune=cannonlake -O2 -fomit-frame-pointer
error 111

Number of similar (compiler,implementation) pairs: 4, namely:
Compiler  Implementations
icc -march=cannonlake -mtune=cannonlake -O2 -fomit-frame-pointer ref2
icc -march=cannonlake -mtune=cannonlake -O3 -fomit-frame-pointer ref2
icc -march=cannonlake -mtune=cannonlake -O2 -fomit-frame-pointer ref3
icc -march=cannonlake -mtune=cannonlake -O3 -fomit-frame-pointer ref3

Compiler output

Implementation: goptv
Security model: unknown
Compiler: icc -march=broadwell -mtune=broadwell -O2 -fomit-frame-pointer
pi-cipher.c: pi-cipher.c(273): error: vector operation not supported with these operand types
pi-cipher.c: return (x << n) | (x >> ((PI_WORD_SIZE) - n));
pi-cipher.c: ^
pi-cipher.c:
pi-cipher.c: pi-cipher.c(273): error: vector operation not supported with these operand types
pi-cipher.c: return (x << n) | (x >> ((PI_WORD_SIZE) - n));
pi-cipher.c: ^
pi-cipher.c:
pi-cipher.c: pi-cipher.c(286): error: "__builtin_shuffle" is not supported
pi-cipher.c: n_t += __builtin_shuffle(y, g_mask);
pi-cipher.c: ^
pi-cipher.c:
pi-cipher.c: pi-cipher.c(287): error: "__builtin_shuffle" is not supported
pi-cipher.c: n_t += __builtin_shuffle(y, n_mask);
pi-cipher.c: ^
pi-cipher.c:
pi-cipher.c: pi-cipher.c(290): error: "__builtin_shuffle" is not supported
pi-cipher.c: n_t ^= __builtin_shuffle(n_t, n_x_1) ^ __builtin_shuffle(n_t, n_x_2);
pi-cipher.c: ^
pi-cipher.c:
pi-cipher.c: pi-cipher.c(290): error: "__builtin_shuffle" is not supported
pi-cipher.c: n_t ^= __builtin_shuffle(n_t, n_x_1) ^ __builtin_shuffle(n_t, n_x_2);
pi-cipher.c: ^
pi-cipher.c:
pi-cipher.c: pi-cipher.c(305): error: "__builtin_shuffle" is not supported
pi-cipher.c: ...

Number of similar (compiler,implementation) pairs: 24, namely:
Compiler  Implementations
icc -march=broadwell -mtune=broadwell -O2 -fomit-frame-pointer goptv
icc -march=broadwell -mtune=broadwell -O3 -fomit-frame-pointer goptv
icc -march=cannonlake -mtune=cannonlake -O2 -fomit-frame-pointer goptv
icc -march=cannonlake -mtune=cannonlake -O3 -fomit-frame-pointer goptv
icc -march=core-avx-i -mtune=core-avx-i -O2 -fomit-frame-pointer goptv
icc -march=core-avx-i -mtune=core-avx-i -O3 -fomit-frame-pointer goptv
icc -march=core-avx2 -mtune=core-avx2 -O2 -fomit-frame-pointer goptv
icc -march=core-avx2 -mtune=core-avx2 -O3 -fomit-frame-pointer goptv
icc -march=corei7-avx -mtune=corei7-avx -O2 -fomit-frame-pointer goptv
icc -march=corei7-avx -mtune=corei7-avx -O3 -fomit-frame-pointer goptv
icc -march=corei7 -mtune=corei7 -O2 -fomit-frame-pointer goptv
icc -march=corei7 -mtune=corei7 -O3 -fomit-frame-pointer goptv
icc -march=haswell -mtune=haswell -O2 -fomit-frame-pointer goptv
icc -march=haswell -mtune=haswell -O3 -fomit-frame-pointer goptv
icc -march=icelake-client -mtune=icelake-client -O2 -fomit-frame-pointer goptv
icc -march=icelake-client -mtune=icelake-client -O3 -fomit-frame-pointer goptv
icc -march=ivybridge -mtune=ivybridge -O2 -fomit-frame-pointer goptv
icc -march=ivybridge -mtune=ivybridge -O3 -fomit-frame-pointer goptv
icc -march=sandybridge -mtune=sandybridge -O2 -fomit-frame-pointer goptv
icc -march=sandybridge -mtune=sandybridge -O3 -fomit-frame-pointer goptv
icc -march=skylake-avx512 -mtune=skylake-avx512 -O2 -fomit-frame-pointer goptv
icc -march=skylake-avx512 -mtune=skylake-avx512 -O3 -fomit-frame-pointer goptv
icc -march=skylake -mtune=skylake -O2 -fomit-frame-pointer goptv
icc -march=skylake -mtune=skylake -O3 -fomit-frame-pointer goptv
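
The icc errors above come from two GCC vector-extension features used in goptv: shift operators applied to vector operands (the rotation at pi-cipher.c line 273) and __builtin_shuffle. One possible way around them, sketched below, is a scalar fallback that performs the same operations lane by lane. This is an illustration only, not the goptv source: the names rotl16 and shuffle4_u16, the lane count of 4, and PI_WORD_SIZE = 16 (guessed from the primitive name pi16cipher) are all assumptions, and a scalar version would likely be slower than the vectorized code.

```c
#include <stdint.h>

#define PI_WORD_SIZE 16 /* assumed from the primitive name pi16cipher */
#define LANES 4         /* assumed lane count, for illustration only */

/* Rotate a 16-bit word left by n bits. n is reduced modulo 16 so that
   a rotation by 0 never shifts by the full word width. */
uint16_t rotl16(uint16_t x, unsigned n)
{
    n &= PI_WORD_SIZE - 1;
    if (n == 0) {
        return x;
    }
    return (uint16_t)((x << n) | (x >> (PI_WORD_SIZE - n)));
}

/* Scalar equivalent of the one-input form of GCC's __builtin_shuffle:
   out[i] = in[mask[i] mod LANES]. A temporary is used so that `out`
   may alias `in`. */
void shuffle4_u16(uint16_t out[LANES],
                  const uint16_t in[LANES],
                  const uint16_t mask[LANES])
{
    uint16_t tmp[LANES];
    for (int i = 0; i < LANES; i++) {
        tmp[i] = in[mask[i] & (LANES - 1)];
    }
    for (int i = 0; i < LANES; i++) {
        out[i] = tmp[i];
    }
}
```

With these helpers, a statement such as n_t += __builtin_shuffle(y, g_mask) would become a call to shuffle4_u16 followed by a lane-wise addition loop, assuming the vectors are stored as plain uint16_t arrays.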