Implementation notes: amd64, pmnod076, crypto_aead/pi64cipher128v2

Computer: pmnod076
Architecture: amd64
CPU ID: GenuineIntel-00050657-bfebfbff
SUPERCOP version: 20191017
Operation: crypto_aead
Primitive: pi64cipher128v2
Time | Object size | Test size | Implementation | Compiler | Benchmark date | SUPERCOP version
29306 | 24366 8 0 | 40483 840 896 | goptv | gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20191023 | 20191017
30202 | 18017 8 0 | 30998 832 896 | goptv | gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20191023 | 20191017
30440 | 20786 8 0 | 34182 832 896 | goptv | gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20191023 | 20191017
54702 | 8251 8 0 | 20294 816 896 | goptv | gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20191023 | 20191017
116296 | 9028 8 0 | 25115 840 896 | ref3 | gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20191023 | 20191017
117532 | 6982 8 0 | 20342 832 896 | ref3 | gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20191023 | 20191017
117700 | 5256 8 0 | 18222 832 896 | ref3 | gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20191023 | 20191017
131716 | 4535 8 0 | 16470 816 896 | ref3 | gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20191023 | 20191017
182552 | 21220 8 0 | 38237 864 864 | ref3 | icc_-march=corei7_-mtune=corei7_-O2_-fomit-frame-pointer | 20191023 | 20191017
185228 | 22884 8 0 | 41197 864 864 | ref3 | icc_-march=sandybridge_-mtune=sandybridge_-O2_-fomit-frame-pointer | 20191023 | 20191017
185282 | 22884 8 0 | 41197 864 864 | ref3 | icc_-march=corei7-avx_-mtune=corei7-avx_-O2_-fomit-frame-pointer | 20191023 | 20191017
185400 | 22884 8 0 | 41197 864 864 | ref3 | icc_-march=ivybridge_-mtune=ivybridge_-O2_-fomit-frame-pointer | 20191023 | 20191017
186556 | 22884 8 0 | 41197 864 864 | ref3 | icc_-march=core-avx-i_-mtune=core-avx-i_-O2_-fomit-frame-pointer | 20191023 | 20191017
187654 | 38396 8 0 | 57869 864 928 | ref2 | icc_-march=icelake-client_-mtune=icelake-client_-O2_-fomit-frame-pointer | 20191023 | 20191017
211694 | 42284 8 0 | 60949 864 864 | ref2 | icc_-march=core-avx2_-mtune=core-avx2_-O2_-fomit-frame-pointer | 20191023 | 20191017
211748 | 42284 8 0 | 60949 864 864 | ref2 | icc_-march=haswell_-mtune=haswell_-O2_-fomit-frame-pointer | 20191023 | 20191017
211830 | 42252 8 0 | 60917 864 864 | ref2 | icc_-march=skylake_-mtune=skylake_-O2_-fomit-frame-pointer | 20191023 | 20191017
213070 | 42252 8 0 | 60917 864 864 | ref2 | icc_-march=broadwell_-mtune=broadwell_-O2_-fomit-frame-pointer | 20191023 | 20191017
228740 | 19276 8 0 | 38221 864 928 | ref3 | icc_-march=skylake-avx512_-mtune=skylake-avx512_-O2_-fomit-frame-pointer | 20191023 | 20191017
232408 | 42700 8 0 | 61013 864 864 | ref2 | icc_-march=corei7-avx_-mtune=corei7-avx_-O2_-fomit-frame-pointer | 20191023 | 20191017
232538 | 42700 8 0 | 61013 864 864 | ref2 | icc_-march=core-avx-i_-mtune=core-avx-i_-O2_-fomit-frame-pointer | 20191023 | 20191017
232846 | 42444 8 0 | 59461 864 864 | ref2 | icc_-march=corei7_-mtune=corei7_-O2_-fomit-frame-pointer | 20191023 | 20191017
232910 | 42700 8 0 | 61013 864 864 | ref2 | icc_-march=ivybridge_-mtune=ivybridge_-O2_-fomit-frame-pointer | 20191023 | 20191017
233204 | 42700 8 0 | 61013 864 864 | ref2 | icc_-march=sandybridge_-mtune=sandybridge_-O2_-fomit-frame-pointer | 20191023 | 20191017
255854 | 19348 8 0 | 38013 864 864 | ref3 | icc_-march=skylake_-mtune=skylake_-O2_-fomit-frame-pointer | 20191023 | 20191017
255880 | 19348 8 0 | 38013 864 864 | ref3 | icc_-march=haswell_-mtune=haswell_-O2_-fomit-frame-pointer | 20191023 | 20191017
256816 | 19348 8 0 | 38013 864 864 | ref3 | icc_-march=broadwell_-mtune=broadwell_-O2_-fomit-frame-pointer | 20191023 | 20191017
257434 | 19348 8 0 | 38013 864 864 | ref3 | icc_-march=core-avx2_-mtune=core-avx2_-O2_-fomit-frame-pointer | 20191023 | 20191017
279340 | 37692 8 0 | 56637 864 928 | ref2 | icc_-march=skylake-avx512_-mtune=skylake-avx512_-O2_-fomit-frame-pointer | 20191023 | 20191017
301480 | 6279 8 0 | 19270 832 896 | ref2 | gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20191023 | 20191017
302038 | 7102 8 0 | 20510 832 896 | ref2 | gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20191023 | 20191017
311214 | 21492 8 0 | 40965 864 928 | ref3 | icc_-march=icelake-client_-mtune=icelake-client_-O2_-fomit-frame-pointer | 20191023 | 20191017
312636 | 4932 8 0 | 16894 816 896 | ref2 | gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20191023 | 20191017
443316 | 10974 8 0 | 27067 840 896 | ref2 | gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20191023 | 20191017

Checksum failure

Implementation: crypto_aead/pi64cipher128v2/ref2
Compiler: icc -march=broadwell -mtune=broadwell -O3 -fomit-frame-pointer
f483bb81e48750bf13761c1c923dbc6a911ed7dc8a4a03b4b3853bf72de6f0c9
Number of similar (compiler,implementation) pairs: 22, namely:
Compiler Implementations
icc -march=broadwell -mtune=broadwell -O3 -fomit-frame-pointer ref2 ref3
icc -march=core-avx-i -mtune=core-avx-i -O3 -fomit-frame-pointer ref2 ref3
icc -march=core-avx2 -mtune=core-avx2 -O3 -fomit-frame-pointer ref2 ref3
icc -march=corei7-avx -mtune=corei7-avx -O3 -fomit-frame-pointer ref2 ref3
icc -march=corei7 -mtune=corei7 -O3 -fomit-frame-pointer ref2 ref3
icc -march=haswell -mtune=haswell -O3 -fomit-frame-pointer ref2 ref3
icc -march=icelake-client -mtune=icelake-client -O3 -fomit-frame-pointer ref2 ref3
icc -march=ivybridge -mtune=ivybridge -O3 -fomit-frame-pointer ref2 ref3
icc -march=sandybridge -mtune=sandybridge -O3 -fomit-frame-pointer ref2 ref3
icc -march=skylake-avx512 -mtune=skylake-avx512 -O3 -fomit-frame-pointer ref2 ref3
icc -march=skylake -mtune=skylake -O3 -fomit-frame-pointer ref2 ref3

Test failure

Implementation: crypto_aead/pi64cipher128v2/optimized_nonSSE
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE
error 111
crypto_aead_encrypt returns more than crypto_aead_ABYTES extra bytes

Number of similar (compiler,implementation) pairs: 56, namely:
Compiler Implementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE optimized_nonSSE ref
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE optimized_nonSSE ref
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE optimized_nonSSE ref
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE optimized_nonSSE ref
icc -march=broadwell -mtune=broadwell -O2 -fomit-frame-pointer optimized_nonSSE ref
icc -march=broadwell -mtune=broadwell -O3 -fomit-frame-pointer optimized_nonSSE ref
icc -march=cannonlake -mtune=cannonlake -O2 -fomit-frame-pointer optimized_nonSSE ref
icc -march=cannonlake -mtune=cannonlake -O3 -fomit-frame-pointer optimized_nonSSE ref
icc -march=core-avx-i -mtune=core-avx-i -O2 -fomit-frame-pointer optimized_nonSSE ref
icc -march=core-avx-i -mtune=core-avx-i -O3 -fomit-frame-pointer optimized_nonSSE ref
icc -march=core-avx2 -mtune=core-avx2 -O2 -fomit-frame-pointer optimized_nonSSE ref
icc -march=core-avx2 -mtune=core-avx2 -O3 -fomit-frame-pointer optimized_nonSSE ref
icc -march=corei7-avx -mtune=corei7-avx -O2 -fomit-frame-pointer optimized_nonSSE ref
icc -march=corei7-avx -mtune=corei7-avx -O3 -fomit-frame-pointer optimized_nonSSE ref
icc -march=corei7 -mtune=corei7 -O2 -fomit-frame-pointer optimized_nonSSE ref
icc -march=corei7 -mtune=corei7 -O3 -fomit-frame-pointer optimized_nonSSE ref
icc -march=haswell -mtune=haswell -O2 -fomit-frame-pointer optimized_nonSSE ref
icc -march=haswell -mtune=haswell -O3 -fomit-frame-pointer optimized_nonSSE ref
icc -march=icelake-client -mtune=icelake-client -O2 -fomit-frame-pointer optimized_nonSSE ref
icc -march=icelake-client -mtune=icelake-client -O3 -fomit-frame-pointer optimized_nonSSE ref
icc -march=ivybridge -mtune=ivybridge -O2 -fomit-frame-pointer optimized_nonSSE ref
icc -march=ivybridge -mtune=ivybridge -O3 -fomit-frame-pointer optimized_nonSSE ref
icc -march=sandybridge -mtune=sandybridge -O2 -fomit-frame-pointer optimized_nonSSE ref
icc -march=sandybridge -mtune=sandybridge -O3 -fomit-frame-pointer optimized_nonSSE ref
icc -march=skylake-avx512 -mtune=skylake-avx512 -O2 -fomit-frame-pointer optimized_nonSSE ref
icc -march=skylake-avx512 -mtune=skylake-avx512 -O3 -fomit-frame-pointer optimized_nonSSE ref
icc -march=skylake -mtune=skylake -O2 -fomit-frame-pointer optimized_nonSSE ref
icc -march=skylake -mtune=skylake -O3 -fomit-frame-pointer optimized_nonSSE ref
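
The message "crypto_aead_encrypt returns more than crypto_aead_ABYTES extra bytes" in the failure above indicates that the reported ciphertext length exceeded the message length by more than the declared overhead. A minimal sketch of such a bound check follows. The helper name check_aead_overhead and its return codes are hypothetical and are not taken from SUPERCOP's test code; abytes stands in for the implementation's declared crypto_aead_ABYTES value.

```c
/* Hypothetical helper (not SUPERCOP's actual test code): classifies the
 * ciphertext length reported by an AEAD encryption call.
 *
 *   mlen   - plaintext length in bytes
 *   clen   - ciphertext length reported by the encryption call
 *   abytes - maximum overhead the implementation declares
 *
 * Returns  0 if mlen <= clen <= mlen + abytes,
 *         -1 if the ciphertext is shorter than the plaintext,
 *          1 if the overhead exceeds abytes.
 */
int check_aead_overhead(unsigned long long mlen,
                        unsigned long long clen,
                        unsigned long long abytes)
{
    if (clen < mlen)
        return -1;
    /* clen >= mlen here, so the subtraction cannot wrap around. */
    if (clen - mlen > abytes)
        return 1;
    return 0;
}
```

Under these assumptions, a 100-byte message with a declared overhead of 16 bytes passes with a 116-byte ciphertext and is flagged with a 132-byte one. A failure of this kind can point to a declared overhead that is too small for the ciphertext the implementation actually produces.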

Test failure

Implementation: crypto_aead/pi64cipher128v2/ref2
Compiler: icc -march=cannonlake -mtune=cannonlake -O2 -fomit-frame-pointer
error 111

Number of similar (compiler,implementation) pairs: 4, namely:
Compiler Implementations
icc -march=cannonlake -mtune=cannonlake -O2 -fomit-frame-pointer ref2 ref3
icc -march=cannonlake -mtune=cannonlake -O3 -fomit-frame-pointer ref2 ref3

Compiler output

Implementation: crypto_aead/pi64cipher128v2/goptv
Compiler: icc -march=broadwell -mtune=broadwell -O2 -fomit-frame-pointer
pi-cipher.c: pi-cipher.c(286): error: "__builtin_shuffle" is not supported
pi-cipher.c: n_t += __builtin_shuffle(y, g_mask);
pi-cipher.c: ^
pi-cipher.c:
pi-cipher.c: pi-cipher.c(287): error: "__builtin_shuffle" is not supported
pi-cipher.c: n_t += __builtin_shuffle(y, n_mask);
pi-cipher.c: ^
pi-cipher.c:
pi-cipher.c: pi-cipher.c(290): error: "__builtin_shuffle" is not supported
pi-cipher.c: n_t ^= __builtin_shuffle(n_t, n_x_1) ^ __builtin_shuffle(n_t, n_x_2);
pi-cipher.c: ^
pi-cipher.c:
pi-cipher.c: pi-cipher.c(290): error: "__builtin_shuffle" is not supported
pi-cipher.c: n_t ^= __builtin_shuffle(n_t, n_x_1) ^ __builtin_shuffle(n_t, n_x_2);
pi-cipher.c: ^
pi-cipher.c:
pi-cipher.c: pi-cipher.c(305): error: "__builtin_shuffle" is not supported
pi-cipher.c: m_t += __builtin_shuffle(x, g_mask);
pi-cipher.c: ^
pi-cipher.c:
pi-cipher.c: pi-cipher.c(306): error: "__builtin_shuffle" is not supported
pi-cipher.c: m_t += __builtin_shuffle(x, m_mask);
pi-cipher.c: ^
pi-cipher.c:
pi-cipher.c: pi-cipher.c(309): error: "__builtin_shuffle" is not supported
pi-cipher.c: ...

Number of similar (compiler,implementation) pairs: 24, namely:
Compiler Implementations
icc -march=broadwell -mtune=broadwell -O2 -fomit-frame-pointer goptv
icc -march=broadwell -mtune=broadwell -O3 -fomit-frame-pointer goptv
icc -march=cannonlake -mtune=cannonlake -O2 -fomit-frame-pointer goptv
icc -march=cannonlake -mtune=cannonlake -O3 -fomit-frame-pointer goptv
icc -march=core-avx-i -mtune=core-avx-i -O2 -fomit-frame-pointer goptv
icc -march=core-avx-i -mtune=core-avx-i -O3 -fomit-frame-pointer goptv
icc -march=core-avx2 -mtune=core-avx2 -O2 -fomit-frame-pointer goptv
icc -march=core-avx2 -mtune=core-avx2 -O3 -fomit-frame-pointer goptv
icc -march=corei7-avx -mtune=corei7-avx -O2 -fomit-frame-pointer goptv
icc -march=corei7-avx -mtune=corei7-avx -O3 -fomit-frame-pointer goptv
icc -march=corei7 -mtune=corei7 -O2 -fomit-frame-pointer goptv
icc -march=corei7 -mtune=corei7 -O3 -fomit-frame-pointer goptv
icc -march=haswell -mtune=haswell -O2 -fomit-frame-pointer goptv
icc -march=haswell -mtune=haswell -O3 -fomit-frame-pointer goptv
icc -march=icelake-client -mtune=icelake-client -O2 -fomit-frame-pointer goptv
icc -march=icelake-client -mtune=icelake-client -O3 -fomit-frame-pointer goptv
icc -march=ivybridge -mtune=ivybridge -O2 -fomit-frame-pointer goptv
icc -march=ivybridge -mtune=ivybridge -O3 -fomit-frame-pointer goptv
icc -march=sandybridge -mtune=sandybridge -O2 -fomit-frame-pointer goptv
icc -march=sandybridge -mtune=sandybridge -O3 -fomit-frame-pointer goptv
icc -march=skylake-avx512 -mtune=skylake-avx512 -O2 -fomit-frame-pointer goptv
icc -march=skylake-avx512 -mtune=skylake-avx512 -O3 -fomit-frame-pointer goptv
icc -march=skylake -mtune=skylake -O2 -fomit-frame-pointer goptv
icc -march=skylake -mtune=skylake -O3 -fomit-frame-pointer goptv
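
The icc errors above arise because __builtin_shuffle is a GCC vector extension that this compiler rejects. One common way to keep such code buildable elsewhere is to guard the builtin behind a compiler check and provide an element-wise fallback. The sketch below illustrates that idea only: the helper name shuffle4_u64, the macro HAVE_BUILTIN_SHUFFLE, and the choice of four 64-bit lanes are assumptions for illustration and are not taken from pi-cipher.c.

```c
#include <stdint.h>
#include <string.h>

/* Assumption: only plain GCC is treated as providing __builtin_shuffle.
 * icc and clang also define __GNUC__, so they are excluded explicitly. */
#if defined(__GNUC__) && !defined(__clang__) && !defined(__INTEL_COMPILER)
#define HAVE_BUILTIN_SHUFFLE 1
#endif

#ifdef HAVE_BUILTIN_SHUFFLE
typedef uint64_t v4u64 __attribute__((vector_size(32)));

/* out[i] = in[mask[i] % 4], computed with the GCC builtin. */
void shuffle4_u64(uint64_t out[4], const uint64_t in[4],
                  const uint64_t mask[4])
{
    v4u64 v, m, r;
    memcpy(&v, in, sizeof v);
    memcpy(&m, mask, sizeof m);
    r = __builtin_shuffle(v, m);
    memcpy(out, &r, sizeof r);
}
#else
/* Portable fallback with the same semantics: out[i] = in[mask[i] % 4]. */
void shuffle4_u64(uint64_t out[4], const uint64_t in[4],
                  const uint64_t mask[4])
{
    uint64_t tmp[4];
    int i;
    for (i = 0; i < 4; i++)
        tmp[i] = in[mask[i] & 3];
    /* Copy via a temporary so that out may alias in. */
    memcpy(out, tmp, sizeof tmp);
}
#endif
```

Both branches select lanes modulo the vector length, so callers see the same result either way; the fallback trades speed for portability.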