Implementation notes: amd64, genji262, crypto_aead/pi16cipher128v1

Computer: genji262
Architecture: amd64
CPU ID: AuthenticAMD-00800f12-178bfbff
SUPERCOP version: 20191017
Operation: crypto_aead
Primitive: pi16cipher128v1
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
290048111200 0 0129373 840 1024optimized_nonSSEicc_-march=haswell_-mtune=haswell_-O3_-fomit-frame-pointer2019102420191017
290092111200 0 0129373 840 1024optimized_nonSSEicc_-march=broadwell_-mtune=broadwell_-O3_-fomit-frame-pointer2019102420191017
290422111200 0 0129373 840 1024optimized_nonSSEicc_-march=core-avx2_-mtune=core-avx2_-O3_-fomit-frame-pointer2019102420191017
291148109624 0 0126549 840 1024optimized_nonSSEicc_-march=corei7_-mtune=corei7_-O2_-fomit-frame-pointer2019102420191017
291676113440 0 0132013 840 1024optimized_nonSSEicc_-march=skylake_-mtune=skylake_-O2_-fomit-frame-pointer2019102420191017
291698111184 0 0129437 840 1024optimized_nonSSEicc_-march=core-avx-i_-mtune=core-avx-i_-O3_-fomit-frame-pointer2019102420191017
291698113440 0 0132013 840 1024optimized_nonSSEicc_-march=core-avx2_-mtune=core-avx2_-O2_-fomit-frame-pointer2019102420191017
291808113440 0 0132013 840 1024optimized_nonSSEicc_-march=broadwell_-mtune=broadwell_-O2_-fomit-frame-pointer2019102420191017
292314111184 0 0129437 840 1024optimized_nonSSEicc_-march=sandybridge_-mtune=sandybridge_-O3_-fomit-frame-pointer2019102420191017
292468108712 0 0125693 840 1024optimized_nonSSEicc_-march=corei7_-mtune=corei7_-O3_-fomit-frame-pointer2019102420191017
292490111184 0 0129437 840 1024optimized_nonSSEicc_-march=ivybridge_-mtune=ivybridge_-O3_-fomit-frame-pointer2019102420191017
292842113536 0 0131757 840 1024optimized_nonSSEicc_-march=corei7-avx_-mtune=corei7-avx_-O2_-fomit-frame-pointer2019102420191017
293040111184 0 0129437 840 1024optimized_nonSSEicc_-march=corei7-avx_-mtune=corei7-avx_-O3_-fomit-frame-pointer2019102420191017
293084113536 0 0131757 840 1024optimized_nonSSEicc_-march=ivybridge_-mtune=ivybridge_-O2_-fomit-frame-pointer2019102420191017
293304113536 0 0131757 840 1024optimized_nonSSEicc_-march=core-avx-i_-mtune=core-avx-i_-O2_-fomit-frame-pointer2019102420191017
293700113440 0 0132013 840 1024optimized_nonSSEicc_-march=haswell_-mtune=haswell_-O2_-fomit-frame-pointer2019102420191017
293810113536 0 0131757 840 1024optimized_nonSSEicc_-march=sandybridge_-mtune=sandybridge_-O2_-fomit-frame-pointer2019102420191017
293920111200 0 0129373 840 1024optimized_nonSSEicc_-march=skylake_-mtune=skylake_-O3_-fomit-frame-pointer2019102420191017
37589263936 0 082509 840 896reficc_-march=skylake_-mtune=skylake_-O2_-fomit-frame-pointer2019102420191017
37626663936 0 082509 840 896reficc_-march=haswell_-mtune=haswell_-O2_-fomit-frame-pointer2019102420191017
37646463936 0 082509 840 896reficc_-march=core-avx2_-mtune=core-avx2_-O2_-fomit-frame-pointer2019102420191017
37738865008 0 083229 840 896reficc_-march=core-avx-i_-mtune=core-avx-i_-O2_-fomit-frame-pointer2019102420191017
37741065008 0 083229 840 896reficc_-march=sandybridge_-mtune=sandybridge_-O2_-fomit-frame-pointer2019102420191017
37743265008 0 083229 840 896reficc_-march=corei7-avx_-mtune=corei7-avx_-O2_-fomit-frame-pointer2019102420191017
37745465008 0 083229 840 896reficc_-march=ivybridge_-mtune=ivybridge_-O2_-fomit-frame-pointer2019102420191017
38060063208 0 080133 840 896reficc_-march=corei7_-mtune=corei7_-O2_-fomit-frame-pointer2019102420191017
38390062560 0 080813 840 896reficc_-march=ivybridge_-mtune=ivybridge_-O3_-fomit-frame-pointer2019102420191017
38416462560 0 080813 840 896reficc_-march=corei7-avx_-mtune=corei7-avx_-O3_-fomit-frame-pointer2019102420191017
38420861728 0 079901 840 896reficc_-march=broadwell_-mtune=broadwell_-O3_-fomit-frame-pointer2019102420191017
38420861728 0 079901 840 896reficc_-march=core-avx2_-mtune=core-avx2_-O3_-fomit-frame-pointer2019102420191017
38425263936 0 082509 840 896reficc_-march=broadwell_-mtune=broadwell_-O2_-fomit-frame-pointer2019102420191017
38427462560 0 080813 840 896reficc_-march=core-avx-i_-mtune=core-avx-i_-O3_-fomit-frame-pointer2019102420191017
38429662560 0 080813 840 896reficc_-march=sandybridge_-mtune=sandybridge_-O3_-fomit-frame-pointer2019102420191017
38442861728 0 079901 840 896reficc_-march=haswell_-mtune=haswell_-O3_-fomit-frame-pointer2019102420191017
38451661728 0 079901 840 896reficc_-march=skylake_-mtune=skylake_-O3_-fomit-frame-pointer2019102420191017
38781661224 0 078205 840 896reficc_-march=corei7_-mtune=corei7_-O3_-fomit-frame-pointer2019102420191017

Test failure

Implementation: optimized_nonSSE
Security model: unknown
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
error 111
crypto_aead_decrypt returns nonzero

Number of similar (compiler,implementation) pairs: 18, namely:
CompilerImplementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE optimized_nonSSE
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE optimized_nonSSE
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE optimized_nonSSE
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE optimized_nonSSE
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE optimized_nonSSE
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE optimized_nonSSE
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE optimized_nonSSE
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE optimized_nonSSE
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE optimized_nonSSE
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE ref
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE ref
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE ref
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE ref
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE ref
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE ref
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE ref
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE ref
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE ref

Test failure

Implementation: optimized_nonSSE
Security model: unknown
Compiler: icc -march=cannonlake -mtune=cannonlake -O2 -fomit-frame-pointer
error 111

Number of similar (compiler,implementation) pairs: 12, namely:
CompilerImplementations
icc -march=cannonlake -mtune=cannonlake -O2 -fomit-frame-pointer optimized_nonSSE
icc -march=cannonlake -mtune=cannonlake -O3 -fomit-frame-pointer optimized_nonSSE
icc -march=icelake-client -mtune=icelake-client -O2 -fomit-frame-pointer optimized_nonSSE
icc -march=icelake-client -mtune=icelake-client -O3 -fomit-frame-pointer optimized_nonSSE
icc -march=skylake-avx512 -mtune=skylake-avx512 -O2 -fomit-frame-pointer optimized_nonSSE
icc -march=skylake-avx512 -mtune=skylake-avx512 -O3 -fomit-frame-pointer optimized_nonSSE
icc -march=cannonlake -mtune=cannonlake -O2 -fomit-frame-pointer ref
icc -march=cannonlake -mtune=cannonlake -O3 -fomit-frame-pointer ref
icc -march=icelake-client -mtune=icelake-client -O2 -fomit-frame-pointer ref
icc -march=icelake-client -mtune=icelake-client -O3 -fomit-frame-pointer ref
icc -march=skylake-avx512 -mtune=skylake-avx512 -O2 -fomit-frame-pointer ref
icc -march=skylake-avx512 -mtune=skylake-avx512 -O3 -fomit-frame-pointer ref

Compiler output

Implementation: optimized_nonSSE
Security model: unknown
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
encrypt.c: encrypt.c:362:42: warning: unsequenced modification and access to 'i1' [-Wunsequenced]
encrypt.c: InternalState8[i1] = InternalState8[i1++] ^ ad[b+i];
encrypt.c: ~~ ^
encrypt.c: encrypt.c:488:68: warning: unsequenced modification and access to 'i1' [-Wunsequenced]
encrypt.c: c[CRYPTO_NSECBYTES+b+i] = InternalState8[i1] = InternalState8[i1++] ^ m[b+i];
encrypt.c: ~~ ^
encrypt.c: encrypt.c:650:42: warning: unsequenced modification and access to 'i1' [-Wunsequenced]
encrypt.c: InternalState8[i1] = InternalState8[i1++] ^ ad[b+i];
encrypt.c: ~~ ^
encrypt.c: 3 warnings generated.

Number of similar (compiler,implementation) pairs: 5, namely:
CompilerImplementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE optimized_nonSSE
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE optimized_nonSSE
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE optimized_nonSSE
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE optimized_nonSSE
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE optimized_nonSSE

Compiler output

Implementation: ref
Security model: unknown
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
encrypt.c: encrypt.c:248:42: warning: unsequenced modification and access to 'i1' [-Wunsequenced]
encrypt.c: InternalState8[i1] = InternalState8[i1++] ^ ad[b+i];
encrypt.c: ~~ ^
encrypt.c: encrypt.c:374:68: warning: unsequenced modification and access to 'i1' [-Wunsequenced]
encrypt.c: c[CRYPTO_NSECBYTES+b+i] = InternalState8[i1] = InternalState8[i1++] ^ m[b+i];
encrypt.c: ~~ ^
encrypt.c: encrypt.c:536:42: warning: unsequenced modification and access to 'i1' [-Wunsequenced]
encrypt.c: InternalState8[i1] = InternalState8[i1++] ^ ad[b+i];
encrypt.c: ~~ ^
encrypt.c: 3 warnings generated.

Number of similar (compiler,implementation) pairs: 5, namely:
CompilerImplementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE ref
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE ref
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE ref
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE ref
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE ref