Implementation notes: amd64, genji346, crypto_aead/pi32cipher128v1

Computer: genji346
Architecture: amd64
CPU ID: AuthenticAMD-00830f10-178bfbff
SUPERCOP version: 20191017
Operation: crypto_aead
Primitive: pi32cipher128v1
Time | Object size | Test size | Implementation | Compiler | Benchmark date | SUPERCOP version
124040 | 102832 0 0 | 121421 856 992 | optimized_nonSSE | icc_-march=core-avx2_-mtune=core-avx2_-O2_-fomit-frame-pointer | 20191025 | 20191017
124080 | 102832 0 0 | 121421 856 992 | optimized_nonSSE | icc_-march=broadwell_-mtune=broadwell_-O2_-fomit-frame-pointer | 20191025 | 20191017
124160 | 102832 0 0 | 121421 856 992 | optimized_nonSSE | icc_-march=haswell_-mtune=haswell_-O2_-fomit-frame-pointer | 20191025 | 20191017
124300 | 100872 0 0 | 117869 856 992 | optimized_nonSSE | icc_-march=corei7_-mtune=corei7_-O3_-fomit-frame-pointer | 20191025 | 20191017
124420 | 109248 0 0 | 127437 856 992 | optimized_nonSSE | icc_-march=core-avx2_-mtune=core-avx2_-O3_-fomit-frame-pointer | 20191025 | 20191017
124460 | 92840 0 0 | 109781 856 992 | optimized_nonSSE | icc_-march=corei7_-mtune=corei7_-O2_-fomit-frame-pointer | 20191025 | 20191017
124460 | 109248 0 0 | 127437 856 992 | optimized_nonSSE | icc_-march=skylake_-mtune=skylake_-O3_-fomit-frame-pointer | 20191025 | 20191017
124560 | 109248 0 0 | 127437 856 992 | optimized_nonSSE | icc_-march=broadwell_-mtune=broadwell_-O3_-fomit-frame-pointer | 20191025 | 20191017
124640 | 109248 0 0 | 127437 856 992 | optimized_nonSSE | icc_-march=haswell_-mtune=haswell_-O3_-fomit-frame-pointer | 20191025 | 20191017
124700 | 102832 0 0 | 121421 856 992 | optimized_nonSSE | icc_-march=skylake_-mtune=skylake_-O2_-fomit-frame-pointer | 20191025 | 20191017
151740 | 51976 0 0 | 68973 856 928 | ref | icc_-march=corei7_-mtune=corei7_-O3_-fomit-frame-pointer | 20191025 | 20191017
153800 | 55240 0 0 | 73429 856 928 | ref | icc_-march=haswell_-mtune=haswell_-O3_-fomit-frame-pointer | 20191025 | 20191017
153840 | 55240 0 0 | 73429 856 928 | ref | icc_-march=core-avx2_-mtune=core-avx2_-O3_-fomit-frame-pointer | 20191025 | 20191017
153900 | 55240 0 0 | 73429 856 928 | ref | icc_-march=broadwell_-mtune=broadwell_-O3_-fomit-frame-pointer | 20191025 | 20191017
154280 | 55240 0 0 | 73429 856 928 | ref | icc_-march=skylake_-mtune=skylake_-O3_-fomit-frame-pointer | 20191025 | 20191017
155100 | 55416 0 0 | 72357 856 928 | ref | icc_-march=corei7_-mtune=corei7_-O2_-fomit-frame-pointer | 20191025 | 20191017
157560 | 58608 0 0 | 77197 856 928 | ref | icc_-march=broadwell_-mtune=broadwell_-O2_-fomit-frame-pointer | 20191025 | 20191017
157620 | 58608 0 0 | 77197 856 928 | ref | icc_-march=haswell_-mtune=haswell_-O2_-fomit-frame-pointer | 20191025 | 20191017
157680 | 58608 0 0 | 77197 856 928 | ref | icc_-march=skylake_-mtune=skylake_-O2_-fomit-frame-pointer | 20191025 | 20191017
157860 | 58608 0 0 | 77197 856 928 | ref | icc_-march=core-avx2_-mtune=core-avx2_-O2_-fomit-frame-pointer | 20191025 | 20191017
209600 | 106656 0 0 | 124925 856 992 | optimized_nonSSE | icc_-march=sandybridge_-mtune=sandybridge_-O3_-fomit-frame-pointer | 20191025 | 20191017
209680 | 106656 0 0 | 124925 856 992 | optimized_nonSSE | icc_-march=core-avx-i_-mtune=core-avx-i_-O3_-fomit-frame-pointer | 20191025 | 20191017
209780 | 99488 0 0 | 117725 856 992 | optimized_nonSSE | icc_-march=corei7-avx_-mtune=corei7-avx_-O2_-fomit-frame-pointer | 20191025 | 20191017
209840 | 106656 0 0 | 124925 856 992 | optimized_nonSSE | icc_-march=corei7-avx_-mtune=corei7-avx_-O3_-fomit-frame-pointer | 20191025 | 20191017
209880 | 106656 0 0 | 124925 856 992 | optimized_nonSSE | icc_-march=ivybridge_-mtune=ivybridge_-O3_-fomit-frame-pointer | 20191025 | 20191017
209980 | 99488 0 0 | 117725 856 992 | optimized_nonSSE | icc_-march=core-avx-i_-mtune=core-avx-i_-O2_-fomit-frame-pointer | 20191025 | 20191017
210040 | 99488 0 0 | 117725 856 992 | optimized_nonSSE | icc_-march=sandybridge_-mtune=sandybridge_-O2_-fomit-frame-pointer | 20191025 | 20191017
210120 | 99488 0 0 | 117725 856 992 | optimized_nonSSE | icc_-march=ivybridge_-mtune=ivybridge_-O2_-fomit-frame-pointer | 20191025 | 20191017
265560 | 54192 0 0 | 72461 856 928 | ref | icc_-march=core-avx-i_-mtune=core-avx-i_-O3_-fomit-frame-pointer | 20191025 | 20191017
265640 | 54192 0 0 | 72461 856 928 | ref | icc_-march=corei7-avx_-mtune=corei7-avx_-O3_-fomit-frame-pointer | 20191025 | 20191017
266240 | 54192 0 0 | 72461 856 928 | ref | icc_-march=ivybridge_-mtune=ivybridge_-O3_-fomit-frame-pointer | 20191025 | 20191017
266900 | 54192 0 0 | 72461 856 928 | ref | icc_-march=sandybridge_-mtune=sandybridge_-O3_-fomit-frame-pointer | 20191025 | 20191017
274160 | 57696 0 0 | 75933 856 928 | ref | icc_-march=corei7-avx_-mtune=corei7-avx_-O2_-fomit-frame-pointer | 20191025 | 20191017
274160 | 57696 0 0 | 75933 856 928 | ref | icc_-march=sandybridge_-mtune=sandybridge_-O2_-fomit-frame-pointer | 20191025 | 20191017
274180 | 57696 0 0 | 75933 856 928 | ref | icc_-march=ivybridge_-mtune=ivybridge_-O2_-fomit-frame-pointer | 20191025 | 20191017
275000 | 57696 0 0 | 75933 856 928 | ref | icc_-march=core-avx-i_-mtune=core-avx-i_-O2_-fomit-frame-pointer | 20191025 | 20191017

Test failure

Implementation: optimized_nonSSE
Security model: unknown
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
error 111
crypto_aead_decrypt returns nonzero

Number of similar (compiler,implementation) pairs: 18, namely:
Compiler Implementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE optimized_nonSSE
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE optimized_nonSSE
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE optimized_nonSSE
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE optimized_nonSSE
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE optimized_nonSSE
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE optimized_nonSSE
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE optimized_nonSSE
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE optimized_nonSSE
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE optimized_nonSSE
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE ref
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE ref
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE ref
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE ref
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE ref
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE ref
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE ref
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE ref
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE ref
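
The message "crypto_aead_decrypt returns nonzero" indicates that decryption reported a failure, which for an AEAD normally means the ciphertext or tag was rejected. The kind of encrypt-then-decrypt round-trip check that can surface such a failure can be sketched as follows. This is a minimal illustration under stated assumptions: `toy_encrypt`, `toy_decrypt`, `roundtrip_check`, and `TOY_ABYTES` are hypothetical names, the toy functions use a simplified argument list (no associated data or nonces), and the toy cipher is neither pi32cipher128v1 nor secure.

```c
#include <stddef.h>
#include <string.h>

#define TOY_ABYTES 1  /* hypothetical tag length of the stand-in cipher */

/* Toy stand-in for an AEAD encrypt call: XOR with one key byte plus a
   one-byte checksum "tag". NOT pi-Cipher and NOT secure. */
static int toy_encrypt(unsigned char *c, unsigned long long *clen,
                       const unsigned char *m, unsigned long long mlen,
                       const unsigned char *k)
{
    unsigned long long i;
    unsigned char tag = k[0];
    for (i = 0; i < mlen; i++) {
        c[i] = (unsigned char)(m[i] ^ k[0]);
        tag ^= m[i];
    }
    c[mlen] = tag;
    *clen = mlen + TOY_ABYTES;
    return 0;
}

/* Toy stand-in for an AEAD decrypt call: returns nonzero if the tag
   does not match, mirroring the convention in the report's message. */
static int toy_decrypt(unsigned char *m, unsigned long long *mlen,
                       const unsigned char *c, unsigned long long clen,
                       const unsigned char *k)
{
    unsigned long long i;
    unsigned char tag = k[0];
    if (clen < TOY_ABYTES) return -1;
    for (i = 0; i + TOY_ABYTES < clen; i++) {
        m[i] = (unsigned char)(c[i] ^ k[0]);
        tag ^= m[i];
    }
    if (tag != c[clen - TOY_ABYTES]) return -1;
    *mlen = clen - TOY_ABYTES;
    return 0;
}

/* Returns 0 on success, or a nonzero code naming the step that failed. */
static int roundtrip_check(const unsigned char *m, unsigned long long mlen,
                           const unsigned char *k)
{
    unsigned char c[64 + TOY_ABYTES];
    unsigned char t[64];
    unsigned long long clen = 0, tlen = 0;
    if (mlen > 64) return 1;                                /* buffer limit */
    if (toy_encrypt(c, &clen, m, mlen, k) != 0) return 2;   /* encrypt failed */
    if (toy_decrypt(t, &tlen, c, clen, k) != 0) return 3;   /* decrypt returns nonzero */
    if (tlen != mlen || memcmp(t, m, (size_t)mlen) != 0) return 4; /* plaintext mismatch */
    return 0;
}
```

In this sketch, return code 3 corresponds to the situation named in the report: encryption succeeded, but decrypting its own output was rejected.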

Test failure

Implementation: optimized_nonSSE
Security model: unknown
Compiler: icc -march=cannonlake -mtune=cannonlake -O2 -fomit-frame-pointer
error 111

Number of similar (compiler,implementation) pairs: 12, namely:
Compiler Implementations
icc -march=cannonlake -mtune=cannonlake -O2 -fomit-frame-pointer optimized_nonSSE
icc -march=cannonlake -mtune=cannonlake -O3 -fomit-frame-pointer optimized_nonSSE
icc -march=icelake-client -mtune=icelake-client -O2 -fomit-frame-pointer optimized_nonSSE
icc -march=icelake-client -mtune=icelake-client -O3 -fomit-frame-pointer optimized_nonSSE
icc -march=skylake-avx512 -mtune=skylake-avx512 -O2 -fomit-frame-pointer optimized_nonSSE
icc -march=skylake-avx512 -mtune=skylake-avx512 -O3 -fomit-frame-pointer optimized_nonSSE
icc -march=cannonlake -mtune=cannonlake -O2 -fomit-frame-pointer ref
icc -march=cannonlake -mtune=cannonlake -O3 -fomit-frame-pointer ref
icc -march=icelake-client -mtune=icelake-client -O2 -fomit-frame-pointer ref
icc -march=icelake-client -mtune=icelake-client -O3 -fomit-frame-pointer ref
icc -march=skylake-avx512 -mtune=skylake-avx512 -O2 -fomit-frame-pointer ref
icc -march=skylake-avx512 -mtune=skylake-avx512 -O3 -fomit-frame-pointer ref

Compiler output

Implementation: optimized_nonSSE
Security model: unknown
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
encrypt.c: encrypt.c:354:42: warning: unsequenced modification and access to 'i1' [-Wunsequenced]
encrypt.c: InternalState8[i1] = InternalState8[i1++] ^ ad[b+i];
encrypt.c: ~~ ^
encrypt.c: encrypt.c:468:68: warning: unsequenced modification and access to 'i1' [-Wunsequenced]
encrypt.c: c[CRYPTO_NSECBYTES+b+i] = InternalState8[i1] = InternalState8[i1++] ^ m[b+i];
encrypt.c: ~~ ^
encrypt.c: encrypt.c:622:42: warning: unsequenced modification and access to 'i1' [-Wunsequenced]
encrypt.c: InternalState8[i1] = InternalState8[i1++] ^ ad[b+i];
encrypt.c: ~~ ^
encrypt.c: 3 warnings generated.

Number of similar (compiler,implementation) pairs: 5, namely:
Compiler Implementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE optimized_nonSSE
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE optimized_nonSSE
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE optimized_nonSSE
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE optimized_nonSSE
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE optimized_nonSSE
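
The warnings above all flag the same pattern: `i1` is read as an index on the left side of the assignment and modified by `i1++` on the right, with nothing sequencing the two. In C this is undefined behavior, so different compilers may legitimately produce different results. The report does not say whether this is the cause of the test failures listed earlier. A minimal sketch of a well-defined rewrite follows, assuming the intent is to XOR one associated-data byte into the state byte at `i1` and then advance `i1`. The helper name `absorb_ad_bytes` and its parameters are hypothetical and do not appear in encrypt.c.

```c
#include <stddef.h>

/* Hypothetical helper (not from encrypt.c): XOR n bytes of ad, starting at
   offset b, into state starting at index i1. Returns the advanced index. */
static size_t absorb_ad_bytes(unsigned char *state, size_t i1,
                              const unsigned char *ad, size_t b, size_t n)
{
    size_t i;
    for (i = 0; i < n; i++) {
        state[i1] ^= ad[b + i];  /* read-modify-write of a single element */
        i1++;                    /* increment in its own statement */
    }
    return i1;
}
```

Separating the increment into its own statement removes the ambiguity about which index is used for the store and which for the load.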

Compiler output

Implementation: ref
Security model: unknown
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
encrypt.c: encrypt.c:240:42: warning: unsequenced modification and access to 'i1' [-Wunsequenced]
encrypt.c: InternalState8[i1] = InternalState8[i1++] ^ ad[b+i];
encrypt.c: ~~ ^
encrypt.c: encrypt.c:354:68: warning: unsequenced modification and access to 'i1' [-Wunsequenced]
encrypt.c: c[CRYPTO_NSECBYTES+b+i] = InternalState8[i1] = InternalState8[i1++] ^ m[b+i];
encrypt.c: ~~ ^
encrypt.c: encrypt.c:508:42: warning: unsequenced modification and access to 'i1' [-Wunsequenced]
encrypt.c: InternalState8[i1] = InternalState8[i1++] ^ ad[b+i];
encrypt.c: ~~ ^
encrypt.c: 3 warnings generated.

Number of similar (compiler,implementation) pairs: 5, namely:
Compiler Implementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE ref
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE ref
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE ref
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE ref
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE ref
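
The second warning in this output concerns a chained assignment, `c[...] = InternalState8[i1] = InternalState8[i1++] ^ m[b+i]`, where the same unsequenced use of `i1` also affects which byte ends up in the ciphertext. One way to make the order explicit is to compute the byte once into a temporary. The sketch below assumes the intent is to XOR a message byte into the state, copy the result to the ciphertext, and then advance `i1`. The helper name `xor_encrypt_bytes` and the parameter `c_off` (standing in for `CRYPTO_NSECBYTES+b`) are hypothetical.

```c
#include <stddef.h>

/* Hypothetical helper (not from encrypt.c): for each of n message bytes,
   XOR it into the state at i1, store the result in both the state and the
   ciphertext, then advance i1. Returns the advanced index. */
static size_t xor_encrypt_bytes(unsigned char *c, size_t c_off,
                                unsigned char *state, size_t i1,
                                const unsigned char *m, size_t b, size_t n)
{
    size_t i;
    for (i = 0; i < n; i++) {
        unsigned char t = (unsigned char)(state[i1] ^ m[b + i]);
        state[i1] = t;      /* update the state byte */
        c[c_off + i] = t;   /* emit the same byte as ciphertext */
        i1++;               /* advance only after both stores */
    }
    return i1;
}
```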