Implementation notes: amd64, genji262, crypto_stream/simon64128ctr

Computer: genji262
Architecture: amd64
CPU ID: AuthenticAMD-00800f12-178bfbff
SUPERCOP version: 20191017
Operation: crypto_stream
Primitive: simon64128ctr
Time | Object size | Test size | Implementation | Compiler | Benchmark date | SUPERCOP version
5676 | 90255 0 0 | 103545 800 856 | sse4 | gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20191024 | 20191017
5720 | 98288 0 0 | 115453 840 824 | avx2 | icc_-march=broadwell_-mtune=broadwell_-O2_-fomit-frame-pointer | 20191024 | 20191017
5720 | 98288 0 0 | 115453 840 824 | avx2 | icc_-march=core-avx2_-mtune=core-avx2_-O2_-fomit-frame-pointer | 20191024 | 20191017
5720 | 98296 0 0 | 113813 840 824 | avx2 | icc_-march=corei7_-mtune=corei7_-O2_-fomit-frame-pointer | 20191024 | 20191017
5720 | 98288 0 0 | 115453 840 824 | avx2 | icc_-march=haswell_-mtune=haswell_-O2_-fomit-frame-pointer | 20191024 | 20191017
5720 | 98288 0 0 | 115453 840 824 | avx2 | icc_-march=skylake_-mtune=skylake_-O2_-fomit-frame-pointer | 20191024 | 20191017
5786 | 101624 0 0 | 118437 840 824 | avx2 | icc_-march=core-avx-i_-mtune=core-avx-i_-O2_-fomit-frame-pointer | 20191024 | 20191017
5786 | 101624 0 0 | 118437 840 824 | avx2 | icc_-march=corei7-avx_-mtune=corei7-avx_-O2_-fomit-frame-pointer | 20191024 | 20191017
5786 | 101624 0 0 | 118437 840 824 | avx2 | icc_-march=ivybridge_-mtune=ivybridge_-O2_-fomit-frame-pointer | 20191024 | 20191017
5786 | 101624 0 0 | 118437 840 824 | avx2 | icc_-march=sandybridge_-mtune=sandybridge_-O2_-fomit-frame-pointer | 20191024 | 20191017
5808 | 122441 0 0 | 135769 800 856 | avx2 | gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20191024 | 20191017
5918 | 98672 0 0 | 115437 840 824 | avx2 | icc_-march=broadwell_-mtune=broadwell_-O3_-fomit-frame-pointer | 20191024 | 20191017
5918 | 98672 0 0 | 115437 840 824 | avx2 | icc_-march=core-avx2_-mtune=core-avx2_-O3_-fomit-frame-pointer | 20191024 | 20191017
5918 | 98696 0 0 | 114269 840 824 | avx2 | icc_-march=corei7_-mtune=corei7_-O3_-fomit-frame-pointer | 20191024 | 20191017
5918 | 98672 0 0 | 115437 840 824 | avx2 | icc_-march=haswell_-mtune=haswell_-O3_-fomit-frame-pointer | 20191024 | 20191017
5918 | 98672 0 0 | 115437 840 824 | avx2 | icc_-march=skylake_-mtune=skylake_-O3_-fomit-frame-pointer | 20191024 | 20191017
5962 | 101976 0 0 | 118821 840 824 | avx2 | icc_-march=ivybridge_-mtune=ivybridge_-O3_-fomit-frame-pointer | 20191024 | 20191017
5962 | 101976 0 0 | 118821 840 824 | avx2 | icc_-march=sandybridge_-mtune=sandybridge_-O3_-fomit-frame-pointer | 20191024 | 20191017
5984 | 101976 0 0 | 118821 840 824 | avx2 | icc_-march=core-avx-i_-mtune=core-avx-i_-O3_-fomit-frame-pointer | 20191024 | 20191017
5984 | 101976 0 0 | 118821 840 824 | avx2 | icc_-march=corei7-avx_-mtune=corei7-avx_-O3_-fomit-frame-pointer | 20191024 | 20191017
7524 | 89742 0 0 | 101428 792 856 | sse4 | gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20191024 | 20191017
8162 | 97794 0 0 | 109524 792 856 | avx2 | gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20191024 | 20191017
8162 | 92473 0 0 | 102868 776 824 | sse4 | gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20191024 | 20191017
8360 | 90442 0 0 | 101908 792 856 | sse4 | gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20191024 | 20191017
9064 | 98086 0 0 | 109580 792 856 | avx2 | gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20191024 | 20191017
9966 | 89336 0 0 | 106085 840 824 | sse4 | icc_-march=broadwell_-mtune=broadwell_-O3_-fomit-frame-pointer | 20191024 | 20191017
10032 | 90296 0 0 | 107077 840 824 | sse4 | icc_-march=corei7-avx_-mtune=corei7-avx_-O2_-fomit-frame-pointer | 20191024 | 20191017
10032 | 90296 0 0 | 107077 840 824 | sse4 | icc_-march=ivybridge_-mtune=ivybridge_-O2_-fomit-frame-pointer | 20191024 | 20191017
10032 | 90296 0 0 | 107077 840 824 | sse4 | icc_-march=sandybridge_-mtune=sandybridge_-O2_-fomit-frame-pointer | 20191024 | 20191017
10054 | 90296 0 0 | 107077 840 824 | sse4 | icc_-march=core-avx-i_-mtune=core-avx-i_-O2_-fomit-frame-pointer | 20191024 | 20191017
10076 | 90520 0 0 | 107333 840 824 | sse4 | icc_-march=corei7-avx_-mtune=corei7-avx_-O3_-fomit-frame-pointer | 20191024 | 20191017
10076 | 90520 0 0 | 107333 840 824 | sse4 | icc_-march=sandybridge_-mtune=sandybridge_-O3_-fomit-frame-pointer | 20191024 | 20191017
10098 | 90520 0 0 | 107333 840 824 | sse4 | icc_-march=core-avx-i_-mtune=core-avx-i_-O3_-fomit-frame-pointer | 20191024 | 20191017
10098 | 89336 0 0 | 106085 840 824 | sse4 | icc_-march=core-avx2_-mtune=core-avx2_-O3_-fomit-frame-pointer | 20191024 | 20191017
10098 | 89336 0 0 | 106085 840 824 | sse4 | icc_-march=haswell_-mtune=haswell_-O3_-fomit-frame-pointer | 20191024 | 20191017
10098 | 90520 0 0 | 107333 840 824 | sse4 | icc_-march=ivybridge_-mtune=ivybridge_-O3_-fomit-frame-pointer | 20191024 | 20191017
10098 | 89128 0 0 | 106277 840 824 | sse4 | icc_-march=skylake_-mtune=skylake_-O2_-fomit-frame-pointer | 20191024 | 20191017
10098 | 89336 0 0 | 106085 840 824 | sse4 | icc_-march=skylake_-mtune=skylake_-O3_-fomit-frame-pointer | 20191024 | 20191017
10120 | 89128 0 0 | 106277 840 824 | sse4 | icc_-march=broadwell_-mtune=broadwell_-O2_-fomit-frame-pointer | 20191024 | 20191017
10142 | 89128 0 0 | 106277 840 824 | sse4 | icc_-march=core-avx2_-mtune=core-avx2_-O2_-fomit-frame-pointer | 20191024 | 20191017
10274 | 118424 0 0 | 133909 840 824 | sse4 | icc_-march=corei7_-mtune=corei7_-O2_-fomit-frame-pointer | 20191024 | 20191017
10340 | 118376 0 0 | 133917 840 824 | sse4 | icc_-march=corei7_-mtune=corei7_-O3_-fomit-frame-pointer | 20191024 | 20191017
10736 | 89128 0 0 | 106277 840 824 | sse4 | icc_-march=haswell_-mtune=haswell_-O2_-fomit-frame-pointer | 20191024 | 20191017

Test failure

Implementation: avx2
Security model: unknown
Compiler: icc -march=cannonlake -mtune=cannonlake -O2 -fomit-frame-pointer
error 111

Number of similar (compiler,implementation) pairs: 12, namely:
Compiler Implementations
icc -march=cannonlake -mtune=cannonlake -O2 -fomit-frame-pointer avx2
icc -march=cannonlake -mtune=cannonlake -O3 -fomit-frame-pointer avx2
icc -march=icelake-client -mtune=icelake-client -O2 -fomit-frame-pointer avx2
icc -march=icelake-client -mtune=icelake-client -O3 -fomit-frame-pointer avx2
icc -march=skylake-avx512 -mtune=skylake-avx512 -O2 -fomit-frame-pointer avx2
icc -march=skylake-avx512 -mtune=skylake-avx512 -O3 -fomit-frame-pointer avx2
icc -march=cannonlake -mtune=cannonlake -O2 -fomit-frame-pointer sse4
icc -march=cannonlake -mtune=cannonlake -O3 -fomit-frame-pointer sse4
icc -march=icelake-client -mtune=icelake-client -O2 -fomit-frame-pointer sse4
icc -march=icelake-client -mtune=icelake-client -O3 -fomit-frame-pointer sse4
icc -march=skylake-avx512 -mtune=skylake-avx512 -O2 -fomit-frame-pointer sse4
icc -march=skylake-avx512 -mtune=skylake-avx512 -O3 -fomit-frame-pointer sse4

Compiler output

Implementation: avx2
Security model: unknown
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
stream.c: In file included from stream.c:20:
stream.c: In file included from ./Simon64128AVX2.h:17:
stream.c: In file included from ./Intrinsics_AVX2_64block.h:5:
stream.c: In file included from /software/compilers/gcc/gcc-9.2.0-full+isl+binutils/lib/gcc/x86_64-pc-linux-gnu/9.2.0/include/immintrin.h:27:
stream.c: /software/compilers/gcc/gcc-9.2.0-full+isl+binutils/lib/gcc/x86_64-pc-linux-gnu/9.2.0/include/mmintrin.h:55:72: warning: unknown attribute '__artificial__' ignored [-Wunknown-attributes]
stream.c: extern __inline void __attribute__((__gnu_inline__, __always_inline__, __artificial__))
stream.c: ^
stream.c: /software/compilers/gcc/gcc-9.2.0-full+isl+binutils/lib/gcc/x86_64-pc-linux-gnu/9.2.0/include/mmintrin.h:61:72: warning: unknown attribute '__artificial__' ignored [-Wunknown-attributes]
stream.c: extern __inline void __attribute__((__gnu_inline__, __always_inline__, __artificial__))
stream.c: ^
stream.c: /software/compilers/gcc/gcc-9.2.0-full+isl+binutils/lib/gcc/x86_64-pc-linux-gnu/9.2.0/include/mmintrin.h:68:74: warning: unknown attribute '__artificial__' ignored [-Wunknown-attributes]
stream.c: extern __inline __m64 __attribute__((__gnu_inline__, __always_inline__, __artificial__))
stream.c: ^
stream.c: /software/compilers/gcc/gcc-9.2.0-full+isl+binutils/lib/gcc/x86_64-pc-linux-gnu/9.2.0/include/mmintrin.h:74:74: warning: unknown attribute '__artificial__' ignored [-Wunknown-attributes]
stream.c: extern __inline __m64 __attribute__((__gnu_inline__, __always_inline__, __artificial__))
stream.c: ^
stream.c: /software/compilers/gcc/gcc-9.2.0-full+isl+binutils/lib/gcc/x86_64-pc-linux-gnu/9.2.0/include/mmintrin.h:84:74: warning: unknown attribute '__artificial__' ignored [-Wunknown-attributes]
stream.c: extern __inline __m64 __attribute__((__gnu_inline__, __always_inline__, __artificial__))
stream.c: ^
stream.c: /software/compilers/gcc/gcc-9.2.0-full+isl+binutils/lib/gcc/x86_64-pc-linux-gnu/9.2.0/include/mmintrin.h:90:74: warning: unknown attribute '__artificial__' ignored [-Wunknown-attributes]
stream.c: extern __inline __m64 __attribute__((__gnu_inline__, __always_inline__, __artificial__))
stream.c: ^
stream.c: /software/compilers/gcc/gcc-9.2.0-full+isl+binutils/lib/gcc/x86_64-pc-linux-gnu/9.2.0/include/mmintrin.h:97:74: warning: unknown attribute '__artificial__' ignored [-Wunknown-attributes]
stream.c: extern __inline __m64 __attribute__((__gnu_inline__, __always_inline__, __artificial__))
stream.c: ^
stream.c: ...

Number of similar (compiler,implementation) pairs: 5, namely:
Compiler Implementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE avx2
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE avx2
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE avx2
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE avx2
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE avx2

Compiler output

Implementation: avx2
Security model: unknown
Compiler: gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE
try.c: /software/compilers/gcc/gcc-9.2.0-full+isl+binutils/lib/gcc/x86_64-pc-linux-gnu/9.2.0/../../../../x86_64-pc-linux-gnu/bin/ld: crypto_stream_simon64128ctr.a(stream.o): in function `Encrypt':
try.c: stream.c:(.text+0x...): undefined reference to `Transpose'
try.c: /software/compilers/gcc/gcc-9.2.0-full+isl+binutils/lib/gcc/x86_64-pc-linux-gnu/9.2.0/../../../../x86_64-pc-linux-gnu/bin/ld: stream.c:(.text+0x...): undefined reference to `Transpose'
try.c: /software/compilers/gcc/gcc-9.2.0-full+isl+binutils/lib/gcc/x86_64-pc-linux-gnu/9.2.0/../../../../x86_64-pc-linux-gnu/bin/ld: stream.c:(.text+0x...): undefined reference to `Transpose'
try.c: /software/compilers/gcc/gcc-9.2.0-full+isl+binutils/lib/gcc/x86_64-pc-linux-gnu/9.2.0/../../../../x86_64-pc-linux-gnu/bin/ld: stream.c:(.text+0x...): undefined reference to `Transpose'
try.c: /software/compilers/gcc/gcc-9.2.0-full+isl+binutils/lib/gcc/x86_64-pc-linux-gnu/9.2.0/../../../../x86_64-pc-linux-gnu/bin/ld: crypto_stream_simon64128ctr.a(stream.o): in function `Encrypt_Xor':
try.c: stream.c:(.text+0x...): undefined reference to `Transpose'
try.c: /software/compilers/gcc/gcc-9.2.0-full+isl+binutils/lib/gcc/x86_64-pc-linux-gnu/9.2.0/../../../../x86_64-pc-linux-gnu/bin/ld: crypto_stream_simon64128ctr.a(stream.o):stream.c:(.text+0x...): more undefined references to `Transpose' follow
try.c: collect2: error: ld returned 1 exit status

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler Implementations
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE avx2
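
Note on the `Transpose' link failure: the log above does not show how Transpose is declared, so the cause cannot be confirmed from this page. One common cause of an undefined reference that appears only at -Os is a function defined with plain `inline` under C99 semantics: such a definition provides no external symbol, so the link fails whenever the compiler decides not to inline a call, which is more likely when optimizing for size. The sketch below assumes that situation. The names transpose2x2 and transposed_corner are hypothetical stand-ins, not taken from the simon64128ctr sources.

```c
#include <stdint.h>

/*
 * Hypothetical stand-in for a helper such as Transpose; this is NOT the
 * real simon64128ctr code, only an illustration of the linkage rule.
 *
 * Problematic form under C99/C11 inline semantics (left commented out):
 *
 *     inline void transpose2x2(uint32_t m[2][2]) { ... }
 *
 * That is an "inline definition" only: it emits no external symbol.
 * If the compiler declines to inline a call, the linker must find an
 * external definition elsewhere, and reports "undefined reference"
 * when none exists.
 */

/*
 * Robust form: `static inline` gives the function internal linkage, so
 * the compiler emits a local out-of-line copy whenever it chooses not
 * to inline, and the link succeeds at every optimization level.
 */
static inline void transpose2x2(uint32_t m[2][2])
{
    uint32_t t = m[0][1];   /* swap the two off-diagonal entries */
    m[0][1] = m[1][0];
    m[1][0] = t;
}

/*
 * Caller with external linkage, analogous in role to Encrypt in the
 * log above: it builds a 2x2 matrix, transposes it, and returns the
 * entry that now sits at row 0, column 1 (originally row 1, column 0).
 */
uint32_t transposed_corner(uint32_t a, uint32_t b, uint32_t c, uint32_t d)
{
    uint32_t m[2][2] = { { a, b }, { c, d } };
    transpose2x2(m);
    return m[0][1];
}
```

If this assumption holds for the avx2 implementation, two usual remedies are declaring the helper `static inline` as above, or keeping plain `inline` in the header and adding one `extern inline` declaration in a single .c file. Whether either applies here depends on the actual declaration in the implementation's headers.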

Compiler output

Implementation: sse4
Security model: unknown
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
stream.c: In file included from stream.c:20:
stream.c: In file included from ./Simon64128SSE4.h:17:
stream.c: In file included from ./Intrinsics_SSE4_64block.h:5:
stream.c: In file included from /software/compilers/gcc/gcc-9.2.0-full+isl+binutils/lib/gcc/x86_64-pc-linux-gnu/9.2.0/include/smmintrin.h:32:
stream.c: In file included from /software/compilers/gcc/gcc-9.2.0-full+isl+binutils/lib/gcc/x86_64-pc-linux-gnu/9.2.0/include/tmmintrin.h:31:
stream.c: In file included from /software/compilers/gcc/gcc-9.2.0-full+isl+binutils/lib/gcc/x86_64-pc-linux-gnu/9.2.0/include/pmmintrin.h:31:
stream.c: In file included from /software/compilers/gcc/gcc-9.2.0-full+isl+binutils/lib/gcc/x86_64-pc-linux-gnu/9.2.0/include/emmintrin.h:31:
stream.c: In file included from /software/compilers/gcc/gcc-9.2.0-full+isl+binutils/lib/gcc/x86_64-pc-linux-gnu/9.2.0/include/xmmintrin.h:31:
stream.c: /software/compilers/gcc/gcc-9.2.0-full+isl+binutils/lib/gcc/x86_64-pc-linux-gnu/9.2.0/include/mmintrin.h:55:72: warning: unknown attribute '__artificial__' ignored [-Wunknown-attributes]
stream.c: extern __inline void __attribute__((__gnu_inline__, __always_inline__, __artificial__))
stream.c: ^
stream.c: /software/compilers/gcc/gcc-9.2.0-full+isl+binutils/lib/gcc/x86_64-pc-linux-gnu/9.2.0/include/mmintrin.h:61:72: warning: unknown attribute '__artificial__' ignored [-Wunknown-attributes]
stream.c: extern __inline void __attribute__((__gnu_inline__, __always_inline__, __artificial__))
stream.c: ^
stream.c: /software/compilers/gcc/gcc-9.2.0-full+isl+binutils/lib/gcc/x86_64-pc-linux-gnu/9.2.0/include/mmintrin.h:68:74: warning: unknown attribute '__artificial__' ignored [-Wunknown-attributes]
stream.c: extern __inline __m64 __attribute__((__gnu_inline__, __always_inline__, __artificial__))
stream.c: ^
stream.c: /software/compilers/gcc/gcc-9.2.0-full+isl+binutils/lib/gcc/x86_64-pc-linux-gnu/9.2.0/include/mmintrin.h:74:74: warning: unknown attribute '__artificial__' ignored [-Wunknown-attributes]
stream.c: extern __inline __m64 __attribute__((__gnu_inline__, __always_inline__, __artificial__))
stream.c: ^
stream.c: /software/compilers/gcc/gcc-9.2.0-full+isl+binutils/lib/gcc/x86_64-pc-linux-gnu/9.2.0/include/mmintrin.h:84:74: warning: unknown attribute '__artificial__' ignored [-Wunknown-attributes]
stream.c: extern __inline __m64 __attribute__((__gnu_inline__, __always_inline__, __artificial__))
stream.c: ^
stream.c: /software/compilers/gcc/gcc-9.2.0-full+isl+binutils/lib/gcc/x86_64-pc-linux-gnu/9.2.0/include/mmintrin.h:90:74: warning: unknown attribute '__artificial__' ignored [-Wunknown-attributes]
stream.c: extern __inline __m64 __attribute__((__gnu_inline__, __always_inline__, __artificial__))
stream.c: ...

Number of similar (compiler,implementation) pairs: 5, namely:
Compiler Implementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE sse4
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE sse4
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE sse4
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE sse4
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE sse4