Implementation notes: x86, katana, crypto_stream/simon128192ctr

Computer: katana
Architecture: x86
CPU ID: GenuineIntel-000006f6-bfebfbff
SUPERCOP version: 20170105
Operation: crypto_stream
Primitive: simon128192ctr
Time   Object size  Test size  Implementation  Compiler                                                       Benchmark date  SUPERCOP version
27464  ? ? ?        ? ? ?      sse4            gcc_-m32_-march=corei7_-O2_-fomit-frame-pointer                20161016        20161009
27944  ? ? ?        ? ? ?      sse4            gcc_-m32_-march=core2_-Os_-fomit-frame-pointer                 20161016        20161009
27944  ? ? ?        ? ? ?      sse4            gcc_-m32_-march=corei7_-Os_-fomit-frame-pointer                20161016        20161009
27992  ? ? ?        ? ? ?      sse4            gcc_-m32_-march=core2_-msse4_-Os_-fomit-frame-pointer          20161016        20161009
27992  ? ? ?        ? ? ?      sse4            gcc_-m32_-march=native_-mtune=native_-Os_-fomit-frame-pointer  20161016        20161009
28216  ? ? ?        ? ? ?      sse4            gcc_-m32_-march=core2_-msse4.1_-Os_-fomit-frame-pointer        20161016        20161009
28808  ? ? ?        ? ? ?      sse4            gcc_-m32_-march=core2_-msse4_-O_-fomit-frame-pointer           20161016        20161009
28848  ? ? ?        ? ? ?      sse4            gcc_-m32_-march=native_-mtune=native_-O_-fomit-frame-pointer   20161016        20161009
28864  ? ? ?        ? ? ?      sse4            gcc_-m32_-march=core2_-O_-fomit-frame-pointer                  20161016        20161009
28864  ? ? ?        ? ? ?      sse4            gcc_-m32_-march=core2_-msse4.1_-O_-fomit-frame-pointer         20161016        20161009
28896  ? ? ?        ? ? ?      sse4            gcc_-m32_-march=corei7_-O_-fomit-frame-pointer                 20161016        20161009
29680  ? ? ?        ? ? ?      sse4            gcc_-m32_-march=core2_-O3_-fomit-frame-pointer                 20161016        20161009
29752  ? ? ?        ? ? ?      sse4            gcc_-m32_-march=native_-mtune=native_-O3_-fomit-frame-pointer  20161016        20161009
30168  ? ? ?        ? ? ?      sse4            gcc_-m32_-march=core2_-O2_-fomit-frame-pointer                 20161016        20161009
30192  ? ? ?        ? ? ?      sse4            gcc_-m32_-march=native_-mtune=native_-O2_-fomit-frame-pointer  20161016        20161009
30200  ? ? ?        ? ? ?      sse4            gcc_-m32_-march=core2_-msse4_-O2_-fomit-frame-pointer          20161016        20161009
30208  ? ? ?        ? ? ?      sse4            gcc_-m32_-march=core2_-msse4.1_-O2_-fomit-frame-pointer        20161016        20161009

Test failure

Implementation: crypto_stream/simon128192ctr/sse4
Compiler: gcc -m32 -march=core-avx-i -O2 -fomit-frame-pointer
error 111

Number of similar (compiler,implementation) pairs: 12, namely:
Compiler Implementations
gcc -m32 -march=core-avx-i -O2 -fomit-frame-pointer sse4
gcc -m32 -march=core-avx-i -O -fomit-frame-pointer sse4
gcc -m32 -march=core-avx-i -Os -fomit-frame-pointer sse4
gcc -m32 -march=core-avx2 -O2 -fomit-frame-pointer avx2 sse4
gcc -m32 -march=core-avx2 -O -fomit-frame-pointer avx2 sse4
gcc -m32 -march=core-avx2 -Os -fomit-frame-pointer avx2 sse4
gcc -m32 -march=corei7-avx -O2 -fomit-frame-pointer sse4
gcc -m32 -march=corei7-avx -O -fomit-frame-pointer sse4
gcc -m32 -march=corei7-avx -Os -fomit-frame-pointer sse4

Compiler output

Implementation: crypto_stream/simon128192ctr/avx2
Compiler: gcc -funroll-loops -m32 -O2 -fomit-frame-pointer
stream.c: stream.c: In function 'crypto_stream_simon128192ctr_avx2':
stream.c: stream.c:42:3: warning: implicit declaration of function 'ExpandKey' [-Wimplicit-function-declaration]
stream.c: ExpandKey(K,rk,key);
stream.c: ^
stream.c: stream.c:55:5: warning: implicit declaration of function 'Encrypt' [-Wimplicit-function-declaration]
stream.c: Encrypt(out,nonce,rk,key,320);
stream.c: ^
stream.c: stream.c: In function 'crypto_stream_simon128192ctr_avx2_xor':
stream.c: stream.c:191:5: warning: implicit declaration of function 'Encrypt_Xor' [-Wimplicit-function-declaration]
stream.c: Encrypt_Xor(out,in,nonce,rk,key,320);
stream.c: ^
stream.c: In file included from Simon128192AVX2.h:17:0,
stream.c: from stream.c:20:
stream.c: stream.c: In function 'Encrypt':
stream.c: Intrinsics_AVX2_128block.h:22:21: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
stream.c: #define SET1(X,c) (X=SET(c,c,c,c))
stream.c: ^
stream.c: stream.c:122:3: note: in expansion of macro 'SET1'
stream.c: SET1(X[0],nonce[1]);
stream.c: ^
stream.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/immintrin.h:41:0,
stream.c: from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:46,
stream.c: from Intrinsics_AVX2_128block.h:5,
stream.c: from Simon128192AVX2.h:17,
stream.c: from stream.c:20:
stream.c: ...

Number of similar (compiler,implementation) pairs: 157, namely:
Compiler Implementations
gcc -funroll-loops -m32 -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=athlon -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=athlon -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=athlon -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=athlon -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=barcelona -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=barcelona -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=barcelona -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=barcelona -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=i386 -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=i386 -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=i386 -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=i386 -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=i486 -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=i486 -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=i486 -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=i486 -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k6-2 -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k6-2 -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k6-2 -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k6-2 -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k6-3 -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k6-3 -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k6-3 -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k6-3 -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k6 -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k6 -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k6 -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k6 -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k8 -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k8 -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k8 -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k8 -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=nocona -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=nocona -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=nocona -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=nocona -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium-m -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium-m -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium-m -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium-m -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium-mmx -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium-mmx -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium-mmx -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium-mmx -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium2 -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium2 -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium2 -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium2 -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium3 -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium3 -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium3 -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium3 -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium4 -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium4 -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium4 -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium4 -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentiumpro -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentiumpro -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentiumpro -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentiumpro -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=prescott -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=prescott -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=prescott -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=prescott -Os -fomit-frame-pointer avx2
gcc -m32 -O2 -fomit-frame-pointer avx2
gcc -m32 -O3 -fomit-frame-pointer avx2
gcc -m32 -O -fomit-frame-pointer avx2
gcc -m32 -Os -fomit-frame-pointer avx2
gcc -m32 -march=athlon -O2 -fomit-frame-pointer avx2
gcc -m32 -march=athlon -O3 -fomit-frame-pointer avx2
gcc -m32 -march=athlon -O -fomit-frame-pointer avx2
gcc -m32 -march=athlon -Os -fomit-frame-pointer avx2
gcc -m32 -march=core2 -O2 -fomit-frame-pointer avx2
gcc -m32 -march=core2 -O3 -fomit-frame-pointer avx2
gcc -m32 -march=core2 -O -fomit-frame-pointer avx2
gcc -m32 -march=core2 -Os -fomit-frame-pointer avx2
gcc -m32 -march=core2 -msse4.1 -O2 -fomit-frame-pointer avx2
gcc -m32 -march=core2 -msse4.1 -O -fomit-frame-pointer avx2
gcc -m32 -march=core2 -msse4.1 -Os -fomit-frame-pointer avx2
gcc -m32 -march=core2 -msse4 -O2 -fomit-frame-pointer avx2
gcc -m32 -march=core2 -msse4 -O -fomit-frame-pointer avx2
gcc -m32 -march=core2 -msse4 -Os -fomit-frame-pointer avx2
gcc -m32 -march=corei7 -O2 -fomit-frame-pointer avx2
gcc -m32 -march=corei7 -O -fomit-frame-pointer avx2
gcc -m32 -march=corei7 -Os -fomit-frame-pointer avx2
gcc -m32 -march=i386 -O2 -fomit-frame-pointer avx2
gcc -m32 -march=i386 -O3 -fomit-frame-pointer avx2
gcc -m32 -march=i386 -O -fomit-frame-pointer avx2
gcc -m32 -march=i386 -Os -fomit-frame-pointer avx2
gcc -m32 -march=i486 -O2 -fomit-frame-pointer avx2
gcc -m32 -march=i486 -O3 -fomit-frame-pointer avx2
gcc -m32 -march=i486 -O -fomit-frame-pointer avx2
gcc -m32 -march=i486 -Os -fomit-frame-pointer avx2
gcc -m32 -march=k6-2 -O2 -fomit-frame-pointer avx2
gcc -m32 -march=k6-2 -O3 -fomit-frame-pointer avx2
gcc -m32 -march=k6-2 -O -fomit-frame-pointer avx2
gcc -m32 -march=k6-2 -Os -fomit-frame-pointer avx2
gcc -m32 -march=k6-3 -O2 -fomit-frame-pointer avx2
gcc -m32 -march=k6-3 -O3 -fomit-frame-pointer avx2
gcc -m32 -march=k6-3 -O -fomit-frame-pointer avx2
gcc -m32 -march=k6-3 -Os -fomit-frame-pointer avx2
gcc -m32 -march=k6 -O2 -fomit-frame-pointer avx2
gcc -m32 -march=k6 -O3 -fomit-frame-pointer avx2
gcc -m32 -march=k6 -O -fomit-frame-pointer avx2
gcc -m32 -march=k6 -Os -fomit-frame-pointer avx2
gcc -m32 -march=k8 -O2 -fomit-frame-pointer avx2
gcc -m32 -march=k8 -O3 -fomit-frame-pointer avx2
gcc -m32 -march=k8 -O -fomit-frame-pointer avx2
gcc -m32 -march=k8 -Os -fomit-frame-pointer avx2
gcc -m32 -march=native -mtune=native -O2 -fomit-frame-pointer avx2
gcc -m32 -march=native -mtune=native -O3 -fomit-frame-pointer avx2
gcc -m32 -march=native -mtune=native -O -fomit-frame-pointer avx2
gcc -m32 -march=native -mtune=native -Os -fomit-frame-pointer avx2
gcc -m32 -march=nocona -O2 -fomit-frame-pointer avx2
gcc -m32 -march=nocona -O3 -fomit-frame-pointer avx2
gcc -m32 -march=nocona -O -fomit-frame-pointer avx2
gcc -m32 -march=nocona -Os -fomit-frame-pointer avx2
gcc -m32 -march=pentium-m -O2 -fomit-frame-pointer avx2
gcc -m32 -march=pentium-m -O3 -fomit-frame-pointer avx2
gcc -m32 -march=pentium-m -O -fomit-frame-pointer avx2
gcc -m32 -march=pentium-m -Os -fomit-frame-pointer avx2
gcc -m32 -march=pentium-mmx -O2 -fomit-frame-pointer avx2
gcc -m32 -march=pentium-mmx -O3 -fomit-frame-pointer avx2
gcc -m32 -march=pentium-mmx -O -fomit-frame-pointer avx2
gcc -m32 -march=pentium-mmx -Os -fomit-frame-pointer avx2
gcc -m32 -march=pentium2 -O2 -fomit-frame-pointer avx2
gcc -m32 -march=pentium2 -O3 -fomit-frame-pointer avx2
gcc -m32 -march=pentium2 -O -fomit-frame-pointer avx2
gcc -m32 -march=pentium2 -Os -fomit-frame-pointer avx2
gcc -m32 -march=pentium3 -O2 -fomit-frame-pointer avx2
gcc -m32 -march=pentium3 -O3 -fomit-frame-pointer avx2
gcc -m32 -march=pentium3 -O -fomit-frame-pointer avx2
gcc -m32 -march=pentium3 -Os -fomit-frame-pointer avx2
gcc -m32 -march=pentium4 -O2 -fomit-frame-pointer avx2
gcc -m32 -march=pentium4 -O3 -fomit-frame-pointer avx2
gcc -m32 -march=pentium4 -O -fomit-frame-pointer avx2
gcc -m32 -march=pentium4 -Os -fomit-frame-pointer avx2
gcc -m32 -march=pentium -O2 -fomit-frame-pointer avx2
gcc -m32 -march=pentium -O3 -fomit-frame-pointer avx2
gcc -m32 -march=pentium -O -fomit-frame-pointer avx2
gcc -m32 -march=pentium -Os -fomit-frame-pointer avx2
gcc -m32 -march=pentiumpro -O2 -fomit-frame-pointer avx2
gcc -m32 -march=pentiumpro -O3 -fomit-frame-pointer avx2
gcc -m32 -march=pentiumpro -O -fomit-frame-pointer avx2
gcc -m32 -march=pentiumpro -Os -fomit-frame-pointer avx2
gcc -m32 -march=prescott -O2 -fomit-frame-pointer avx2
gcc -m32 -march=prescott -O3 -fomit-frame-pointer avx2
gcc -m32 -march=prescott -O -fomit-frame-pointer avx2
gcc -m32 -march=prescott -Os -fomit-frame-pointer avx2
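
The `-Wimplicit-function-declaration` warnings in the log above indicate that `stream.c` calls `ExpandKey`, `Encrypt`, and `Encrypt_Xor` before any prototype for them is in scope, so GCC 5 falls back to an implicit declaration and cannot check the arguments. A minimal sketch of the usual remedy follows: declare each function before its first use. The log does not show the real signatures, so the names `demo_expand_key` and `demo_first_round_key`, their parameter types, and the toy key schedule are assumptions made purely for illustration.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical prototypes. The real ExpandKey/Encrypt signatures are not
   visible in the log, so these types are illustrative assumptions only. */
void demo_expand_key(const uint8_t *key, uint64_t *round_keys, size_t rounds);
uint64_t demo_first_round_key(const uint8_t *key);

/* The caller is defined before the callee. Because the prototype above is
   already in scope, the compiler type-checks this call instead of
   emitting an implicit-declaration warning. */
uint64_t demo_first_round_key(const uint8_t *key)
{
    uint64_t rk[4];
    demo_expand_key(key, rk, 4);
    return rk[0];
}

/* Toy stand-in for a key schedule (NOT Simon): round key i = key[i] + i. */
void demo_expand_key(const uint8_t *key, uint64_t *round_keys, size_t rounds)
{
    for (size_t i = 0; i < rounds; i++) {
        round_keys[i] = (uint64_t)key[i] + i;
    }
}
```

In a real implementation the prototypes would more likely live in a shared header included by `stream.c` ahead of the first call.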

Compiler output

Implementation: crypto_stream/simon128192ctr/sse4
Compiler: gcc -funroll-loops -m32 -O2 -fomit-frame-pointer
stream.c: stream.c: In function 'crypto_stream_simon128192ctr_sse4':
stream.c: stream.c:42:3: warning: implicit declaration of function 'ExpandKey' [-Wimplicit-function-declaration]
stream.c: ExpandKey(K,rk,key);
stream.c: ^
stream.c: stream.c:55:5: warning: implicit declaration of function 'Encrypt' [-Wimplicit-function-declaration]
stream.c: Encrypt(out,nonce,rk,key,128);
stream.c: ^
stream.c: stream.c: In function 'crypto_stream_simon128192ctr_sse4_xor':
stream.c: stream.c:165:5: warning: implicit declaration of function 'Encrypt_Xor' [-Wimplicit-function-declaration]
stream.c: Encrypt_Xor(out,in,nonce,rk,key,128);
stream.c: ^
stream.c: In file included from Simon128192SSE4.h:17:0,
stream.c: from stream.c:20:
stream.c: stream.c: In function 'Encrypt':
stream.c: Intrinsics_SSE4_128block.h:22:21: warning: SSE vector return without SSE enabled changes the ABI [-Wpsabi]
stream.c: #define SET1(X,c) (X=SET(c,c))
stream.c: ^
stream.c: stream.c:102:3: note: in expansion of macro 'SET1'
stream.c: SET1(X[0],nonce[1]);
stream.c: ^
stream.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/xmmintrin.h:1249:0,
stream.c: from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:31,
stream.c: from Intrinsics_SSE4_128block.h:5,
stream.c: from Simon128192SSE4.h:17,
stream.c: from stream.c:20:
stream.c: ...

Number of similar (compiler,implementation) pairs: 88, namely:
Compiler Implementations
gcc -funroll-loops -m32 -O2 -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -O3 -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -O -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -Os -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=athlon -O2 -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=athlon -O3 -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=athlon -O -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=athlon -Os -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=i386 -O2 -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=i386 -O3 -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=i386 -O -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=i386 -Os -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=i486 -O2 -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=i486 -O3 -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=i486 -O -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=i486 -Os -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=k6-2 -O2 -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=k6-2 -O3 -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=k6-2 -O -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=k6-2 -Os -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=k6-3 -O2 -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=k6-3 -O3 -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=k6-3 -O -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=k6-3 -Os -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=k6 -O2 -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=k6 -O3 -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=k6 -O -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=k6 -Os -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=pentium-mmx -O2 -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=pentium-mmx -O3 -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=pentium-mmx -O -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=pentium-mmx -Os -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=pentium2 -O2 -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=pentium2 -O3 -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=pentium2 -O -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=pentium2 -Os -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=pentium -O2 -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=pentium -O3 -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=pentium -O -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=pentium -Os -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=pentiumpro -O2 -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=pentiumpro -O3 -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=pentiumpro -O -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=pentiumpro -Os -fomit-frame-pointer sse4
gcc -m32 -O2 -fomit-frame-pointer sse4
gcc -m32 -O3 -fomit-frame-pointer sse4
gcc -m32 -O -fomit-frame-pointer sse4
gcc -m32 -Os -fomit-frame-pointer sse4
gcc -m32 -march=athlon -O2 -fomit-frame-pointer sse4
gcc -m32 -march=athlon -O3 -fomit-frame-pointer sse4
gcc -m32 -march=athlon -O -fomit-frame-pointer sse4
gcc -m32 -march=athlon -Os -fomit-frame-pointer sse4
gcc -m32 -march=i386 -O2 -fomit-frame-pointer sse4
gcc -m32 -march=i386 -O3 -fomit-frame-pointer sse4
gcc -m32 -march=i386 -O -fomit-frame-pointer sse4
gcc -m32 -march=i386 -Os -fomit-frame-pointer sse4
gcc -m32 -march=i486 -O2 -fomit-frame-pointer sse4
gcc -m32 -march=i486 -O3 -fomit-frame-pointer sse4
gcc -m32 -march=i486 -O -fomit-frame-pointer sse4
gcc -m32 -march=i486 -Os -fomit-frame-pointer sse4
gcc -m32 -march=k6-2 -O2 -fomit-frame-pointer sse4
gcc -m32 -march=k6-2 -O3 -fomit-frame-pointer sse4
gcc -m32 -march=k6-2 -O -fomit-frame-pointer sse4
gcc -m32 -march=k6-2 -Os -fomit-frame-pointer sse4
gcc -m32 -march=k6-3 -O2 -fomit-frame-pointer sse4
gcc -m32 -march=k6-3 -O3 -fomit-frame-pointer sse4
gcc -m32 -march=k6-3 -O -fomit-frame-pointer sse4
gcc -m32 -march=k6-3 -Os -fomit-frame-pointer sse4
gcc -m32 -march=k6 -O2 -fomit-frame-pointer sse4
gcc -m32 -march=k6 -O3 -fomit-frame-pointer sse4
gcc -m32 -march=k6 -O -fomit-frame-pointer sse4
gcc -m32 -march=k6 -Os -fomit-frame-pointer sse4
gcc -m32 -march=pentium-mmx -O2 -fomit-frame-pointer sse4
gcc -m32 -march=pentium-mmx -O3 -fomit-frame-pointer sse4
gcc -m32 -march=pentium-mmx -O -fomit-frame-pointer sse4
gcc -m32 -march=pentium-mmx -Os -fomit-frame-pointer sse4
gcc -m32 -march=pentium2 -O2 -fomit-frame-pointer sse4
gcc -m32 -march=pentium2 -O3 -fomit-frame-pointer sse4
gcc -m32 -march=pentium2 -O -fomit-frame-pointer sse4
gcc -m32 -march=pentium2 -Os -fomit-frame-pointer sse4
gcc -m32 -march=pentium -O2 -fomit-frame-pointer sse4
gcc -m32 -march=pentium -O3 -fomit-frame-pointer sse4
gcc -m32 -march=pentium -O -fomit-frame-pointer sse4
gcc -m32 -march=pentium -Os -fomit-frame-pointer sse4
gcc -m32 -march=pentiumpro -O2 -fomit-frame-pointer sse4
gcc -m32 -march=pentiumpro -O3 -fomit-frame-pointer sse4
gcc -m32 -march=pentiumpro -O -fomit-frame-pointer sse4
gcc -m32 -march=pentiumpro -Os -fomit-frame-pointer sse4

Compiler output

Implementation: crypto_stream/simon128192ctr/sse4
Compiler: gcc -funroll-loops -m32 -march=barcelona -O2 -fomit-frame-pointer
stream.c: stream.c: In function 'crypto_stream_simon128192ctr_sse4':
stream.c: stream.c:42:3: warning: implicit declaration of function 'ExpandKey' [-Wimplicit-function-declaration]
stream.c: ExpandKey(K,rk,key);
stream.c: ^
stream.c: stream.c:55:5: warning: implicit declaration of function 'Encrypt' [-Wimplicit-function-declaration]
stream.c: Encrypt(out,nonce,rk,key,128);
stream.c: ^
stream.c: stream.c: In function 'crypto_stream_simon128192ctr_sse4_xor':
stream.c: stream.c:165:5: warning: implicit declaration of function 'Encrypt_Xor' [-Wimplicit-function-declaration]
stream.c: Encrypt_Xor(out,in,nonce,rk,key,128);
stream.c: ^
stream.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:37:0,
stream.c: from Intrinsics_SSE4_128block.h:5,
stream.c: from Simon128192SSE4.h:17,
stream.c: from stream.c:20:
stream.c: stream.c: In function 'Encrypt':
stream.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/tmmintrin.h:136:1: error: inlining failed in call to always_inline '_mm_shuffle_epi8': target specific option mismatch
stream.c: _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
stream.c: ^
stream.c: In file included from Simon128192SSE4.h:17:0,
stream.c: from stream.c:20:
stream.c: Intrinsics_SSE4_128block.h:15:13: error: called from here
stream.c: #define XOR _mm_xor_si128
stream.c: ^
stream.c: Simon128192SSE4.h:23:42: note: in expansion of macro 'XOR'
stream.c: ...

Number of similar (compiler,implementation) pairs: 44, namely:
Compiler Implementations
gcc -funroll-loops -m32 -march=barcelona -O2 -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=barcelona -O3 -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=barcelona -O -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=barcelona -Os -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=k8 -O2 -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=k8 -O3 -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=k8 -O -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=k8 -Os -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=nocona -O2 -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=nocona -O3 -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=nocona -O -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=nocona -Os -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=pentium-m -O2 -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=pentium-m -O3 -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=pentium-m -O -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=pentium-m -Os -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=pentium4 -O2 -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=pentium4 -O3 -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=pentium4 -O -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=pentium4 -Os -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=prescott -O2 -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=prescott -O3 -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=prescott -O -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=prescott -Os -fomit-frame-pointer sse4
gcc -m32 -march=k8 -O2 -fomit-frame-pointer sse4
gcc -m32 -march=k8 -O3 -fomit-frame-pointer sse4
gcc -m32 -march=k8 -O -fomit-frame-pointer sse4
gcc -m32 -march=k8 -Os -fomit-frame-pointer sse4
gcc -m32 -march=nocona -O2 -fomit-frame-pointer sse4
gcc -m32 -march=nocona -O3 -fomit-frame-pointer sse4
gcc -m32 -march=nocona -O -fomit-frame-pointer sse4
gcc -m32 -march=nocona -Os -fomit-frame-pointer sse4
gcc -m32 -march=pentium-m -O2 -fomit-frame-pointer sse4
gcc -m32 -march=pentium-m -O3 -fomit-frame-pointer sse4
gcc -m32 -march=pentium-m -O -fomit-frame-pointer sse4
gcc -m32 -march=pentium-m -Os -fomit-frame-pointer sse4
gcc -m32 -march=pentium4 -O2 -fomit-frame-pointer sse4
gcc -m32 -march=pentium4 -O3 -fomit-frame-pointer sse4
gcc -m32 -march=pentium4 -O -fomit-frame-pointer sse4
gcc -m32 -march=pentium4 -Os -fomit-frame-pointer sse4
gcc -m32 -march=prescott -O2 -fomit-frame-pointer sse4
gcc -m32 -march=prescott -O3 -fomit-frame-pointer sse4
gcc -m32 -march=prescott -O -fomit-frame-pointer sse4
gcc -m32 -march=prescott -Os -fomit-frame-pointer sse4
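
The `target specific option mismatch` error above arises because `_mm_shuffle_epi8` is an SSSE3 intrinsic declared `always_inline`, while the `-march` values in this group (barcelona, k8, nocona, pentium-m, pentium4, prescott) do not enable SSSE3. One possible workaround, sketched below, is to enable SSSE3 for a single function with GCC's `target` attribute and to select that function at run time with `__builtin_cpu_supports`. This is not how the benchmarked implementation is written; `reverse16`, `reverse16_ssse3`, and `reverse16_portable` are hypothetical names, and the byte reversal is only a stand-in for real cipher code.

```c
#include <stdint.h>

#if defined(__GNUC__) && (defined(__x86_64__) || defined(__i386__))
#include <tmmintrin.h>

/* Hypothetical helper: reverses 16 bytes with the SSSE3 byte shuffle.
   The target attribute enables SSSE3 for this function only, so the
   always_inline intrinsic can be inlined regardless of -march. */
__attribute__((target("ssse3")))
static void reverse16_ssse3(uint8_t out[16], const uint8_t in[16])
{
    /* Byte i of the result is taken from input byte 15 - i. */
    const __m128i idx = _mm_set_epi8(0, 1, 2, 3, 4, 5, 6, 7,
                                     8, 9, 10, 11, 12, 13, 14, 15);
    __m128i v = _mm_loadu_si128((const __m128i *)in);
    _mm_storeu_si128((__m128i *)out, _mm_shuffle_epi8(v, idx));
}
#define HAVE_SSSE3_PATH 1
#else
#define HAVE_SSSE3_PATH 0
#endif

/* Scalar fallback with the same result; out and in must not overlap. */
static void reverse16_portable(uint8_t out[16], const uint8_t in[16])
{
    for (int i = 0; i < 16; i++) {
        out[i] = in[15 - i];
    }
}

/* Dispatcher: uses the SSSE3 path only when the running CPU reports it. */
void reverse16(uint8_t out[16], const uint8_t in[16])
{
#if HAVE_SSSE3_PATH
    if (__builtin_cpu_supports("ssse3")) {
        reverse16_ssse3(out, in);
        return;
    }
#endif
    reverse16_portable(out, in);
}
```

The simpler alternative is to compile the whole file with a flag that enables SSSE3 (for example `-mssse3`), at the cost of producing a binary that may fault on CPUs without that extension.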

Compiler output

Implementation: crypto_stream/simon128192ctr/sse4
Compiler: gcc -funroll-loops -m32 -march=pentium3 -O2 -fomit-frame-pointer
stream.c: stream.c: In function 'crypto_stream_simon128192ctr_sse4':
stream.c: stream.c:42:3: warning: implicit declaration of function 'ExpandKey' [-Wimplicit-function-declaration]
stream.c: ExpandKey(K,rk,key);
stream.c: ^
stream.c: stream.c:55:5: warning: implicit declaration of function 'Encrypt' [-Wimplicit-function-declaration]
stream.c: Encrypt(out,nonce,rk,key,128);
stream.c: ^
stream.c: stream.c: In function 'crypto_stream_simon128192ctr_sse4_xor':
stream.c: stream.c:165:5: warning: implicit declaration of function 'Encrypt_Xor' [-Wimplicit-function-declaration]
stream.c: Encrypt_Xor(out,in,nonce,rk,key,128);
stream.c: ^
stream.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/xmmintrin.h:1249:0,
stream.c: from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:31,
stream.c: from Intrinsics_SSE4_128block.h:5,
stream.c: from Simon128192SSE4.h:17,
stream.c: from stream.c:20:
stream.c: stream.c: In function 'Encrypt':
stream.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/emmintrin.h:585:1: error: inlining failed in call to always_inline '_mm_set_epi64x': target specific option mismatch
stream.c: _mm_set_epi64x (long long __q1, long long __q0)
stream.c: ^
stream.c: In file included from Simon128192SSE4.h:17:0,
stream.c: from stream.c:20:
stream.c: Intrinsics_SSE4_128block.h:21:13: error: called from here
stream.c: #define SET _mm_set_epi64x
stream.c: ^
stream.c: ...

Number of similar (compiler,implementation) pairs: 8, namely:
Compiler Implementations
gcc -funroll-loops -m32 -march=pentium3 -O2 -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=pentium3 -O3 -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=pentium3 -O -fomit-frame-pointer sse4
gcc -funroll-loops -m32 -march=pentium3 -Os -fomit-frame-pointer sse4
gcc -m32 -march=pentium3 -O2 -fomit-frame-pointer sse4
gcc -m32 -march=pentium3 -O3 -fomit-frame-pointer sse4
gcc -m32 -march=pentium3 -O -fomit-frame-pointer sse4
gcc -m32 -march=pentium3 -Os -fomit-frame-pointer sse4
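
With `-march=pentium3` the failure occurs earlier than in the previous group: `_mm_set_epi64x` is an SSE2 intrinsic, and pentium3 enables SSE but not SSE2. A compile-time guard can turn such failures into a clear, early diagnostic. The sketch below relies on the macros `__SSE2__`, `__SSSE3__`, and `__AVX2__`, which GCC and Clang predefine when the corresponding instruction sets are enabled; the enum and function names are hypothetical and are not part of SUPERCOP or of the benchmarked code.

```c
/* SIMD levels relevant to this report, ordered from least to most capable. */
enum simd_level {
    SIMD_BELOW_SSE2 = 0,
    SIMD_SSE2,
    SIMD_SSSE3,
    SIMD_AVX2
};

/* Returns the highest level enabled by the compiler flags, judged from
   the macros that GCC and Clang predefine (e.g. -mssse3 sets __SSSE3__). */
enum simd_level compiled_simd_level(void)
{
#if defined(__AVX2__)
    return SIMD_AVX2;
#elif defined(__SSSE3__)
    return SIMD_SSSE3;
#elif defined(__SSE2__)
    return SIMD_SSE2;
#else
    return SIMD_BELOW_SSE2;
#endif
}

/* Hypothetical helper: nonzero when both intrinsics named in the errors
   (_mm_set_epi64x from SSE2, _mm_shuffle_epi8 from SSSE3) can be inlined
   under the current flags. */
int sse4_variant_can_inline(void)
{
    return compiled_simd_level() >= SIMD_SSSE3;
}
```

The same macros could instead drive an `#error` directive at the top of a source file, so that an unsupported flag combination is rejected with one readable message.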

Compiler output

Implementation: crypto_stream/simon128192ctr/avx2
Compiler: gcc -m32 -march=barcelona -O2 -fomit-frame-pointer
stream.c: stream.c: In function 'crypto_stream_simon128192ctr_avx2':
stream.c: stream.c:42:3: warning: implicit declaration of function 'ExpandKey' [-Wimplicit-function-declaration]
stream.c: ExpandKey(K,rk,key);
stream.c: ^
stream.c: stream.c:55:5: warning: implicit declaration of function 'Encrypt' [-Wimplicit-function-declaration]
stream.c: Encrypt(out,nonce,rk,key,320);
stream.c: ^
stream.c: stream.c: In function 'crypto_stream_simon128192ctr_avx2_xor':
stream.c: stream.c:191:5: warning: implicit declaration of function 'Encrypt_Xor' [-Wimplicit-function-declaration]
stream.c: Encrypt_Xor(out,in,nonce,rk,key,320);
stream.c: ^
stream.c: In file included from Simon128192AVX2.h:17:0,
stream.c: from stream.c:20:
stream.c: stream.c: In function 'Encrypt':
stream.c: Intrinsics_AVX2_128block.h:22:21: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
stream.c: #define SET1(X,c) (X=SET(c,c,c,c))
stream.c: ^
stream.c: stream.c:122:3: note: in expansion of macro 'SET1'
stream.c: SET1(X[0],nonce[1]);
stream.c: ^
stream.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/immintrin.h:41:0,
stream.c: from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:46,
stream.c: from Intrinsics_AVX2_128block.h:5,
stream.c: from Simon128192AVX2.h:17,
stream.c: from stream.c:20:
stream.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
Compiler Implementations
gcc -m32 -march=barcelona -O2 -fomit-frame-pointer avx2
gcc -m32 -march=barcelona -O3 -fomit-frame-pointer avx2
gcc -m32 -march=barcelona -O -fomit-frame-pointer avx2
gcc -m32 -march=barcelona -Os -fomit-frame-pointer avx2

Compiler output

Implementation: crypto_stream/simon128192ctr/sse4
Compiler: gcc -m32 -march=barcelona -O2 -fomit-frame-pointer
stream.c: stream.c: In function 'crypto_stream_simon128192ctr_sse4':
stream.c: stream.c:42:3: warning: implicit declaration of function 'ExpandKey' [-Wimplicit-function-declaration]
stream.c: ExpandKey(K,rk,key);
stream.c: ^
stream.c: stream.c:55:5: warning: implicit declaration of function 'Encrypt' [-Wimplicit-function-declaration]
stream.c: Encrypt(out,nonce,rk,key,128);
stream.c: ^
stream.c: stream.c: In function 'crypto_stream_simon128192ctr_sse4_xor':
stream.c: stream.c:165:5: warning: implicit declaration of function 'Encrypt_Xor' [-Wimplicit-function-declaration]
stream.c: Encrypt_Xor(out,in,nonce,rk,key,128);
stream.c: ^
stream.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:37:0,
stream.c: from Intrinsics_SSE4_128block.h:5,
stream.c: from Simon128192SSE4.h:17,
stream.c: from stream.c:20:
stream.c: stream.c: In function 'Encrypt':
stream.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/tmmintrin.h:136:1: error: inlining failed in call to always_inline '_mm_shuffle_epi8': target specific option mismatch
stream.c: _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
stream.c: ^
stream.c: In file included from Simon128192SSE4.h:17:0,
stream.c: from stream.c:20:
stream.c: Intrinsics_SSE4_128block.h:15:13: error: called from here
stream.c: #define XOR _mm_xor_si128
stream.c: ^
stream.c: Simon128192SSE4.h:23:42: note: in expansion of macro 'XOR'
stream.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
Compiler Implementations
gcc -m32 -march=barcelona -O2 -fomit-frame-pointer sse4
gcc -m32 -march=barcelona -O3 -fomit-frame-pointer sse4
gcc -m32 -march=barcelona -O -fomit-frame-pointer sse4
gcc -m32 -march=barcelona -Os -fomit-frame-pointer sse4
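
Note: `_mm_shuffle_epi8` is an SSSE3 intrinsic, and `-march=barcelona` does not enable SSSE3, which is consistent with the "target specific option mismatch" errors reported for these compilers. One possible way to keep a source file compiling under every option set is a feature-macro guard with a scalar fallback. The sketch below is illustrative only: `reverse16` is a hypothetical helper that does not appear in the sse4 implementation, and the benchmarked code may be organized quite differently.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

#if defined(__SSSE3__)
#include <tmmintrin.h>  /* SSSE3 header that declares _mm_shuffle_epi8 */
#endif

/* Hypothetical helper (not from the benchmarked code):
 * writes the 16 bytes of `in` to `out` in reverse order.
 * `out` and `in` may point to the same buffer. */
void reverse16(uint8_t out[16], const uint8_t in[16])
{
    assert(out != NULL && in != NULL);
#if defined(__SSSE3__)
    /* Compiled only when the compiler options enable SSSE3.
     * Index byte i holds 15 - i, so result byte i = input byte 15 - i. */
    const __m128i idx = _mm_set_epi8(0, 1, 2, 3, 4, 5, 6, 7,
                                     8, 9, 10, 11, 12, 13, 14, 15);
    __m128i v = _mm_loadu_si128((const __m128i *)in);
    _mm_storeu_si128((__m128i *)out, _mm_shuffle_epi8(v, idx));
#else
    /* Portable fallback for targets without SSSE3 (e.g. -march=barcelona). */
    uint8_t tmp[16];
    for (int i = 0; i < 16; i++)
        tmp[i] = in[15 - i];
    memcpy(out, tmp, 16);
#endif
}
```

With a guard of this kind, the same file builds under `-march=barcelona` (fallback path) and under `-march=core2` (intrinsic path); whether a fallback is acceptable for a benchmarked implementation is a separate design decision.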

Compiler output

Implementation: crypto_stream/simon128192ctr/avx2
Compiler: gcc -m32 -march=core-avx-i -O2 -fomit-frame-pointer
stream.c: stream.c: In function 'crypto_stream_simon128192ctr_avx2':
stream.c: stream.c:42:3: warning: implicit declaration of function 'ExpandKey' [-Wimplicit-function-declaration]
stream.c: ExpandKey(K,rk,key);
stream.c: ^
stream.c: stream.c:55:5: warning: implicit declaration of function 'Encrypt' [-Wimplicit-function-declaration]
stream.c: Encrypt(out,nonce,rk,key,320);
stream.c: ^
stream.c: stream.c: In function 'crypto_stream_simon128192ctr_avx2_xor':
stream.c: stream.c:191:5: warning: implicit declaration of function 'Encrypt_Xor' [-Wimplicit-function-declaration]
stream.c: Encrypt_Xor(out,in,nonce,rk,key,320);
stream.c: ^
stream.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/immintrin.h:43:0,
stream.c: from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:46,
stream.c: from Intrinsics_AVX2_128block.h:5,
stream.c: from Simon128192AVX2.h:17,
stream.c: from stream.c:20:
stream.c: stream.c: In function 'Encrypt':
stream.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/avx2intrin.h:913:1: error: inlining failed in call to always_inline '_mm256_xor_si256': target specific option mismatch
stream.c: _mm256_xor_si256 (__m256i __A, __m256i __B)
stream.c: ^
stream.c: In file included from Simon128192AVX2.h:17:0,
stream.c: from stream.c:20:
stream.c: Intrinsics_AVX2_128block.h:15:13: error: called from here
stream.c: #define XOR _mm256_xor_si256
stream.c: ^
stream.c: ...

Number of similar (compiler,implementation) pairs: 6, namely:
Compiler Implementations
gcc -m32 -march=core-avx-i -O2 -fomit-frame-pointer avx2
gcc -m32 -march=core-avx-i -O -fomit-frame-pointer avx2
gcc -m32 -march=core-avx-i -Os -fomit-frame-pointer avx2
gcc -m32 -march=corei7-avx -O2 -fomit-frame-pointer avx2
gcc -m32 -march=corei7-avx -O -fomit-frame-pointer avx2
gcc -m32 -march=corei7-avx -Os -fomit-frame-pointer avx2
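
Note: `-march=core-avx-i` and `-march=corei7-avx` enable AVX but not AVX2, while `_mm256_xor_si256` requires AVX2, which matches the errors listed for these six compilers. An alternative to relying on `-march` is to mark only the vector routine with a per-function target attribute and select it at run time. The following is a minimal sketch assuming GCC or Clang on x86; `xor_bytes`, `xor_bytes_avx2` and `xor_bytes_scalar` are hypothetical names, and this is a different technique from the one used by the avx2 implementation, which expects AVX2 to be enabled on the command line.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#if defined(__GNUC__) && (defined(__x86_64__) || defined(__i386__))
#include <immintrin.h>
#define HAVE_X86_DISPATCH 1
#else
#define HAVE_X86_DISPATCH 0
#endif

/* Portable byte-wise XOR, used when AVX2 is unavailable. */
static void xor_bytes_scalar(uint8_t *dst, const uint8_t *a,
                             const uint8_t *b, size_t n)
{
    for (size_t i = 0; i < n; i++)
        dst[i] = a[i] ^ b[i];
}

#if HAVE_X86_DISPATCH
/* The target attribute enables AVX2 for this one function, so the
 * always_inline intrinsics can be inlined even if -march lacks AVX2. */
static __attribute__((target("avx2")))
void xor_bytes_avx2(uint8_t *dst, const uint8_t *a,
                    const uint8_t *b, size_t n)
{
    size_t i = 0;
    for (; i + 32 <= n; i += 32) {          /* 32 bytes per iteration */
        __m256i x = _mm256_loadu_si256((const __m256i *)(a + i));
        __m256i y = _mm256_loadu_si256((const __m256i *)(b + i));
        _mm256_storeu_si256((__m256i *)(dst + i), _mm256_xor_si256(x, y));
    }
    for (; i < n; i++)                      /* remaining tail bytes */
        dst[i] = a[i] ^ b[i];
}
#endif

/* Hypothetical entry point: picks the AVX2 routine only when the CPU
 * running the program reports AVX2 support. */
void xor_bytes(uint8_t *dst, const uint8_t *a, const uint8_t *b, size_t n)
{
    assert(n == 0 || (dst != NULL && a != NULL && b != NULL));
#if HAVE_X86_DISPATCH
    if (__builtin_cpu_supports("avx2")) {
        xor_bytes_avx2(dst, a, b, n);
        return;
    }
#endif
    xor_bytes_scalar(dst, a, b, n);
}
```

On a machine without AVX2 the call falls through to the scalar loop, so the result is the same either way; only the speed differs.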

Compiler output

Implementation: crypto_stream/simon128192ctr/sse4
Compiler: gcc -m32 -march=core-avx-i -O2 -fomit-frame-pointer
stream.c: stream.c: In function 'crypto_stream_simon128192ctr_sse4':
stream.c: stream.c:42:3: warning: implicit declaration of function 'ExpandKey' [-Wimplicit-function-declaration]
stream.c: ExpandKey(K,rk,key);
stream.c: ^
stream.c: stream.c:55:5: warning: implicit declaration of function 'Encrypt' [-Wimplicit-function-declaration]
stream.c: Encrypt(out,nonce,rk,key,128);
stream.c: ^
stream.c: stream.c: In function 'crypto_stream_simon128192ctr_sse4_xor':
stream.c: stream.c:165:5: warning: implicit declaration of function 'Encrypt_Xor' [-Wimplicit-function-declaration]
stream.c: Encrypt_Xor(out,in,nonce,rk,key,128);
stream.c: ^

Number of similar (compiler,implementation) pairs: 26, namely:
Compiler Implementations
gcc -m32 -march=core-avx-i -O2 -fomit-frame-pointer sse4
gcc -m32 -march=core-avx-i -O -fomit-frame-pointer sse4
gcc -m32 -march=core-avx-i -Os -fomit-frame-pointer sse4
gcc -m32 -march=core-avx2 -O2 -fomit-frame-pointer sse4
gcc -m32 -march=core-avx2 -O -fomit-frame-pointer sse4
gcc -m32 -march=core-avx2 -Os -fomit-frame-pointer sse4
gcc -m32 -march=core2 -O2 -fomit-frame-pointer sse4
gcc -m32 -march=core2 -O3 -fomit-frame-pointer sse4
gcc -m32 -march=core2 -O -fomit-frame-pointer sse4
gcc -m32 -march=core2 -Os -fomit-frame-pointer sse4
gcc -m32 -march=core2 -msse4.1 -O2 -fomit-frame-pointer sse4
gcc -m32 -march=core2 -msse4.1 -O -fomit-frame-pointer sse4
gcc -m32 -march=core2 -msse4.1 -Os -fomit-frame-pointer sse4
gcc -m32 -march=core2 -msse4 -O2 -fomit-frame-pointer sse4
gcc -m32 -march=core2 -msse4 -O -fomit-frame-pointer sse4
gcc -m32 -march=core2 -msse4 -Os -fomit-frame-pointer sse4
gcc -m32 -march=corei7-avx -O2 -fomit-frame-pointer sse4
gcc -m32 -march=corei7-avx -O -fomit-frame-pointer sse4
gcc -m32 -march=corei7-avx -Os -fomit-frame-pointer sse4
gcc -m32 -march=corei7 -O2 -fomit-frame-pointer sse4
gcc -m32 -march=corei7 -O -fomit-frame-pointer sse4
gcc -m32 -march=corei7 -Os -fomit-frame-pointer sse4
gcc -m32 -march=native -mtune=native -O2 -fomit-frame-pointer sse4
gcc -m32 -march=native -mtune=native -O3 -fomit-frame-pointer sse4
gcc -m32 -march=native -mtune=native -O -fomit-frame-pointer sse4
gcc -m32 -march=native -mtune=native -Os -fomit-frame-pointer sse4

Compiler output

Implementation: crypto_stream/simon128192ctr/avx2
Compiler: gcc -m32 -march=core-avx2 -O2 -fomit-frame-pointer
stream.c: stream.c: In function 'crypto_stream_simon128192ctr_avx2':
stream.c: stream.c:42:3: warning: implicit declaration of function 'ExpandKey' [-Wimplicit-function-declaration]
stream.c: ExpandKey(K,rk,key);
stream.c: ^
stream.c: stream.c:55:5: warning: implicit declaration of function 'Encrypt' [-Wimplicit-function-declaration]
stream.c: Encrypt(out,nonce,rk,key,320);
stream.c: ^
stream.c: stream.c: In function 'crypto_stream_simon128192ctr_avx2_xor':
stream.c: stream.c:191:5: warning: implicit declaration of function 'Encrypt_Xor' [-Wimplicit-function-declaration]
stream.c: Encrypt_Xor(out,in,nonce,rk,key,320);
stream.c: ^

Number of similar (compiler,implementation) pairs: 3, namely:
Compiler Implementations
gcc -m32 -march=core-avx2 -O2 -fomit-frame-pointer avx2
gcc -m32 -march=core-avx2 -O -fomit-frame-pointer avx2
gcc -m32 -march=core-avx2 -Os -fomit-frame-pointer avx2
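
Note: the `-Wimplicit-function-declaration` warnings indicate that `ExpandKey`, `Encrypt` and `Encrypt_Xor` are called in stream.c before any declaration is visible at the call site. A common remedy is to place prototypes ahead of the first call. The sketch below shows that pattern with hypothetical stand-ins (`toy_expand_key`, `toy_encrypt`, `toy_stream`); the real signatures are not shown in this log, and the toy routines are placeholders, not Simon and not a secure cipher.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Prototypes placed before the first call, so the compiler checks
 * argument and return types instead of assuming an implicit declaration. */
static void toy_expand_key(uint64_t rk[4], const uint8_t key[8]);
static void toy_encrypt(uint8_t *out, size_t len, const uint64_t rk[4]);

/* Hypothetical caller, standing in for a crypto_stream-style function. */
int toy_stream(uint8_t *out, size_t len, const uint8_t key[8])
{
    uint64_t rk[4];
    assert(len == 0 || out != NULL);
    toy_expand_key(rk, key);      /* declared above, defined below */
    toy_encrypt(out, len, rk);
    return 0;
}

/* Placeholder key schedule: loads the key little-endian and adds j. */
static void toy_expand_key(uint64_t rk[4], const uint8_t key[8])
{
    uint64_t k = 0;
    for (int i = 7; i >= 0; i--)
        k = (k << 8) | key[i];
    for (int j = 0; j < 4; j++)
        rk[j] = k + (uint64_t)j;
}

/* Placeholder output routine: emits bytes taken from the round keys. */
static void toy_encrypt(uint8_t *out, size_t len, const uint64_t rk[4])
{
    for (size_t i = 0; i < len; i++)
        out[i] = (uint8_t)(rk[i & 3] >> (8 * ((i >> 2) & 7)));
}
```

An equivalent arrangement is to define the helper functions above their first use, or to declare them in a header that stream.c includes.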