Implementation notes: x86, gpu, crypto_sign/sphincss256shake256robust

Computer: gpu
Architecture: x86
CPU ID: GenuineIntel-000206d7-bfebfbff
SUPERCOP version: 20190816
Operation: crypto_sign
Primitive: sphincss256shake256robust
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
5146532769031038 0 055356 420 1560refgcc_-funroll-loops_-m32_-march=i486_-O_-fomit-frame-pointer2019100720190816
5795199676257988 0 082260 420 1560refgcc_-m32_-march=core2_-O3_-fomit-frame-pointer2019100520190816
5851346228454669 0 079260 420 1560refgcc_-m32_-march=core-avx-i_-O3_-fomit-frame-pointer2019100620190816
5903177820674537 0 0102244 416 1560refgcc_-funroll-loops_-m32_-march=pentium-m_-O3_-fomit-frame-pointer2019100620190816
5915651992556516 0 080820 420 1560refgcc_-m32_-march=corei7_-O3_-fomit-frame-pointer2019100520190816
5999409878761789 0 086904 416 1560refgcc_-m32_-march=pentium-m_-O3_-fomit-frame-pointer2019100620190816
6008376727856452 0 080692 420 1560refgcc_-m32_-march=core2_-msse4.1_-O3_-fomit-frame-pointer2019100520190816
6034251169863037 0 088404 416 1560refgcc_-funroll-loops_-m32_-march=prescott_-O3_-fomit-frame-pointer2019100620190816
6077256123056452 0 080756 420 1560refgcc_-m32_-march=core2_-msse4_-O3_-fomit-frame-pointer2019100520190816
6257906081129226 0 054136 416 1560refgcc_-funroll-loops_-m32_-march=prescott_-O2_-fomit-frame-pointer2019100620190816
6318295680054669 0 079260 420 1560refgcc_-m32_-march=native_-mtune=native_-O3_-fomit-frame-pointer2019100520190816
6322938480054669 0 079260 420 1560refgcc_-m32_-march=corei7-avx_-O3_-fomit-frame-pointer2019100520190816
6633042035463840 0 089052 416 1560refgcc_-funroll-loops_-m32_-march=pentium4_-O3_-fomit-frame-pointer2019100620190816
6676829402163037 0 088404 416 1560refgcc_-funroll-loops_-m32_-march=nocona_-O3_-fomit-frame-pointer2019100620190816
6713548777556344 0 080164 416 1560refgcc_-m32_-march=prescott_-O3_-fomit-frame-pointer2019100620190816
6716294299856344 0 080164 416 1560refgcc_-m32_-march=nocona_-O3_-fomit-frame-pointer2019100620190816
6853128061834041 0 060596 416 1560refgcc_-funroll-loops_-m32_-march=pentium-m_-O2_-fomit-frame-pointer2019100620190816
6896712663329226 0 054136 416 1560refgcc_-funroll-loops_-m32_-march=nocona_-O2_-fomit-frame-pointer2019100620190816
6956113114229358 0 054744 416 1560refgcc_-funroll-loops_-m32_-march=prescott_-O_-fomit-frame-pointer2019100620190816
7048354737957866 0 081592 416 1560refgcc_-m32_-march=pentium4_-O3_-fomit-frame-pointer2019100620190816
7055142749429358 0 054744 416 1560refgcc_-funroll-loops_-m32_-march=nocona_-O_-fomit-frame-pointer2019100620190816
7233959353228354 0 052824 420 1560refgcc_-funroll-loops_-m32_-march=barcelona_-O_-fomit-frame-pointer2019100620190816
7286659641928833 0 054176 416 1560refgcc_-funroll-loops_-m32_-march=pentium4_-O_-fomit-frame-pointer2019100620190816
7431073868428942 0 053944 416 1560refgcc_-funroll-loops_-m32_-march=k8_-O_-fomit-frame-pointer2019100620190816
7453419754820006 0 040256 412 1560refgcc_-m32_-march=core2_-msse4_-Os_-fomit-frame-pointer2019100520190816
7507121390131685 0 057648 416 1560refgcc_-funroll-loops_-m32_-march=pentium-m_-O_-fomit-frame-pointer2019100620190816
7542913888228490 0 053272 416 1560refgcc_-funroll-loops_-m32_-march=pentium4_-O2_-fomit-frame-pointer2019100620190816
7624489059620534 0 040772 412 1560refgcc_-funroll-loops_-m32_-march=nocona_-Os_-fomit-frame-pointer2019100620190816
7757481904822027 0 043884 420 1560refgcc_-m32_-march=core-avx-i_-O2_-fomit-frame-pointer2019100620190816
7761414116720069 0 040320 412 1560refgcc_-m32_-march=core-avx-i_-Os_-fomit-frame-pointer2019100620190816
7978577500822027 0 043884 420 1560refgcc_-m32_-march=native_-mtune=native_-O2_-fomit-frame-pointer2019100520190816
8101473603022027 0 043884 420 1560refgcc_-m32_-march=corei7-avx_-O2_-fomit-frame-pointer2019100620190816
8115029685020069 0 040320 412 1560refgcc_-m32_-march=native_-mtune=native_-Os_-fomit-frame-pointer2019100520190816
8130781337422267 0 044140 420 1560refgcc_-m32_-march=core2_-msse4_-O2_-fomit-frame-pointer2019100520190816
8146213185920069 0 040320 412 1560refgcc_-m32_-march=corei7-avx_-Os_-fomit-frame-pointer2019100620190816
8169711870620006 0 040256 412 1560refgcc_-m32_-march=core2_-msse4.1_-Os_-fomit-frame-pointer2019100520190816
8184351288322267 0 044140 420 1560refgcc_-m32_-march=corei7_-O2_-fomit-frame-pointer2019100520190816
8206193885420535 0 040832 412 1560refgcc_-m32_-march=core2_-Os_-fomit-frame-pointer2019100520190816
8210617436131019 0 056244 420 1560refgcc_-funroll-loops_-m32_-O2_-fomit-frame-pointer2019100620190816
8350348564531392 0 055748 424 1560refgcc_-funroll-loops_-m32_-march=k6_-O_-fomit-frame-pointer2019100620190816
8350353749723462 0 045664 416 1560refgcc_-m32_-march=pentium-m_-O2_-fomit-frame-pointer2019100620190816
8351961883820006 0 040256 412 1560refgcc_-m32_-march=corei7_-Os_-fomit-frame-pointer2019100520190816
8382124187722267 0 044140 420 1560refgcc_-m32_-march=core2_-msse4.1_-O2_-fomit-frame-pointer2019100520190816
8435917061420534 0 040832 412 1560refgcc_-m32_-march=nocona_-Os_-fomit-frame-pointer2019100620190816
8445732097220534 0 040772 412 1560refgcc_-funroll-loops_-m32_-march=pentium4_-Os_-fomit-frame-pointer2019100620190816
8463001111520534 0 040832 412 1560refgcc_-m32_-march=pentium4_-Os_-fomit-frame-pointer2019100620190816
8566379380520533 0 040768 412 1560refgcc_-m32_-march=pentium-m_-Os_-fomit-frame-pointer2019100620190816
8581106792720534 0 040832 412 1560refgcc_-m32_-march=prescott_-Os_-fomit-frame-pointer2019100620190816
8611025724021674 0 043208 420 1560refgcc_-m32_-march=core-avx-i_-O_-fomit-frame-pointer2019100620190816
8629565199921674 0 043208 420 1560refgcc_-m32_-march=native_-mtune=native_-O_-fomit-frame-pointer2019100520190816
8658226595421228 0 042508 416 1560refgcc_-m32_-march=nocona_-O2_-fomit-frame-pointer2019100620190816
8668758213920542 0 040768 412 1560refgcc_-funroll-loops_-m32_-march=pentium-m_-Os_-fomit-frame-pointer2019100620190816
8696003015743060 0 042196 420 1560refgcc_-m32_-march=barcelona_-O_-fomit-frame-pointer2019100520190816
8706595255821228 0 042508 416 1560refgcc_-m32_-march=prescott_-O2_-fomit-frame-pointer2019100620190816
8719254623122811 0 044716 420 1560refgcc_-m32_-march=core2_-O2_-fomit-frame-pointer2019100520190816
8798762587221674 0 043208 420 1560refgcc_-m32_-march=corei7-avx_-O_-fomit-frame-pointer2019100620190816
8807983266632056 0 056344 420 1560refgcc_-funroll-loops_-m32_-march=k6_-O2_-fomit-frame-pointer2019100620190816
8855477882121030 0 042464 416 1560refgcc_-m32_-march=pentium4_-O_-fomit-frame-pointer2019100620190816
8863439856621674 0 043208 420 1560refgcc_-m32_-march=corei7_-O_-fomit-frame-pointer2019100520190816
8870236847432744 0 056988 420 1560refgcc_-m32_-O3_-fomit-frame-pointer2019100520190816
8881162572034892 0 058780 420 1560refgcc_-m32_-march=k6_-O3_-fomit-frame-pointer2019100620190816
8893495218043803 0 069640 420 1560refgcc_-funroll-loops_-m32_-march=k6-3_-O3_-fomit-frame-pointer2019100620190816
8908727722543803 0 069640 420 1560refgcc_-funroll-loops_-m32_-march=k6-2_-O3_-fomit-frame-pointer2019100620190816
8919677724338026 0 064060 416 1560refgcc_-funroll-loops_-m32_-march=athlon_-O3_-fomit-frame-pointer2019100620190816
8967539228743139 0 068368 420 1560refgcc_-funroll-loops_-m32_-march=k6_-O3_-fomit-frame-pointer2019100620190816
8981982460231789 0 055600 416 1560refgcc_-m32_-march=athlon_-O3_-fomit-frame-pointer2019100620190816
8983977358538883 0 065072 420 1560refgcc_-funroll-loops_-m32_-O3_-fomit-frame-pointer2019100620190816
8999605916736168 0 060280 420 1560refgcc_-m32_-march=k6-3_-O3_-fomit-frame-pointer2019100620190816
9006099400821676 0 043208 420 1560refgcc_-m32_-march=core2_-msse4_-O_-fomit-frame-pointer2019100520190816
9087442838736168 0 060280 420 1560refgcc_-m32_-march=k6-2_-O3_-fomit-frame-pointer2019100620190816
9092017410045645 0 073356 416 1560refgcc_-funroll-loops_-m32_-march=pentiumpro_-O3_-fomit-frame-pointer2019100620190816
9127539002130264 0 054912 416 1560refgcc_-funroll-loops_-m32_-march=athlon_-O2_-fomit-frame-pointer2019100620190816
9129180101120449 0 041760 416 1560refgcc_-m32_-march=pentium4_-O2_-fomit-frame-pointer2019100620190816
9129637548645645 0 073356 416 1560refgcc_-funroll-loops_-m32_-march=pentium2_-O3_-fomit-frame-pointer2019100620190816
9149338548032056 0 056344 420 1560refgcc_-funroll-loops_-m32_-march=k6-2_-O2_-fomit-frame-pointer2019100620190816
9158666510433251 0 057940 416 1560refgcc_-m32_-march=pentiumpro_-O3_-fomit-frame-pointer2019100620190816
9201405303321917 0 043368 416 1560refgcc_-m32_-march=pentium-m_-O_-fomit-frame-pointer2019100620190816
9209780191246394 0 074316 416 1560refgcc_-funroll-loops_-m32_-march=pentium3_-O3_-fomit-frame-pointer2019100620190816
9225019170033348 0 059272 416 1560refgcc_-funroll-loops_-m32_-march=pentiumpro_-O_-fomit-frame-pointer2019100620190816
9228544332631392 0 055748 424 1560refgcc_-funroll-loops_-m32_-march=k6-3_-O_-fomit-frame-pointer2019100620190816
9242022941432056 0 056344 420 1560refgcc_-funroll-loops_-m32_-march=k6-3_-O2_-fomit-frame-pointer2019100620190816
9245766871230169 0 054856 416 1560refgcc_-funroll-loops_-m32_-march=athlon_-O_-fomit-frame-pointer2019100620190816
9264318258020760 0 041844 416 1560refgcc_-m32_-march=k8_-O_-fomit-frame-pointer2019100620190816
9268352952333348 0 059272 416 1560refgcc_-funroll-loops_-m32_-march=pentium2_-O_-fomit-frame-pointer2019100620190816
9268364089833348 0 059272 416 1560refgcc_-funroll-loops_-m32_-march=pentium3_-O_-fomit-frame-pointer2019100620190816
9270814360842370 0 067504 416 1560refgcc_-funroll-loops_-m32_-march=i486_-O3_-fomit-frame-pointer2019100720190816
9305425256731392 0 055748 424 1560refgcc_-funroll-loops_-m32_-march=k6-2_-O_-fomit-frame-pointer2019100620190816
9308416728920534 0 040772 412 1560refgcc_-funroll-loops_-m32_-march=prescott_-Os_-fomit-frame-pointer2019100620190816
9309133021533251 0 057940 416 1560refgcc_-m32_-march=pentium2_-O3_-fomit-frame-pointer2019100620190816
9335531559321676 0 043208 420 1560refgcc_-m32_-march=core2_-msse4.1_-O_-fomit-frame-pointer2019100520190816
9353506027542348 0 067908 416 1560refgcc_-funroll-loops_-m32_-march=pentium_-O3_-fomit-frame-pointer2019100620190816
9395404446339863 0 064284 420 1560refgcc_-funroll-loops_-m32_-march=i386_-O3_-fomit-frame-pointer2019100720190816
9400173130221676 0 043208 420 1560refgcc_-m32_-march=core2_-O_-fomit-frame-pointer2019100520190816
9405477926430365 0 055512 420 1560refgcc_-funroll-loops_-m32_-O_-fomit-frame-pointer2019100620190816
9430818660921057 0 042456 416 1560refgcc_-m32_-march=nocona_-O_-fomit-frame-pointer2019100620190816
9443443610121057 0 042456 416 1560refgcc_-m32_-march=prescott_-O_-fomit-frame-pointer2019100620190816
9462651961534060 0 059004 416 1560refgcc_-m32_-march=pentium3_-O3_-fomit-frame-pointer2019100620190816
9484179765632088 0 056752 416 1560refgcc_-funroll-loops_-m32_-march=i486_-O2_-fomit-frame-pointer2019100720190816
9501535289433553 0 059360 416 1560refgcc_-funroll-loops_-m32_-march=pentium3_-O2_-fomit-frame-pointer2019100620190816
9516770469333553 0 059360 416 1560refgcc_-funroll-loops_-m32_-march=pentium2_-O2_-fomit-frame-pointer2019100620190816
9523592704835052 0 059076 416 1560refgcc_-m32_-march=pentium-mmx_-O3_-fomit-frame-pointer2019100620190816
9525166791331339 0 054304 420 1560refgcc_-m32_-march=i386_-O3_-fomit-frame-pointer2019100620190816
9546947464542348 0 067908 416 1560refgcc_-funroll-loops_-m32_-march=pentium-mmx_-O3_-fomit-frame-pointer2019100620190816
9548234813430569 0 055076 420 1560refgcc_-funroll-loops_-m32_-march=i386_-O2_-fomit-frame-pointer2019100720190816
9571060831230979 0 055268 424 1560refgcc_-funroll-loops_-m32_-march=pentium-mmx_-O_-fomit-frame-pointer2019100620190816
9604262603433493 0 057140 416 1560refgcc_-m32_-march=i486_-O3_-fomit-frame-pointer2019100620190816
9611215418133553 0 059360 416 1560refgcc_-funroll-loops_-m32_-march=pentiumpro_-O2_-fomit-frame-pointer2019100620190816
9617909219432523 0 057112 416 1560refgcc_-funroll-loops_-m32_-march=pentium_-O2_-fomit-frame-pointer2019100720190816
9661497744632523 0 057112 416 1560refgcc_-funroll-loops_-m32_-march=pentium-mmx_-O2_-fomit-frame-pointer2019100620190816

Test failure

Implementation: avx2
Security model: unknown
Compiler: gcc -m32 -march=core-avx2 -O2 -fomit-frame-pointer
error 111

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
gcc -m32 -march=core-avx2 -O2 -fomit-frame-pointer avx2
gcc -m32 -march=core-avx2 -O -fomit-frame-pointer avx2
gcc -m32 -march=core-avx2 -Os -fomit-frame-pointer avx2
gcc -m32 -march=core-avx2 -O2 -fomit-frame-pointer ref
gcc -m32 -march=core-avx2 -O -fomit-frame-pointer ref
gcc -m32 -march=core-avx2 -Os -fomit-frame-pointer ref

Test failure

Implementation: ref
Security model: unknown
Compiler: gcc -funroll-loops -m32 -Os -fomit-frame-pointer
error 142
Alarm clock

Number of similar (compiler,implementation) pairs: 60, namely:
CompilerImplementations
gcc -funroll-loops -m32 -Os -fomit-frame-pointer ref
gcc -funroll-loops -m32 -march=athlon -Os -fomit-frame-pointer ref
gcc -funroll-loops -m32 -march=barcelona -O2 -fomit-frame-pointer ref
gcc -funroll-loops -m32 -march=barcelona -O3 -fomit-frame-pointer ref
gcc -funroll-loops -m32 -march=barcelona -Os -fomit-frame-pointer ref
gcc -funroll-loops -m32 -march=i386 -O -fomit-frame-pointer ref
gcc -funroll-loops -m32 -march=i386 -Os -fomit-frame-pointer ref
gcc -funroll-loops -m32 -march=i486 -Os -fomit-frame-pointer ref
gcc -funroll-loops -m32 -march=k6-2 -Os -fomit-frame-pointer ref
gcc -funroll-loops -m32 -march=k6-3 -Os -fomit-frame-pointer ref
gcc -funroll-loops -m32 -march=k6 -Os -fomit-frame-pointer ref
gcc -funroll-loops -m32 -march=k8 -O2 -fomit-frame-pointer ref
gcc -funroll-loops -m32 -march=k8 -O3 -fomit-frame-pointer ref
gcc -funroll-loops -m32 -march=k8 -Os -fomit-frame-pointer ref
gcc -funroll-loops -m32 -march=pentium-mmx -Os -fomit-frame-pointer ref
gcc -funroll-loops -m32 -march=pentium2 -Os -fomit-frame-pointer ref
gcc -funroll-loops -m32 -march=pentium3 -Os -fomit-frame-pointer ref
gcc -funroll-loops -m32 -march=pentium -O -fomit-frame-pointer ref
gcc -funroll-loops -m32 -march=pentium -Os -fomit-frame-pointer ref
gcc -funroll-loops -m32 -march=pentiumpro -Os -fomit-frame-pointer ref
gcc -m32 -O2 -fomit-frame-pointer ref
gcc -m32 -O -fomit-frame-pointer ref
gcc -m32 -Os -fomit-frame-pointer ref
gcc -m32 -march=athlon -O2 -fomit-frame-pointer ref
gcc -m32 -march=athlon -O -fomit-frame-pointer ref
gcc -m32 -march=athlon -Os -fomit-frame-pointer ref
gcc -m32 -march=i386 -O2 -fomit-frame-pointer ref
gcc -m32 -march=i386 -O -fomit-frame-pointer ref
gcc -m32 -march=i386 -Os -fomit-frame-pointer ref
gcc -m32 -march=i486 -O2 -fomit-frame-pointer ref
gcc -m32 -march=i486 -O -fomit-frame-pointer ref
gcc -m32 -march=i486 -Os -fomit-frame-pointer ref
gcc -m32 -march=k6-2 -O2 -fomit-frame-pointer ref
gcc -m32 -march=k6-2 -O -fomit-frame-pointer ref
gcc -m32 -march=k6-2 -Os -fomit-frame-pointer ref
gcc -m32 -march=k6-3 -O2 -fomit-frame-pointer ref
gcc -m32 -march=k6-3 -O -fomit-frame-pointer ref
gcc -m32 -march=k6-3 -Os -fomit-frame-pointer ref
gcc -m32 -march=k6 -O2 -fomit-frame-pointer ref
gcc -m32 -march=k6 -O -fomit-frame-pointer ref
gcc -m32 -march=k6 -Os -fomit-frame-pointer ref
gcc -m32 -march=k8 -O2 -fomit-frame-pointer ref
gcc -m32 -march=k8 -O3 -fomit-frame-pointer ref
gcc -m32 -march=k8 -Os -fomit-frame-pointer ref
gcc -m32 -march=pentium-mmx -O2 -fomit-frame-pointer ref
gcc -m32 -march=pentium-mmx -O -fomit-frame-pointer ref
gcc -m32 -march=pentium-mmx -Os -fomit-frame-pointer ref
gcc -m32 -march=pentium2 -O2 -fomit-frame-pointer ref
gcc -m32 -march=pentium2 -O -fomit-frame-pointer ref
gcc -m32 -march=pentium2 -Os -fomit-frame-pointer ref
gcc -m32 -march=pentium3 -O2 -fomit-frame-pointer ref
gcc -m32 -march=pentium3 -O -fomit-frame-pointer ref
gcc -m32 -march=pentium3 -Os -fomit-frame-pointer ref
gcc -m32 -march=pentium -O2 -fomit-frame-pointer ref
gcc -m32 -march=pentium -O3 -fomit-frame-pointer ref
gcc -m32 -march=pentium -O -fomit-frame-pointer ref
gcc -m32 -march=pentium -Os -fomit-frame-pointer ref
gcc -m32 -march=pentiumpro -O2 -fomit-frame-pointer ref
gcc -m32 -march=pentiumpro -O -fomit-frame-pointer ref
gcc -m32 -march=pentiumpro -Os -fomit-frame-pointer ref

Test failure

Implementation: ref
Security model: unknown
Compiler: gcc -m32 -march=barcelona -O2 -fomit-frame-pointer
error 142
Alarm clock
error 142
Alarm clock

Number of similar (compiler,implementation) pairs: 3, namely:
CompilerImplementations
gcc -m32 -march=barcelona -O2 -fomit-frame-pointer ref
gcc -m32 -march=barcelona -O3 -fomit-frame-pointer ref
gcc -m32 -march=barcelona -Os -fomit-frame-pointer ref

Compiler output

Implementation: avx2
Security model: unknown
Compiler: gcc -funroll-loops -m32 -O2 -fomit-frame-pointer
KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c: In function 'KeccakP1600times4_AddLanesAll':
KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:135:40: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
KeccakP-1600-times4-SIMD256.c: #define Xor_In4( argIndex ) lanes0 = LOAD256u( curData0[argIndex]),\
KeccakP-1600-times4-SIMD256.c: ^
KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:146:9: note: in expansion of macro 'Xor_In4'
KeccakP-1600-times4-SIMD256.c: Xor_In4( 0 );
KeccakP-1600-times4-SIMD256.c: ^~~~~~~
KeccakP-1600-times4-SIMD256.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:43:0,
KeccakP-1600-times4-SIMD256.c: from KeccakP-1600-times4-SIMD256.c:21:
KeccakP-1600-times4-SIMD256.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx2intrin.h:913:1: error: inlining failed in call to always_inline '_mm256_xor_si256': target specific option mismatch
KeccakP-1600-times4-SIMD256.c: _mm256_xor_si256 (__m256i __A, __m256i __B)
KeccakP-1600-times4-SIMD256.c: ^~~~~~~~~~~~~~~~
KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:55:41: note: called from here
KeccakP-1600-times4-SIMD256.c: #define XOReq256(a, b) a = _mm256_xor_si256(a, b)
KeccakP-1600-times4-SIMD256.c: ^~~~~~~~~~~~~~~~~~~~~~
KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:143:33: note: in expansion of macro 'XOReq256'
KeccakP-1600-times4-SIMD256.c: XOReq256( stateAsLanes[argIndex+3], lanes3 )
KeccakP-1600-times4-SIMD256.c: ^~~~~~~~
KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:149:9: note: in expansion of macro 'Xor_In4'
KeccakP-1600-times4-SIMD256.c: Xor_In4( 12 );
KeccakP-1600-times4-SIMD256.c: ^~~~~~~
KeccakP-1600-times4-SIMD256.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:43:0,
KeccakP-1600-times4-SIMD256.c: from KeccakP-1600-times4-SIMD256.c:21:
KeccakP-1600-times4-SIMD256.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx2intrin.h:913:1: error: inlining failed in call to always_inline '_mm256_xor_si256': target specific option mismatch
KeccakP-1600-times4-SIMD256.c: _mm256_xor_si256 (__m256i __A, __m256i __B)
KeccakP-1600-times4-SIMD256.c: ...

Number of similar (compiler,implementation) pairs: 156, namely:
CompilerImplementations
gcc -funroll-loops -m32 -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=athlon -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=athlon -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=athlon -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=athlon -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=barcelona -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=barcelona -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=barcelona -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=barcelona -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=i386 -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=i386 -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=i386 -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=i386 -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=i486 -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=i486 -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=i486 -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=i486 -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k6-2 -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k6-2 -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k6-2 -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k6-2 -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k6-3 -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k6-3 -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k6-3 -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k6-3 -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k6 -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k6 -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k6 -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k6 -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k8 -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k8 -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k8 -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k8 -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=nocona -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=nocona -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=nocona -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=nocona -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium-m -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium-m -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium-m -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium-m -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium-mmx -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium-mmx -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium-mmx -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium-mmx -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium2 -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium2 -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium2 -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium2 -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium3 -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium3 -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium3 -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium3 -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium4 -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium4 -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium4 -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium4 -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentiumpro -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentiumpro -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentiumpro -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentiumpro -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=prescott -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=prescott -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=prescott -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=prescott -Os -fomit-frame-pointer avx2
gcc -m32 -O2 -fomit-frame-pointer avx2
gcc -m32 -O3 -fomit-frame-pointer avx2
gcc -m32 -O -fomit-frame-pointer avx2
gcc -m32 -Os -fomit-frame-pointer avx2
gcc -m32 -march=athlon -O2 -fomit-frame-pointer avx2
gcc -m32 -march=athlon -O3 -fomit-frame-pointer avx2
gcc -m32 -march=athlon -O -fomit-frame-pointer avx2
gcc -m32 -march=athlon -Os -fomit-frame-pointer avx2
gcc -m32 -march=core2 -O2 -fomit-frame-pointer avx2
gcc -m32 -march=core2 -O3 -fomit-frame-pointer avx2
gcc -m32 -march=core2 -O -fomit-frame-pointer avx2
gcc -m32 -march=core2 -Os -fomit-frame-pointer avx2
gcc -m32 -march=core2 -msse4.1 -O2 -fomit-frame-pointer avx2
gcc -m32 -march=core2 -msse4.1 -O3 -fomit-frame-pointer avx2
gcc -m32 -march=core2 -msse4.1 -O -fomit-frame-pointer avx2
gcc -m32 -march=core2 -msse4.1 -Os -fomit-frame-pointer avx2
gcc -m32 -march=core2 -msse4 -O2 -fomit-frame-pointer avx2
gcc -m32 -march=core2 -msse4 -O3 -fomit-frame-pointer avx2
gcc -m32 -march=core2 -msse4 -O -fomit-frame-pointer avx2
gcc -m32 -march=core2 -msse4 -Os -fomit-frame-pointer avx2
gcc -m32 -march=corei7 -O2 -fomit-frame-pointer avx2
gcc -m32 -march=corei7 -O3 -fomit-frame-pointer avx2
gcc -m32 -march=corei7 -O -fomit-frame-pointer avx2
gcc -m32 -march=corei7 -Os -fomit-frame-pointer avx2
gcc -m32 -march=i386 -O2 -fomit-frame-pointer avx2
gcc -m32 -march=i386 -O3 -fomit-frame-pointer avx2
gcc -m32 -march=i386 -O -fomit-frame-pointer avx2
gcc -m32 -march=i386 -Os -fomit-frame-pointer avx2
gcc -m32 -march=i486 -O2 -fomit-frame-pointer avx2
gcc -m32 -march=i486 -O3 -fomit-frame-pointer avx2
gcc -m32 -march=i486 -O -fomit-frame-pointer avx2
gcc -m32 -march=i486 -Os -fomit-frame-pointer avx2
gcc -m32 -march=k6-2 -O2 -fomit-frame-pointer avx2
gcc -m32 -march=k6-2 -O3 -fomit-frame-pointer avx2
gcc -m32 -march=k6-2 -O -fomit-frame-pointer avx2
gcc -m32 -march=k6-2 -Os -fomit-frame-pointer avx2
gcc -m32 -march=k6-3 -O2 -fomit-frame-pointer avx2
gcc -m32 -march=k6-3 -O3 -fomit-frame-pointer avx2
gcc -m32 -march=k6-3 -O -fomit-frame-pointer avx2
gcc -m32 -march=k6-3 -Os -fomit-frame-pointer avx2
gcc -m32 -march=k6 -O2 -fomit-frame-pointer avx2
gcc -m32 -march=k6 -O3 -fomit-frame-pointer avx2
gcc -m32 -march=k6 -O -fomit-frame-pointer avx2
gcc -m32 -march=k6 -Os -fomit-frame-pointer avx2
gcc -m32 -march=k8 -O2 -fomit-frame-pointer avx2
gcc -m32 -march=k8 -O3 -fomit-frame-pointer avx2
gcc -m32 -march=k8 -O -fomit-frame-pointer avx2
gcc -m32 -march=k8 -Os -fomit-frame-pointer avx2
gcc -m32 -march=nocona -O2 -fomit-frame-pointer avx2
gcc -m32 -march=nocona -O3 -fomit-frame-pointer avx2
gcc -m32 -march=nocona -O -fomit-frame-pointer avx2
gcc -m32 -march=nocona -Os -fomit-frame-pointer avx2
gcc -m32 -march=pentium-m -O2 -fomit-frame-pointer avx2
gcc -m32 -march=pentium-m -O3 -fomit-frame-pointer avx2
gcc -m32 -march=pentium-m -O -fomit-frame-pointer avx2
gcc -m32 -march=pentium-m -Os -fomit-frame-pointer avx2
gcc -m32 -march=pentium-mmx -O2 -fomit-frame-pointer avx2
gcc -m32 -march=pentium-mmx -O3 -fomit-frame-pointer avx2
gcc -m32 -march=pentium-mmx -O -fomit-frame-pointer avx2
gcc -m32 -march=pentium-mmx -Os -fomit-frame-pointer avx2
gcc -m32 -march=pentium2 -O2 -fomit-frame-pointer avx2
gcc -m32 -march=pentium2 -O3 -fomit-frame-pointer avx2
gcc -m32 -march=pentium2 -O -fomit-frame-pointer avx2
gcc -m32 -march=pentium2 -Os -fomit-frame-pointer avx2
gcc -m32 -march=pentium3 -O2 -fomit-frame-pointer avx2
gcc -m32 -march=pentium3 -O3 -fomit-frame-pointer avx2
gcc -m32 -march=pentium3 -O -fomit-frame-pointer avx2
gcc -m32 -march=pentium3 -Os -fomit-frame-pointer avx2
gcc -m32 -march=pentium4 -O2 -fomit-frame-pointer avx2
gcc -m32 -march=pentium4 -O3 -fomit-frame-pointer avx2
gcc -m32 -march=pentium4 -O -fomit-frame-pointer avx2
gcc -m32 -march=pentium4 -Os -fomit-frame-pointer avx2
gcc -m32 -march=pentium -O2 -fomit-frame-pointer avx2
gcc -m32 -march=pentium -O3 -fomit-frame-pointer avx2
gcc -m32 -march=pentium -O -fomit-frame-pointer avx2
gcc -m32 -march=pentium -Os -fomit-frame-pointer avx2
gcc -m32 -march=pentiumpro -O2 -fomit-frame-pointer avx2
gcc -m32 -march=pentiumpro -O3 -fomit-frame-pointer avx2
gcc -m32 -march=pentiumpro -O -fomit-frame-pointer avx2
gcc -m32 -march=pentiumpro -Os -fomit-frame-pointer avx2
gcc -m32 -march=prescott -O2 -fomit-frame-pointer avx2
gcc -m32 -march=prescott -O3 -fomit-frame-pointer avx2
gcc -m32 -march=prescott -O -fomit-frame-pointer avx2
gcc -m32 -march=prescott -Os -fomit-frame-pointer avx2

Compiler output

Implementation: avx2
Security model: unknown
Compiler: gcc -m32 -march=barcelona -O2 -fomit-frame-pointer
KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c: In function 'KeccakP1600times4_AddLanesAll':
KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:135:40: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
KeccakP-1600-times4-SIMD256.c: #define Xor_In4( argIndex ) lanes0 = LOAD256u( curData0[argIndex]),\
KeccakP-1600-times4-SIMD256.c: ^
KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:146:9: note: in expansion of macro 'Xor_In4'
KeccakP-1600-times4-SIMD256.c: Xor_In4( 0 );
KeccakP-1600-times4-SIMD256.c: ^~~~~~~
KeccakP-1600-times4-SIMD256.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:43:0,
KeccakP-1600-times4-SIMD256.c: from KeccakP-1600-times4-SIMD256.c:21:
KeccakP-1600-times4-SIMD256.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx2intrin.h:913:1: error: inlining failed in call to always_inline '_mm256_xor_si256': target specific option mismatch
KeccakP-1600-times4-SIMD256.c: _mm256_xor_si256 (__m256i __A, __m256i __B)
KeccakP-1600-times4-SIMD256.c: ^~~~~~~~~~~~~~~~
KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:55:41: note: called from here
KeccakP-1600-times4-SIMD256.c: #define XOReq256(a, b) a = _mm256_xor_si256(a, b)
KeccakP-1600-times4-SIMD256.c: ^~~~~~~~~~~~~~~~~~~~~~
KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:143:33: note: in expansion of macro 'XOReq256'
KeccakP-1600-times4-SIMD256.c: XOReq256( stateAsLanes[argIndex+3], lanes3 )
KeccakP-1600-times4-SIMD256.c: ^~~~~~~~
KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:149:9: note: in expansion of macro 'Xor_In4'
KeccakP-1600-times4-SIMD256.c: Xor_In4( 12 );
KeccakP-1600-times4-SIMD256.c: ^~~~~~~
KeccakP-1600-times4-SIMD256.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:43:0,
KeccakP-1600-times4-SIMD256.c: from KeccakP-1600-times4-SIMD256.c:21:
KeccakP-1600-times4-SIMD256.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx2intrin.h:913:1: error: inlining failed in call to always_inline '_mm256_xor_si256': target specific option mismatch
KeccakP-1600-times4-SIMD256.c: _mm256_xor_si256 (__m256i __A, __m256i __B)
KeccakP-1600-times4-SIMD256.c: ...
KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c: In function 'KeccakP1600times4_AddLanesAll':
KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:135:40: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
KeccakP-1600-times4-SIMD256.c: #define Xor_In4( argIndex ) lanes0 = LOAD256u( curData0[argIndex]),\
KeccakP-1600-times4-SIMD256.c: ^
KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:146:9: note: in expansion of macro 'Xor_In4'
KeccakP-1600-times4-SIMD256.c: Xor_In4( 0 );
KeccakP-1600-times4-SIMD256.c: ^~~~~~~
KeccakP-1600-times4-SIMD256.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:43:0,
KeccakP-1600-times4-SIMD256.c: from KeccakP-1600-times4-SIMD256.c:21:
KeccakP-1600-times4-SIMD256.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx2intrin.h:913:1: error: inlining failed in call to always_inline '_mm256_xor_si256': target specific option mismatch
KeccakP-1600-times4-SIMD256.c: _mm256_xor_si256 (__m256i __A, __m256i __B)
KeccakP-1600-times4-SIMD256.c: ^~~~~~~~~~~~~~~~
KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:55:41: note: called from here
KeccakP-1600-times4-SIMD256.c: #define XOReq256(a, b) a = _mm256_xor_si256(a, b)
KeccakP-1600-times4-SIMD256.c: ^~~~~~~~~~~~~~~~~~~~~~
KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:143:33: note: in expansion of macro 'XOReq256'
KeccakP-1600-times4-SIMD256.c: XOReq256( stateAsLanes[argIndex+3], lanes3 )
KeccakP-1600-times4-SIMD256.c: ^~~~~~~~
KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:149:9: note: in expansion of macro 'Xor_In4'
KeccakP-1600-times4-SIMD256.c: Xor_In4( 12 );
KeccakP-1600-times4-SIMD256.c: ^~~~~~~
KeccakP-1600-times4-SIMD256.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:43:0,
KeccakP-1600-times4-SIMD256.c: from KeccakP-1600-times4-SIMD256.c:21:
KeccakP-1600-times4-SIMD256.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx2intrin.h:913:1: error: inlining failed in call to always_inline '_mm256_xor_si256': target specific option mismatch
KeccakP-1600-times4-SIMD256.c: _mm256_xor_si256 (__m256i __A, __m256i __B)
KeccakP-1600-times4-SIMD256.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -m32 -march=barcelona -O2 -fomit-frame-pointer avx2
gcc -m32 -march=barcelona -O3 -fomit-frame-pointer avx2
gcc -m32 -march=barcelona -O -fomit-frame-pointer avx2
gcc -m32 -march=barcelona -Os -fomit-frame-pointer avx2

Compiler output

Implementation: avx2
Security model: unknown
Compiler: gcc -m32 -march=core-avx-i -O2 -fomit-frame-pointer
KeccakP-1600-times4-SIMD256.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:43:0,
KeccakP-1600-times4-SIMD256.c: from KeccakP-1600-times4-SIMD256.c:21:
KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c: In function 'KeccakP1600times4_AddLanesAll':
KeccakP-1600-times4-SIMD256.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx2intrin.h:913:1: error: inlining failed in call to always_inline '_mm256_xor_si256': target specific option mismatch
KeccakP-1600-times4-SIMD256.c: _mm256_xor_si256 (__m256i __A, __m256i __B)
KeccakP-1600-times4-SIMD256.c: ^~~~~~~~~~~~~~~~
KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:55:41: note: called from here
KeccakP-1600-times4-SIMD256.c: #define XOReq256(a, b) a = _mm256_xor_si256(a, b)
KeccakP-1600-times4-SIMD256.c: ^~~~~~~~~~~~~~~~~~~~~~
KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:143:33: note: in expansion of macro 'XOReq256'
KeccakP-1600-times4-SIMD256.c: XOReq256( stateAsLanes[argIndex+3], lanes3 )
KeccakP-1600-times4-SIMD256.c: ^~~~~~~~
KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:149:9: note: in expansion of macro 'Xor_In4'
KeccakP-1600-times4-SIMD256.c: Xor_In4( 12 );
KeccakP-1600-times4-SIMD256.c: ^~~~~~~
KeccakP-1600-times4-SIMD256.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:43:0,
KeccakP-1600-times4-SIMD256.c: from KeccakP-1600-times4-SIMD256.c:21:
KeccakP-1600-times4-SIMD256.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx2intrin.h:913:1: error: inlining failed in call to always_inline '_mm256_xor_si256': target specific option mismatch
KeccakP-1600-times4-SIMD256.c: _mm256_xor_si256 (__m256i __A, __m256i __B)
KeccakP-1600-times4-SIMD256.c: ^~~~~~~~~~~~~~~~
KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:55:41: note: called from here
KeccakP-1600-times4-SIMD256.c: #define XOReq256(a, b) a = _mm256_xor_si256(a, b)
KeccakP-1600-times4-SIMD256.c: ^~~~~~~~~~~~~~~~~~~~~~
KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:142:33: note: in expansion of macro 'XOReq256'
KeccakP-1600-times4-SIMD256.c: XOReq256( stateAsLanes[argIndex+2], lanes2 ),\
KeccakP-1600-times4-SIMD256.c: ...

Number of similar (compiler,implementation) pairs: 12, namely:
CompilerImplementations
gcc -m32 -march=core-avx-i -O2 -fomit-frame-pointer avx2
gcc -m32 -march=core-avx-i -O3 -fomit-frame-pointer avx2
gcc -m32 -march=core-avx-i -O -fomit-frame-pointer avx2
gcc -m32 -march=core-avx-i -Os -fomit-frame-pointer avx2
gcc -m32 -march=corei7-avx -O2 -fomit-frame-pointer avx2
gcc -m32 -march=corei7-avx -O3 -fomit-frame-pointer avx2
gcc -m32 -march=corei7-avx -O -fomit-frame-pointer avx2
gcc -m32 -march=corei7-avx -Os -fomit-frame-pointer avx2
gcc -m32 -march=native -mtune=native -O2 -fomit-frame-pointer avx2
gcc -m32 -march=native -mtune=native -O3 -fomit-frame-pointer avx2
gcc -m32 -march=native -mtune=native -O -fomit-frame-pointer avx2
gcc -m32 -march=native -mtune=native -Os -fomit-frame-pointer avx2