Implementation notes: amd64, par, crypto_hash/bmw256

Computer: par
Architecture: amd64
CPU ID: GenuineIntel-000406c3-bfebfbff
SUPERCOP version: 20161026
Operation: crypto_hash
Primitive: bmw256
TimeImplementationCompilerBenchmark dateSUPERCOP version
15780amd_k10/x86-64/sse2gcc -funroll-loops -march=native -mcpu=native -O32016121420161026
15840amd_k10/x86-64/sse2gcc -march=native -mcpu=native -Os2016121420161026
15940amd_k10/x86-64/sse2gcc -march=native -mcpu=native -O32016121420161026
16020amd_k10/x86-64/sse2gcc -march=native -mcpu=native -O22016121420161026
16120amd_k10/x86-64/sse2gcc -funroll-loops -march=native -mcpu=native -O22016121420161026
16180amd_k10/x86-64/sse2gcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
16680core_i7_45nm/x86-64/sse4_v1gcc -march=native -mcpu=native -O32016121420161026
16700core_i7_45nm/x86-64/sse4_v1gcc -funroll-loops -march=native -mcpu=native -O22016121420161026
16700core_i7_45nm/x86-64/sse4_v1gcc -funroll-loops -march=native -mcpu=native -O32016121420161026
16700core_i7_45nm/x86-64/sse4_v1gcc -march=native -mcpu=native -O22016121420161026
16760core_i7_45nm/x86-64/sse4_v1gcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
16760core_i7_45nm/x86-64/sse4_v1gcc -march=native -mcpu=native -Os2016121420161026
17200core_2_45nm/x86-64/sse4_v1gcc -funroll-loops -march=native -mcpu=native -O32016121420161026
17200core_2_45nm/x86-64/sse4_v1gcc -march=native -mcpu=native -O32016121420161026
17220core_2_45nm/x86-64/sse4_v1gcc -funroll-loops -march=native -mcpu=native -O22016121420161026
17220core_2_45nm/x86-64/sse4_v1gcc -march=native -mcpu=native -O22016121420161026
17280core_2_45nm/x86-64/sse4_v1gcc -march=native -mcpu=native -Os2016121420161026
17320core_2_65nm/x86-64/ssse3_v1gcc -march=native -mcpu=native -O22016121420161026
17320core_2_65nm/x86-64/ssse3_v1gcc -march=native -mcpu=native -O32016121420161026
17340core_2_65nm/x86-64/ssse3_v1gcc -funroll-loops -march=native -mcpu=native -O22016121420161026
17360core_2_65nm/x86-64/ssse3_v1gcc -march=native -mcpu=native -Os2016121420161026
17380opt29_v5_ssse3_asm64gcc -funroll-loops -march=native -mcpu=native -O22016121420161026
17380core_2_45nm/x86-64/sse4_v1gcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
17400core_2_65nm/x86-64/ssse3_v1gcc -funroll-loops -march=native -mcpu=native -O32016121420161026
17400opt29_v5_ssse3_asm64gcc -funroll-loops -march=native -mcpu=native -O32016121420161026
17400opt29_v5_ssse3_asm64gcc -march=native -mcpu=native -O22016121420161026
17420opt29_v5_ssse3_asm64gcc -march=native -mcpu=native -O32016121420161026
17460opt29_v5_ssse3_asm64gcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
17460opt29_v5_ssse3_asm64gcc -march=native -mcpu=native -Os2016121420161026
17480core_2_65nm/x86-64/ssse3_v1gcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
19040opt31ssse3_asm64gcc -march=native -mcpu=native -O32016121420161026
19080opt31ssse3_asm64gcc -funroll-loops -march=native -mcpu=native -O22016121420161026
19080opt31ssse3_asm64gcc -funroll-loops -march=native -mcpu=native -O32016121420161026
19080opt31ssse3_asm64gcc -march=native -mcpu=native -O22016121420161026
19140opt31ssse3_asm64gcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
19160opt31ssse3_asm64gcc -march=native -mcpu=native -Os2016121420161026
23960amd_k8/x86-64/noSSEgcc -march=native -mcpu=native -O22016121420161026
23980amd_k8/x86-64/noSSEgcc -march=native -mcpu=native -O32016121420161026
24000amd_k8/x86-64/noSSEgcc -funroll-loops -march=native -mcpu=native -O22016121420161026
24000amd_k8/x86-64/noSSEgcc -funroll-loops -march=native -mcpu=native -O32016121420161026
24020amd_k8/x86-64/noSSEgcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
24020amd_k8/x86-64/noSSEgcc -march=native -mcpu=native -Os2016121420161026
24660optc01gcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
24680optabegcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
24780optbbegcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
25060optc04gcc -funroll-loops -march=native -mcpu=native -O32016121420161026
25060optabegcc -march=native -mcpu=native -Os2016121420161026
25080optc04gcc -funroll-loops -march=native -mcpu=native -O22016121420161026
25120optbbegcc -march=native -mcpu=native -Os2016121420161026
25160optc01gcc -march=native -mcpu=native -Os2016121420161026
25360optc03gcc -funroll-loops -march=native -mcpu=native -O22016121420161026
25360optc03gcc -funroll-loops -march=native -mcpu=native -O32016121420161026
25420optc04gcc -march=native -mcpu=native -O32016121420161026
25440optc04gcc -march=native -mcpu=native -O22016121420161026
25600optc02gcc -funroll-loops -march=native -mcpu=native -O32016121420161026
25620optc02gcc -funroll-loops -march=native -mcpu=native -O22016121420161026
25620optc03gcc -march=native -mcpu=native -O32016121420161026
25700optc03gcc -march=native -mcpu=native -O22016121420161026
26000optc01gcc -funroll-loops -march=native -mcpu=native -O32016121420161026
26060optc01gcc -funroll-loops -march=native -mcpu=native -O22016121420161026
26080optbbegcc -funroll-loops -march=native -mcpu=native -O32016121420161026
26100optabegcc -funroll-loops -march=native -mcpu=native -O22016121420161026
26100optbbegcc -funroll-loops -march=native -mcpu=native -O22016121420161026
26100optabegcc -funroll-loops -march=native -mcpu=native -O32016121420161026
26340optc02gcc -march=native -mcpu=native -O22016121420161026
26360optc02gcc -march=native -mcpu=native -O32016121420161026
26580optc01gcc -march=native -mcpu=native -O22016121420161026
26620optc01gcc -march=native -mcpu=native -O32016121420161026
26640optbbegcc -march=native -mcpu=native -O32016121420161026
26660optbgbegcc -funroll-loops -march=native -mcpu=native -O32016121420161026
26700optbgbegcc -funroll-loops -march=native -mcpu=native -O22016121420161026
26700optbbegcc -march=native -mcpu=native -O22016121420161026
26740optabegcc -march=native -mcpu=native -O22016121420161026
26800optabegcc -march=native -mcpu=native -O32016121420161026
27120optbgbegcc -march=native -mcpu=native -O32016121420161026
27240optbgbegcc -march=native -mcpu=native -Os2016121420161026
27260optbgbegcc -march=native -mcpu=native -O22016121420161026
27340optbgbegcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
27560sphlibgcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
28380sphlibgcc -funroll-loops -march=native -mcpu=native -O32016121420161026
28400sphlibgcc -funroll-loops -march=native -mcpu=native -O22016121420161026
28420sphlibgcc -march=native -mcpu=native -Os2016121420161026
28720sphlib-smallgcc -funroll-loops -march=native -mcpu=native -O32016121420161026
28800sphlib-smallgcc -funroll-loops -march=native -mcpu=native -O22016121420161026
29100sphlibgcc -march=native -mcpu=native -O32016121420161026
29120optc02gcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
29180sphlibgcc -march=native -mcpu=native -O22016121420161026
29200optc02gcc -march=native -mcpu=native -Os2016121420161026
29420optc04gcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
29480sphlib-smallgcc -march=native -mcpu=native -O32016121420161026
29860optc03gcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
30020sphlib-smallgcc -march=native -mcpu=native -O22016121420161026
30080sphlib-smallgcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
30120optc04gcc -march=native -mcpu=native -Os2016121420161026
30300sphlib-smallgcc -march=native -mcpu=native -Os2016121420161026
30420optc03gcc -march=native -mcpu=native -Os2016121420161026
50600refgcc -funroll-loops -march=native -mcpu=native -O32016121420161026
50700refgcc -funroll-loops -march=native -mcpu=native -O22016121420161026
51480refgcc -march=native -mcpu=native -O32016121420161026
52220refgcc -march=native -mcpu=native -O22016121420161026
97720refgcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
100200refgcc -march=native -mcpu=native -Os2016121420161026

Compiler output

Implementation: crypto_hash/bmw256/amd_k10/x86-64/sse2
Compiler: gcc -funroll-loops -march=native -mcpu=native -O2
BMW256_supercop_wrapper4.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
Compress256.s: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
measure.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead

Number of similar (compiler,implementation) pairs: 24, namely:
CompilerImplementations
gcc -funroll-loops -march=native -mcpu=native -O2 amd_k10/x86-64/sse2 core_2_45nm/x86-64/sse4_v1 core_2_65nm/x86-64/ssse3_v1 core_i7_45nm/x86-64/sse4_v1
gcc -funroll-loops -march=native -mcpu=native -O3 amd_k10/x86-64/sse2 core_2_45nm/x86-64/sse4_v1 core_2_65nm/x86-64/ssse3_v1 core_i7_45nm/x86-64/sse4_v1
gcc -funroll-loops -march=native -mcpu=native -Os amd_k10/x86-64/sse2 core_2_45nm/x86-64/sse4_v1 core_2_65nm/x86-64/ssse3_v1 core_i7_45nm/x86-64/sse4_v1
gcc -march=native -mcpu=native -O2 amd_k10/x86-64/sse2 core_2_45nm/x86-64/sse4_v1 core_2_65nm/x86-64/ssse3_v1 core_i7_45nm/x86-64/sse4_v1
gcc -march=native -mcpu=native -O3 amd_k10/x86-64/sse2 core_2_45nm/x86-64/sse4_v1 core_2_65nm/x86-64/ssse3_v1 core_i7_45nm/x86-64/sse4_v1
gcc -march=native -mcpu=native -Os amd_k10/x86-64/sse2 core_2_45nm/x86-64/sse4_v1 core_2_65nm/x86-64/ssse3_v1 core_i7_45nm/x86-64/sse4_v1

Compiler output

Implementation: crypto_hash/bmw256/amd_k8/x86-64/noSSE
Compiler: gcc -funroll-loops -march=native -mcpu=native -O2
BMW256_supercop_wrapper4.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
Compress256_icc.s: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
measure.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
gcc -funroll-loops -march=native -mcpu=native -O2 amd_k8/x86-64/noSSE
gcc -funroll-loops -march=native -mcpu=native -O3 amd_k8/x86-64/noSSE
gcc -funroll-loops -march=native -mcpu=native -Os amd_k8/x86-64/noSSE
gcc -march=native -mcpu=native -O2 amd_k8/x86-64/noSSE
gcc -march=native -mcpu=native -O3 amd_k8/x86-64/noSSE
gcc -march=native -mcpu=native -Os amd_k8/x86-64/noSSE

Compiler output

Implementation: crypto_hash/bmw256/optc01
Compiler: gcc -funroll-loops -march=native -mcpu=native -O2
BlueMidnightWish.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
hash.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
measure.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead

Number of similar (compiler,implementation) pairs: 30, namely:
CompilerImplementations
gcc -funroll-loops -march=native -mcpu=native -O2 optc01 optc02 optc03 optc04 ref
gcc -funroll-loops -march=native -mcpu=native -O3 optc01 optc02 optc03 optc04 ref
gcc -funroll-loops -march=native -mcpu=native -Os optc01 optc02 optc03 optc04 ref
gcc -march=native -mcpu=native -O2 optc01 optc02 optc03 optc04 ref
gcc -march=native -mcpu=native -O3 optc01 optc02 optc03 optc04 ref
gcc -march=native -mcpu=native -Os optc01 optc02 optc03 optc04 ref

Compiler output

Implementation: crypto_hash/bmw256/optc31ssse3
Compiler: gcc -funroll-loops -march=native -mcpu=native -O2
Compress256_sse_31.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
Compress256_sse_31.c: Compress256_sse_31.c: In function 'Compress256':
Compress256_sse_31.c: Compress256_sse_31.c:374:18: warning: passing argument 1 of '_mm_store_si128' from incompatible pointer type [-Wincompatible-pointer-types]
Compress256_sse_31.c: _mm_store_si128(&p256[00], copyTemp);
Compress256_sse_31.c: ^
Compress256_sse_31.c: In file included from /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/pmmintrin.h:31:0,
Compress256_sse_31.c: from /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/tmmintrin.h:31,
Compress256_sse_31.c: from Compress256_sse_31.c:12:
Compress256_sse_31.c: /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/emmintrin.h:708:1: note: expected '__vector(2) long long int *' but argument is of type 'u_int32_t * {aka unsigned int *}'
Compress256_sse_31.c: _mm_store_si128 (__m128i *__P, __m128i __B)
Compress256_sse_31.c: ^~~~~~~~~~~~~~~
Compress256_sse_31.c: Compress256_sse_31.c:377:28: warning: passing argument 1 of '_mm_load_si128' from incompatible pointer type [-Wincompatible-pointer-types]
Compress256_sse_31.c: copyTemp = _mm_load_si128(&hashState256(state)->gt;DoublePipe[ 4]);
Compress256_sse_31.c: ^
Compress256_sse_31.c: In file included from /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/pmmintrin.h:31:0,
Compress256_sse_31.c: from /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/tmmintrin.h:31,
Compress256_sse_31.c: from Compress256_sse_31.c:12:
Compress256_sse_31.c: /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/emmintrin.h:690:1: note: expected 'const __vector(2) long long int *' but argument is of type 'u_int32_t * {aka unsigned int *}'
Compress256_sse_31.c: _mm_load_si128 (__m128i const *__P)
Compress256_sse_31.c: ^~~~~~~~~~~~~~
Compress256_sse_31.c: Compress256_sse_31.c:378:18: warning: passing argument 1 of '_mm_store_si128' from incompatible pointer type [-Wincompatible-pointer-types]
Compress256_sse_31.c: _mm_store_si128(&p256[04], copyTemp);
Compress256_sse_31.c: ^
Compress256_sse_31.c: In file included from /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/pmmintrin.h:31:0,
Compress256_sse_31.c: from /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/tmmintrin.h:31,
Compress256_sse_31.c: ...

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
gcc -funroll-loops -march=native -mcpu=native -O2 optc31ssse3
gcc -funroll-loops -march=native -mcpu=native -O3 optc31ssse3
gcc -funroll-loops -march=native -mcpu=native -Os optc31ssse3
gcc -march=native -mcpu=native -O2 optc31ssse3
gcc -march=native -mcpu=native -O3 optc31ssse3
gcc -march=native -mcpu=native -Os optc31ssse3

Compiler output

Implementation: crypto_hash/bmw256/sphlib
Compiler: gcc -funroll-loops -march=native -mcpu=native -O2
bmw.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
hash.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
measure.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead

Number of similar (compiler,implementation) pairs: 12, namely:
CompilerImplementations
gcc -funroll-loops -march=native -mcpu=native -O2 sphlib sphlib-small
gcc -funroll-loops -march=native -mcpu=native -O3 sphlib sphlib-small
gcc -funroll-loops -march=native -mcpu=native -Os sphlib sphlib-small
gcc -march=native -mcpu=native -O2 sphlib sphlib-small
gcc -march=native -mcpu=native -O3 sphlib sphlib-small
gcc -march=native -mcpu=native -Os sphlib sphlib-small

Compiler output

Implementation: crypto_hash/bmw256/optbgbe
Compiler: gcc -funroll-loops -march=native -mcpu=native -O2
bmw_small_speed.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
hash.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
memxor.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
measure.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
gcc -funroll-loops -march=native -mcpu=native -O2 optbgbe
gcc -funroll-loops -march=native -mcpu=native -O3 optbgbe
gcc -funroll-loops -march=native -mcpu=native -Os optbgbe
gcc -march=native -mcpu=native -O2 optbgbe
gcc -march=native -mcpu=native -O3 optbgbe
gcc -march=native -mcpu=native -Os optbgbe

Compiler output

Implementation: crypto_hash/bmw256/optabe
Compiler: gcc -funroll-loops -march=native -mcpu=native -O2
hash.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
measure.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead

Number of similar (compiler,implementation) pairs: 12, namely:
CompilerImplementations
gcc -funroll-loops -march=native -mcpu=native -O2 optabe optbbe
gcc -funroll-loops -march=native -mcpu=native -O3 optabe optbbe
gcc -funroll-loops -march=native -mcpu=native -Os optabe optbbe
gcc -march=native -mcpu=native -O2 optabe optbbe
gcc -march=native -mcpu=native -O3 optabe optbbe
gcc -march=native -mcpu=native -Os optabe optbbe

Compiler output

Implementation: crypto_hash/bmw256/opt31ssse3_asm64
Compiler: gcc -funroll-loops -march=native -mcpu=native -O2
wrapper.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
wrapper.c: wrapper.c:37:0: warning: "crypto_hash_BYTES" redefined
wrapper.c: #define crypto_hash_BYTES 32
wrapper.c:
wrapper.c: In file included from wrapper.c:20:0:
wrapper.c: crypto_hash.h:7:0: note: this is the location of the previous definition
wrapper.c: #define crypto_hash_BYTES crypto_hash_bmw256_BYTES
wrapper.c:
wrapper.c: wrapper.c:39:8: warning: type defaults to 'int' in declaration of 'Compress256' [-Wimplicit-int]
wrapper.c: extern Compress256(u_int32_t *data32, u_int32_t *data32_end, hashState *state);
wrapper.c: ^~~~~~~~~~~
wrapper.c: wrapper.c: In function 'crypto_hash_bmw256_opt31ssse3_asm64':
wrapper.c: wrapper.c:238:34: warning: passing argument 3 of 'Compress256' from incompatible pointer type [-Wincompatible-pointer-types]
wrapper.c: Compress256(data32, data32 +16, CONST32final-4); // TODO: This is a HACK...
wrapper.c: ^~~~~~~~~~~~
wrapper.c: wrapper.c:39:8: note: expected 'hashState * {aka struct gt; *}' but argument is of type 'u_int32_t * {aka unsigned int *}'
wrapper.c: extern Compress256(u_int32_t *data32, u_int32_t *data32_end, hashState *state);
wrapper.c: ^~~~~~~~~~~
Compress256_sse_31_O1.s: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
measure.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
gcc -funroll-loops -march=native -mcpu=native -O2 opt31ssse3_asm64
gcc -funroll-loops -march=native -mcpu=native -O3 opt31ssse3_asm64
gcc -funroll-loops -march=native -mcpu=native -Os opt31ssse3_asm64
gcc -march=native -mcpu=native -O2 opt31ssse3_asm64
gcc -march=native -mcpu=native -O3 opt31ssse3_asm64
gcc -march=native -mcpu=native -Os opt31ssse3_asm64

Compiler output

Implementation: crypto_hash/bmw256/opt29_v5_ssse3_asm64
Compiler: gcc -funroll-loops -march=native -mcpu=native -O2
wrapper4.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
Compress256.s: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
measure.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
gcc -funroll-loops -march=native -mcpu=native -O2 opt29_v5_ssse3_asm64
gcc -funroll-loops -march=native -mcpu=native -O3 opt29_v5_ssse3_asm64
gcc -funroll-loops -march=native -mcpu=native -Os opt29_v5_ssse3_asm64
gcc -march=native -mcpu=native -O2 opt29_v5_ssse3_asm64
gcc -march=native -mcpu=native -O3 opt29_v5_ssse3_asm64
gcc -march=native -mcpu=native -Os opt29_v5_ssse3_asm64