Implementation notes: amd64, skylake, crypto_aead/trivia0v2

Computer: skylake
Architecture: amd64
CPU ID: GenuineIntel-000506e3-bfebfbff
SUPERCOP version: 20161026
Operation: crypto_aead
Primitive: trivia0v2
Time     Implementation  Compiler  Benchmark date  SUPERCOP version
45464    sse4            clang -O3 -fwrapv -mavx -maes -mpclmul -fomit-frame-pointer -Qunused-arguments  20161217  20161026
45660    sse4            clang -O3 -fwrapv -march=x86-64 -mcpu=core-avx2 -mavx2 -maes -mpclmul -fomit-frame-pointer -Qunused-arguments  20161217  20161026
46944    sse4            clang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments  20161217  20161026
46960    sse4            clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments  20161217  20161026
56626    sse4            gcc -m64 -march=core-avx-i -O3 -fomit-frame-pointer  20161217  20161026
56988    sse4            gcc -m64 -march=corei7-avx -O3 -fomit-frame-pointer  20161217  20161026
57704    sse4            gcc -m64 -march=core-avx2 -O3 -fomit-frame-pointer  20161217  20161026
58124    sse4            gcc -m64 -march=native -mtune=native -O3 -fomit-frame-pointer  20161217  20161026
58258    sse4            gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv  20161217  20161026
67818    sse4            gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv  20161217  20161026
67840    sse4            gcc -m64 -march=native -mtune=native -O2 -fomit-frame-pointer  20161217  20161026
68476    sse4            gcc -m64 -march=core-avx2 -O2 -fomit-frame-pointer  20161217  20161026
70240    sse4            gcc -m64 -march=corei7-avx -O2 -fomit-frame-pointer  20161217  20161026
70580    sse4            gcc -m64 -march=core-avx-i -O2 -fomit-frame-pointer  20161217  20161026
72056    sse4            gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv  20161217  20161026
72626    sse4            gcc -m64 -march=core-avx2 -Os -fomit-frame-pointer  20161217  20161026
73334    sse4            gcc -m64 -march=core-avx-i -Os -fomit-frame-pointer  20161217  20161026
73422    sse4            gcc -m64 -march=corei7-avx -Os -fomit-frame-pointer  20161217  20161026
73672    sse4            gcc -m64 -march=native -mtune=native -Os -fomit-frame-pointer  20161217  20161026
77292    sse4            gcc -m64 -march=core-avx2 -O -fomit-frame-pointer  20161217  20161026
77618    sse4            gcc -m64 -march=native -mtune=native -O -fomit-frame-pointer  20161217  20161026
77760    sse4            gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv  20161217  20161026
77894    sse4            gcc -m64 -march=core-avx-i -O -fomit-frame-pointer  20161217  20161026
78034    sse4            gcc -m64 -march=corei7-avx -O -fomit-frame-pointer  20161217  20161026
162326   ref             clang -O3 -fomit-frame-pointer -Qunused-arguments  20161217  20161026
162648   ref             clang -O3 -fwrapv -mavx -maes -mpclmul -fomit-frame-pointer -Qunused-arguments  20161217  20161026
162830   ref             clang -mcpu=cortex-a8 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments  20161217  20161026
162912   ref             clang -O3 -fwrapv -march=x86-64 -mcpu=core-avx2 -mavx2 -maes -mpclmul -fomit-frame-pointer -Qunused-arguments  20161217  20161026
162932   ref             clang -mcpu=cortex-a9 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments  20161217  20161026
162978   ref             clang -O3 -fwrapv -mavx -fomit-frame-pointer -Qunused-arguments  20161217  20161026
164062   ref             clang -mcpu=native -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments  20161217  20161026
164734   ref             clang -O3 -fwrapv -mavx2 -fomit-frame-pointer -Qunused-arguments  20161217  20161026
179254   ref             clang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments  20161217  20161026
180428   ref             clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments  20161217  20161026
210134   ref             gcc -funroll-loops -m64 -march=k8 -O3 -fomit-frame-pointer  20161217  20161026
210308   ref             gcc -funroll-loops -march=k8 -O3 -fomit-frame-pointer  20161217  20161026
211598   ref             gcc -funroll-loops -march=barcelona -O3 -fomit-frame-pointer  20161217  20161026
211638   ref             gcc -fno-schedule-insns -O3 -fomit-frame-pointer  20161217  20161026
212020   ref             gcc -m64 -march=barcelona -O3 -fomit-frame-pointer  20161217  20161026
212974   ref             gcc -march=barcelona -O3 -fomit-frame-pointer  20161217  20161026
213098   ref             gcc -m64 -march=corei7-avx -O3 -fomit-frame-pointer  20161217  20161026
213106   ref             gcc -m64 -march=core-avx-i -O3 -fomit-frame-pointer  20161217  20161026
213440   ref             gcc -O3 -fomit-frame-pointer  20161217  20161026
213684   ref             gcc -funroll-loops -march=nocona -O3 -fomit-frame-pointer  20161217  20161026
214008   ref             gcc -funroll-loops -march=k8 -O2 -fomit-frame-pointer  20161217  20161026
214444   ref             gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv  20161217  20161026
214494   ref             gcc -funroll-loops -m64 -march=nocona -O3 -fomit-frame-pointer  20161217  20161026
214582   ref             gcc -funroll-loops -O2 -fomit-frame-pointer  20161217  20161026
214756   ref             gcc -m64 -march=corei7 -O3 -fomit-frame-pointer  20161217  20161026
214784   ref             gcc -funroll-loops -m64 -march=k8 -O2 -fomit-frame-pointer  20161217  20161026
214784   ref             gcc -m64 -march=core2 -msse4 -O3 -fomit-frame-pointer  20161217  20161026
215026   ref             gcc -funroll-loops -m64 -O2 -fomit-frame-pointer  20161217  20161026
215144   ref             gcc -m64 -march=core2 -msse4.1 -O3 -fomit-frame-pointer  20161217  20161026
215268   ref             gcc -funroll-loops -m64 -march=barcelona -O2 -fomit-frame-pointer  20161217  20161026
215294   ref             gcc -march=nocona -O3 -fomit-frame-pointer  20161217  20161026
215398   ref             gcc -m64 -march=core2 -O3 -fomit-frame-pointer  20161217  20161026
215458   ref             gcc -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer  20161217  20161026
215686   ref             gcc -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer  20161217  20161026
215704   ref             gcc -funroll-loops -m64 -march=nocona -O2 -fomit-frame-pointer  20161217  20161026
215782   ref             gcc -funroll-loops -m64 -march=barcelona -O3 -fomit-frame-pointer  20161217  20161026
215838   ref             gcc -funroll-loops -march=barcelona -O2 -fomit-frame-pointer  20161217  20161026
215914   ref             gcc -funroll-loops -m64 -O3 -fomit-frame-pointer  20161217  20161026
216628   ref             gcc -m64 -march=core-avx2 -O3 -fomit-frame-pointer  20161217  20161026
216762   ref             gcc -m64 -march=nocona -O3 -fomit-frame-pointer  20161217  20161026
217120   ref             gcc -march=k8 -O3 -fomit-frame-pointer  20161217  20161026
217190   ref             gcc -m64 -march=k8 -O3 -fomit-frame-pointer  20161217  20161026
217454   ref             gcc -funroll-loops -march=nocona -O2 -fomit-frame-pointer  20161217  20161026
217524   ref             gcc -m64 -march=native -mtune=native -O3 -fomit-frame-pointer  20161217  20161026
218612   ref             gcc -m64 -O3 -fomit-frame-pointer  20161217  20161026
220000   ref             gcc -funroll-loops -O3 -fomit-frame-pointer  20161217  20161026
229024   ref             gcc -m64 -march=barcelona -O2 -fomit-frame-pointer  20161217  20161026
229324   ref             gcc -march=barcelona -O2 -fomit-frame-pointer  20161217  20161026
229534   ref             gcc -m64 -march=core-avx-i -O2 -fomit-frame-pointer  20161217  20161026
231106   ref             gcc -m64 -march=core2 -msse4 -O2 -fomit-frame-pointer  20161217  20161026
231854   ref             gcc -m64 -march=core2 -msse4.1 -O2 -fomit-frame-pointer  20161217  20161026
232708   ref             gcc -m64 -march=corei7-avx -O2 -fomit-frame-pointer  20161217  20161026
232732   ref             gcc -m64 -march=corei7 -O2 -fomit-frame-pointer  20161217  20161026
233356   ref             gcc -march=k8 -O2 -fomit-frame-pointer  20161217  20161026
233668   ref             gcc -m64 -march=core2 -O2 -fomit-frame-pointer  20161217  20161026
233880   ref             gcc -m64 -march=k8 -O2 -fomit-frame-pointer  20161217  20161026
233980   ref             gcc -O2 -fomit-frame-pointer  20161217  20161026
234148   ref             gcc -fno-schedule-insns -O2 -fomit-frame-pointer  20161217  20161026
234840   ref             gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv  20161217  20161026
234878   ref             gcc -m64 -march=native -mtune=native -O2 -fomit-frame-pointer  20161217  20161026
235568   ref             gcc -m64 -march=nocona -O2 -fomit-frame-pointer  20161217  20161026
236058   ref             gcc -m64 -march=core-avx2 -O2 -fomit-frame-pointer  20161217  20161026
236176   ref             gcc -march=nocona -O2 -fomit-frame-pointer  20161217  20161026
236702   ref             gcc -m64 -O2 -fomit-frame-pointer  20161217  20161026
271802   ref             gcc -march=k8 -O -fomit-frame-pointer  20161217  20161026
271824   ref             gcc -m64 -march=barcelona -O -fomit-frame-pointer  20161217  20161026
272296   ref             gcc -march=barcelona -O -fomit-frame-pointer  20161217  20161026
273132   ref             gcc -m64 -march=k8 -O -fomit-frame-pointer  20161217  20161026
273318   ref             gcc -funroll-loops -m64 -march=k8 -O -fomit-frame-pointer  20161217  20161026
273596   ref             gcc -funroll-loops -march=k8 -O -fomit-frame-pointer  20161217  20161026
274608   ref             gcc -funroll-loops -m64 -march=nocona -O -fomit-frame-pointer  20161217  20161026
275444   ref             gcc -funroll-loops -march=nocona -O -fomit-frame-pointer  20161217  20161026
278376   ref             gcc -funroll-loops -m64 -march=barcelona -O -fomit-frame-pointer  20161217  20161026
280412   ref             gcc -funroll-loops -march=barcelona -O -fomit-frame-pointer  20161217  20161026
282056   ref             gcc -O -fomit-frame-pointer  20161217  20161026
282348   ref             gcc -m64 -march=core-avx-i -O -fomit-frame-pointer  20161217  20161026
282648   ref             gcc -fno-schedule-insns -O -fomit-frame-pointer  20161217  20161026
282686   ref             gcc -funroll-loops -O -fomit-frame-pointer  20161217  20161026
283420   ref             gcc -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer  20161217  20161026
283530   ref             gcc -m64 -O -fomit-frame-pointer  20161217  20161026
285906   ref             gcc -funroll-loops -m64 -O -fomit-frame-pointer  20161217  20161026
288330   ref             gcc -m64 -march=core-avx2 -O -fomit-frame-pointer  20161217  20161026
288774   ref             gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv  20161217  20161026
291010   ref             gcc -m64 -march=native -mtune=native -O -fomit-frame-pointer  20161217  20161026
297906   ref             gcc -m64 -march=corei7-avx -O -fomit-frame-pointer  20161217  20161026
298182   ref             gcc -m64 -march=core2 -msse4.1 -O -fomit-frame-pointer  20161217  20161026
298246   ref             gcc -m64 -march=core2 -msse4 -O -fomit-frame-pointer  20161217  20161026
298950   ref             gcc -m64 -march=nocona -O -fomit-frame-pointer  20161217  20161026
299078   ref             gcc -march=nocona -O -fomit-frame-pointer  20161217  20161026
299196   ref             gcc -m64 -march=corei7 -O -fomit-frame-pointer  20161217  20161026
299238   ref             gcc -m64 -march=core2 -O -fomit-frame-pointer  20161217  20161026
398078   ref             gcc -m64 -march=corei7-avx -Os -fomit-frame-pointer  20161217  20161026
400240   ref             gcc -m64 -march=core-avx-i -Os -fomit-frame-pointer  20161217  20161026
409518   ref             gcc -fno-schedule-insns -Os -fomit-frame-pointer  20161217  20161026
411336   ref             gcc -Os -fomit-frame-pointer  20161217  20161026
412864   ref             gcc -m64 -Os -fomit-frame-pointer  20161217  20161026
413600   ref             gcc -m64 -march=core2 -Os -fomit-frame-pointer  20161217  20161026
414184   ref             gcc -m64 -march=core2 -msse4 -Os -fomit-frame-pointer  20161217  20161026
414410   ref             gcc -m64 -march=corei7 -Os -fomit-frame-pointer  20161217  20161026
414464   ref             gcc -march=k8 -Os -fomit-frame-pointer  20161217  20161026
414990   ref             gcc -m64 -march=core2 -msse4.1 -Os -fomit-frame-pointer  20161217  20161026
415528   ref             gcc -m64 -march=k8 -Os -fomit-frame-pointer  20161217  20161026
416590   ref             gcc -march=barcelona -Os -fomit-frame-pointer  20161217  20161026
417344   ref             gcc -m64 -march=barcelona -Os -fomit-frame-pointer  20161217  20161026
440402   ref             gcc -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer  20161217  20161026
443806   ref             gcc -funroll-loops -m64 -Os -fomit-frame-pointer  20161217  20161026
447936   ref             gcc -funroll-loops -Os -fomit-frame-pointer  20161217  20161026
453010   ref             gcc -funroll-loops -m64 -march=k8 -Os -fomit-frame-pointer  20161217  20161026
453762   ref             gcc -funroll-loops -march=k8 -Os -fomit-frame-pointer  20161217  20161026
455448   ref             gcc -m64 -march=core-avx2 -Os -fomit-frame-pointer  20161217  20161026
455982   ref             gcc -funroll-loops -march=barcelona -Os -fomit-frame-pointer  20161217  20161026
456050   ref             gcc -m64 -march=native -mtune=native -Os -fomit-frame-pointer  20161217  20161026
457200   ref             gcc -funroll-loops -m64 -march=barcelona -Os -fomit-frame-pointer  20161217  20161026
464150   ref             gcc -march=nocona -Os -fomit-frame-pointer  20161217  20161026
469084   ref             gcc -m64 -march=nocona -Os -fomit-frame-pointer  20161217  20161026
507068   ref             gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv  20161217  20161026
512704   ref             gcc -funroll-loops -march=nocona -Os -fomit-frame-pointer  20161217  20161026
513804   ref             gcc -funroll-loops -m64 -march=nocona -Os -fomit-frame-pointer  20161217  20161026
1727172  ref             gcc -funroll-loops  20161217  20161026
1732920  ref             cc  20161217  20161026
1739268  ref             gcc  20161217  20161026

Compiler output

Implementation: crypto_aead/trivia0v2/sse4
Compiler: cc
encrypt.c: encrypt.c: In function 'Field_Mult_32':
encrypt.c: encrypt.c:306:4: error: '__builtin_ia32_pclmulqdq128' needs isa option -m32 -mpclmul
encrypt.c: x = _mm_clmulepi64_si128(x,y,0x0);
encrypt.c: ^
encrypt.c: encrypt.c:313:4: error: '__builtin_ia32_pclmulqdq128' needs isa option -m32 -mpclmul
encrypt.c: x = _mm_clmulepi64_si128(x,y,0x0);
encrypt.c: ^
encrypt.c: encrypt.c:319:4: error: '__builtin_ia32_pclmulqdq128' needs isa option -m32 -mpclmul
encrypt.c: x = _mm_clmulepi64_si128(x,y,0x0);
encrypt.c: ^

Number of similar (compiler,implementation) pairs: 3, namely:
Compiler  Implementations
cc sse4
gcc sse4
gcc -funroll-loops sse4

Compiler output

Implementation: crypto_aead/trivia0v2/sse4
Compiler: clang -O3 -fomit-frame-pointer -Qunused-arguments
encrypt.c: encrypt.c:229:1: warning: '/*' within block comment [-Wcomment]
encrypt.c: /*----------------------------------------------------------------*/
encrypt.c: ^
encrypt.c: encrypt.c:306:6: error: '__builtin_ia32_pclmulqdq128' needs target feature pclmul
encrypt.c: x = _mm_clmulepi64_si128(x,y,0x0);
encrypt.c: ^
encrypt.c: /usr/bin/../lib/clang/3.9.0/include/__wmmintrin_pclmul.h:55:13: note: expanded from macro '_mm_clmulepi64_si128'
encrypt.c: ((__m128i)__builtin_ia32_pclmulqdq128((__v2di)(__m128i)(__X), \
encrypt.c: ^
encrypt.c: encrypt.c:313:6: error: '__builtin_ia32_pclmulqdq128' needs target feature pclmul
encrypt.c: x = _mm_clmulepi64_si128(x,y,0x0);
encrypt.c: ^
encrypt.c: /usr/bin/../lib/clang/3.9.0/include/__wmmintrin_pclmul.h:55:13: note: expanded from macro '_mm_clmulepi64_si128'
encrypt.c: ((__m128i)__builtin_ia32_pclmulqdq128((__v2di)(__m128i)(__X), \
encrypt.c: ^
encrypt.c: encrypt.c:319:6: error: '__builtin_ia32_pclmulqdq128' needs target feature pclmul
encrypt.c: x = _mm_clmulepi64_si128(x,y,0x0);
encrypt.c: ^
encrypt.c: /usr/bin/../lib/clang/3.9.0/include/__wmmintrin_pclmul.h:55:13: note: expanded from macro '_mm_clmulepi64_si128'
encrypt.c: ((__m128i)__builtin_ia32_pclmulqdq128((__v2di)(__m128i)(__X), \
encrypt.c: ^
encrypt.c: 1 warning and 3 errors generated.

Number of similar (compiler,implementation) pairs: 6, namely:
Compiler  Implementations
clang -O3 -fomit-frame-pointer -Qunused-arguments sse4
clang -O3 -fwrapv -mavx2 -fomit-frame-pointer -Qunused-arguments sse4
clang -O3 -fwrapv -mavx -fomit-frame-pointer -Qunused-arguments sse4
clang -mcpu=cortex-a8 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments sse4
clang -mcpu=cortex-a9 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments sse4
clang -mcpu=native -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments sse4

Compiler output

Implementation: crypto_aead/trivia0v2/sse4
Compiler: clang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments
encrypt.c: encrypt.c:229:1: warning: '/*' within block comment [-Wcomment]
encrypt.c: /*----------------------------------------------------------------*/
encrypt.c: ^
encrypt.c: 1 warning generated.

Number of similar (compiler,implementation) pairs: 4, namely:
Compiler  Implementations
clang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments sse4
clang -O3 -fwrapv -march=x86-64 -mcpu=core-avx2 -mavx2 -maes -mpclmul -fomit-frame-pointer -Qunused-arguments sse4
clang -O3 -fwrapv -mavx -maes -mpclmul -fomit-frame-pointer -Qunused-arguments sse4
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments sse4

Compiler output

Implementation: crypto_aead/trivia0v2/sse4
Compiler: gcc -O2 -fomit-frame-pointer
encrypt.c: In file included from encrypt.c:26:0:
encrypt.c: encrypt.c: In function 'Field_Mult_32':
encrypt.c: /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/smmintrin.h:447:1: error: inlining failed in call to always_inline '_mm_extract_epi32': target specific option mismatch
encrypt.c: _mm_extract_epi32 (__m128i __X, const int __N)
encrypt.c: ^~~~~~~~~~~~~~~~~
encrypt.c: encrypt.c:321:6: note: called from here
encrypt.c: b = _mm_extract_epi32(x, 0x0);
encrypt.c: ^~~~~~~~~~~~~~~~~~~~~~~~~
encrypt.c: In file included from encrypt.c:24:0:
encrypt.c: /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline '_mm_clmulepi64_si128': target specific option mismatch
encrypt.c: _mm_clmulepi64_si128 (__m128i __X, __m128i __Y, const int __I)
encrypt.c: ^~~~~~~~~~~~~~~~~~~~
encrypt.c: encrypt.c:319:4: note: called from here
encrypt.c: x = _mm_clmulepi64_si128(x,y,0x0);
encrypt.c: ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
encrypt.c: In file included from encrypt.c:26:0:
encrypt.c: /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/smmintrin.h:447:1: error: inlining failed in call to always_inline '_mm_extract_epi32': target specific option mismatch
encrypt.c: _mm_extract_epi32 (__m128i __X, const int __N)
encrypt.c: ^~~~~~~~~~~~~~~~~
encrypt.c: encrypt.c:315:6: note: called from here
encrypt.c: b = _mm_extract_epi32(x, 0x1);
encrypt.c: ^~~~~~~~~~~~~~~~~~~~~~~~~
encrypt.c: In file included from encrypt.c:24:0:
encrypt.c: /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline '_mm_clmulepi64_si128': target specific option mismatch
encrypt.c: _mm_clmulepi64_si128 (__m128i __X, __m128i __Y, const int __I)
encrypt.c: ...

Number of similar (compiler,implementation) pairs: 72, namely:
Compiler  Implementations
gcc -O2 -fomit-frame-pointer sse4
gcc -O3 -fomit-frame-pointer sse4
gcc -O -fomit-frame-pointer sse4
gcc -Os -fomit-frame-pointer sse4
gcc -fno-schedule-insns -O2 -fomit-frame-pointer sse4
gcc -fno-schedule-insns -O3 -fomit-frame-pointer sse4
gcc -fno-schedule-insns -O -fomit-frame-pointer sse4
gcc -fno-schedule-insns -Os -fomit-frame-pointer sse4
gcc -funroll-loops -O2 -fomit-frame-pointer sse4
gcc -funroll-loops -O3 -fomit-frame-pointer sse4
gcc -funroll-loops -O -fomit-frame-pointer sse4
gcc -funroll-loops -Os -fomit-frame-pointer sse4
gcc -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer sse4
gcc -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer sse4
gcc -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer sse4
gcc -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer sse4
gcc -funroll-loops -m64 -O2 -fomit-frame-pointer sse4
gcc -funroll-loops -m64 -O3 -fomit-frame-pointer sse4
gcc -funroll-loops -m64 -O -fomit-frame-pointer sse4
gcc -funroll-loops -m64 -Os -fomit-frame-pointer sse4
gcc -funroll-loops -m64 -march=barcelona -O2 -fomit-frame-pointer sse4
gcc -funroll-loops -m64 -march=barcelona -O3 -fomit-frame-pointer sse4
gcc -funroll-loops -m64 -march=barcelona -O -fomit-frame-pointer sse4
gcc -funroll-loops -m64 -march=barcelona -Os -fomit-frame-pointer sse4
gcc -funroll-loops -m64 -march=k8 -O2 -fomit-frame-pointer sse4
gcc -funroll-loops -m64 -march=k8 -O3 -fomit-frame-pointer sse4
gcc -funroll-loops -m64 -march=k8 -O -fomit-frame-pointer sse4
gcc -funroll-loops -m64 -march=k8 -Os -fomit-frame-pointer sse4
gcc -funroll-loops -m64 -march=nocona -O2 -fomit-frame-pointer sse4
gcc -funroll-loops -m64 -march=nocona -O3 -fomit-frame-pointer sse4
gcc -funroll-loops -m64 -march=nocona -O -fomit-frame-pointer sse4
gcc -funroll-loops -m64 -march=nocona -Os -fomit-frame-pointer sse4
gcc -funroll-loops -march=barcelona -O2 -fomit-frame-pointer sse4
gcc -funroll-loops -march=barcelona -O3 -fomit-frame-pointer sse4
gcc -funroll-loops -march=barcelona -O -fomit-frame-pointer sse4
gcc -funroll-loops -march=barcelona -Os -fomit-frame-pointer sse4
gcc -funroll-loops -march=k8 -O2 -fomit-frame-pointer sse4
gcc -funroll-loops -march=k8 -O3 -fomit-frame-pointer sse4
gcc -funroll-loops -march=k8 -O -fomit-frame-pointer sse4
gcc -funroll-loops -march=k8 -Os -fomit-frame-pointer sse4
gcc -funroll-loops -march=nocona -O2 -fomit-frame-pointer sse4
gcc -funroll-loops -march=nocona -O3 -fomit-frame-pointer sse4
gcc -funroll-loops -march=nocona -O -fomit-frame-pointer sse4
gcc -funroll-loops -march=nocona -Os -fomit-frame-pointer sse4
gcc -m64 -O2 -fomit-frame-pointer sse4
gcc -m64 -O3 -fomit-frame-pointer sse4
gcc -m64 -O -fomit-frame-pointer sse4
gcc -m64 -Os -fomit-frame-pointer sse4
gcc -m64 -march=core2 -O2 -fomit-frame-pointer sse4
gcc -m64 -march=core2 -O3 -fomit-frame-pointer sse4
gcc -m64 -march=core2 -O -fomit-frame-pointer sse4
gcc -m64 -march=core2 -Os -fomit-frame-pointer sse4
gcc -m64 -march=k8 -O2 -fomit-frame-pointer sse4
gcc -m64 -march=k8 -O3 -fomit-frame-pointer sse4
gcc -m64 -march=k8 -O -fomit-frame-pointer sse4
gcc -m64 -march=k8 -Os -fomit-frame-pointer sse4
gcc -m64 -march=nocona -O2 -fomit-frame-pointer sse4
gcc -m64 -march=nocona -O3 -fomit-frame-pointer sse4
gcc -m64 -march=nocona -O -fomit-frame-pointer sse4
gcc -m64 -march=nocona -Os -fomit-frame-pointer sse4
gcc -march=barcelona -O2 -fomit-frame-pointer sse4
gcc -march=barcelona -O3 -fomit-frame-pointer sse4
gcc -march=barcelona -O -fomit-frame-pointer sse4
gcc -march=barcelona -Os -fomit-frame-pointer sse4
gcc -march=k8 -O2 -fomit-frame-pointer sse4
gcc -march=k8 -O3 -fomit-frame-pointer sse4
gcc -march=k8 -O -fomit-frame-pointer sse4
gcc -march=k8 -Os -fomit-frame-pointer sse4
gcc -march=nocona -O2 -fomit-frame-pointer sse4
gcc -march=nocona -O3 -fomit-frame-pointer sse4
gcc -march=nocona -O -fomit-frame-pointer sse4
gcc -march=nocona -Os -fomit-frame-pointer sse4

Compiler output

Implementation: crypto_aead/trivia0v2/sse4
Compiler: gcc -m64 -march=barcelona -O2 -fomit-frame-pointer
encrypt.c: In file included from encrypt.c:26:0:
encrypt.c: encrypt.c: In function 'Field_Mult_32':
encrypt.c: /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/smmintrin.h:447:1: error: inlining failed in call to always_inline '_mm_extract_epi32': target specific option mismatch
encrypt.c: _mm_extract_epi32 (__m128i __X, const int __N)
encrypt.c: ^~~~~~~~~~~~~~~~~
encrypt.c: encrypt.c:321:6: note: called from here
encrypt.c: b = _mm_extract_epi32(x, 0x0);
encrypt.c: ^~~~~~~~~~~~~~~~~~~~~~~~~
encrypt.c: In file included from encrypt.c:24:0:
encrypt.c: /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline '_mm_clmulepi64_si128': target specific option mismatch
encrypt.c: _mm_clmulepi64_si128 (__m128i __X, __m128i __Y, const int __I)
encrypt.c: ^~~~~~~~~~~~~~~~~~~~
encrypt.c: encrypt.c:319:4: note: called from here
encrypt.c: x = _mm_clmulepi64_si128(x,y,0x0);
encrypt.c: ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
encrypt.c: In file included from encrypt.c:26:0:
encrypt.c: /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/smmintrin.h:447:1: error: inlining failed in call to always_inline '_mm_extract_epi32': target specific option mismatch
encrypt.c: _mm_extract_epi32 (__m128i __X, const int __N)
encrypt.c: ^~~~~~~~~~~~~~~~~
encrypt.c: encrypt.c:315:6: note: called from here
encrypt.c: b = _mm_extract_epi32(x, 0x1);
encrypt.c: ^~~~~~~~~~~~~~~~~~~~~~~~~
encrypt.c: In file included from encrypt.c:24:0:
encrypt.c: /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline '_mm_clmulepi64_si128': target specific option mismatch
encrypt.c: _mm_clmulepi64_si128 (__m128i __X, __m128i __Y, const int __I)
encrypt.c: ...
encrypt.c: In file included from encrypt.c:26:0:
encrypt.c: encrypt.c: In function 'Field_Mult_32':
encrypt.c: /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/smmintrin.h:447:1: error: inlining failed in call to always_inline '_mm_extract_epi32': target specific option mismatch
encrypt.c: _mm_extract_epi32 (__m128i __X, const int __N)
encrypt.c: ^~~~~~~~~~~~~~~~~
encrypt.c: encrypt.c:321:6: note: called from here
encrypt.c: b = _mm_extract_epi32(x, 0x0);
encrypt.c: ^~~~~~~~~~~~~~~~~~~~~~~~~
encrypt.c: In file included from encrypt.c:24:0:
encrypt.c: /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline '_mm_clmulepi64_si128': target specific option mismatch
encrypt.c: _mm_clmulepi64_si128 (__m128i __X, __m128i __Y, const int __I)
encrypt.c: ^~~~~~~~~~~~~~~~~~~~
encrypt.c: encrypt.c:319:4: note: called from here
encrypt.c: x = _mm_clmulepi64_si128(x,y,0x0);
encrypt.c: ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
encrypt.c: In file included from encrypt.c:26:0:
encrypt.c: /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/smmintrin.h:447:1: error: inlining failed in call to always_inline '_mm_extract_epi32': target specific option mismatch
encrypt.c: _mm_extract_epi32 (__m128i __X, const int __N)
encrypt.c: ^~~~~~~~~~~~~~~~~
encrypt.c: encrypt.c:315:6: note: called from here
encrypt.c: b = _mm_extract_epi32(x, 0x1);
encrypt.c: ^~~~~~~~~~~~~~~~~~~~~~~~~
encrypt.c: In file included from encrypt.c:24:0:
encrypt.c: /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline '_mm_clmulepi64_si128': target specific option mismatch
encrypt.c: _mm_clmulepi64_si128 (__m128i __X, __m128i __Y, const int __I)
encrypt.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
Compiler  Implementations
gcc -m64 -march=barcelona -O2 -fomit-frame-pointer sse4
gcc -m64 -march=barcelona -O3 -fomit-frame-pointer sse4
gcc -m64 -march=barcelona -O -fomit-frame-pointer sse4
gcc -m64 -march=barcelona -Os -fomit-frame-pointer sse4

Compiler output

Implementation: crypto_aead/trivia0v2/sse4
Compiler: gcc -m64 -march=core2 -msse4.1 -O2 -fomit-frame-pointer
encrypt.c: In file included from encrypt.c:24:0:
encrypt.c: encrypt.c: In function 'Field_Mult_32':
encrypt.c: /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline '_mm_clmulepi64_si128': target specific option mismatch
encrypt.c: _mm_clmulepi64_si128 (__m128i __X, __m128i __Y, const int __I)
encrypt.c: ^~~~~~~~~~~~~~~~~~~~
encrypt.c: encrypt.c:319:4: note: called from here
encrypt.c: x = _mm_clmulepi64_si128(x,y,0x0);
encrypt.c: ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
encrypt.c: In file included from encrypt.c:24:0:
encrypt.c: /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline '_mm_clmulepi64_si128': target specific option mismatch
encrypt.c: _mm_clmulepi64_si128 (__m128i __X, __m128i __Y, const int __I)
encrypt.c: ^~~~~~~~~~~~~~~~~~~~
encrypt.c: encrypt.c:313:4: note: called from here
encrypt.c: x = _mm_clmulepi64_si128(x,y,0x0);
encrypt.c: ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
encrypt.c: In file included from encrypt.c:24:0:
encrypt.c: /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline '_mm_clmulepi64_si128': target specific option mismatch
encrypt.c: _mm_clmulepi64_si128 (__m128i __X, __m128i __Y, const int __I)
encrypt.c: ^~~~~~~~~~~~~~~~~~~~
encrypt.c: encrypt.c:306:4: note: called from here
encrypt.c: x = _mm_clmulepi64_si128(x,y,0x0);
encrypt.c: ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
encrypt.c: In file included from encrypt.c:24:0:
encrypt.c: /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline '_mm_clmulepi64_si128': target specific option mismatch
encrypt.c: _mm_clmulepi64_si128 (__m128i __X, __m128i __Y, const int __I)
encrypt.c: ...

Number of similar (compiler,implementation) pairs: 12, namely:
Compiler  Implementations
gcc -m64 -march=core2 -msse4.1 -O2 -fomit-frame-pointer sse4
gcc -m64 -march=core2 -msse4.1 -O3 -fomit-frame-pointer sse4
gcc -m64 -march=core2 -msse4.1 -O -fomit-frame-pointer sse4
gcc -m64 -march=core2 -msse4.1 -Os -fomit-frame-pointer sse4
gcc -m64 -march=core2 -msse4 -O2 -fomit-frame-pointer sse4
gcc -m64 -march=core2 -msse4 -O3 -fomit-frame-pointer sse4
gcc -m64 -march=core2 -msse4 -O -fomit-frame-pointer sse4
gcc -m64 -march=core2 -msse4 -Os -fomit-frame-pointer sse4
gcc -m64 -march=corei7 -O2 -fomit-frame-pointer sse4
gcc -m64 -march=corei7 -O3 -fomit-frame-pointer sse4
gcc -m64 -march=corei7 -O -fomit-frame-pointer sse4
gcc -m64 -march=corei7 -Os -fomit-frame-pointer sse4