Implementation notes: amd64, rumba7, crypto_dh/gls254prot

Computer: rumba7
Architecture: amd64
CPU ID: AuthenticAMD-00800f11-178bfbff
SUPERCOP version: 20171218
Operation: crypto_dh
Primitive: gls254prot
Time | Implementation | Compiler | Benchmark date | SUPERCOP version
182880 | opt | clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments | 20180530 | 20171218
183150 | opt | clang -O3 -fwrapv -mavx -maes -mpclmul -fomit-frame-pointer -Qunused-arguments | 20180530 | 20171218
183180 | opt | clang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments | 20180530 | 20171218
183750 | opt | clang -O3 -fwrapv -march=x86-64 -mcpu=core-avx2 -mavx2 -maes -mpclmul -fomit-frame-pointer -Qunused-arguments | 20180530 | 20171218
188730 | opt | gcc -m64 -march=core-avx2 -O3 -fomit-frame-pointer | 20180530 | 20171218
188820 | opt | gcc -m64 -march=corei7-avx -O3 -fomit-frame-pointer | 20180530 | 20171218
188940 | opt | gcc -m64 -march=core-avx-i -O3 -fomit-frame-pointer | 20180530 | 20171218
208560 | opt | gcc -m64 -march=core-avx-i -O2 -fomit-frame-pointer | 20180530 | 20171218
208890 | opt | gcc -m64 -march=core-avx2 -O2 -fomit-frame-pointer | 20180530 | 20171218
208950 | opt | gcc -m64 -march=corei7-avx -O2 -fomit-frame-pointer | 20180530 | 20171218
209310 | opt | gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv | 20180530 | 20171218
209790 | opt | gcc -m64 -march=native -mtune=native -O3 -fomit-frame-pointer | 20180530 | 20171218
224280 | opt | gcc -m64 -march=core-avx2 -Os -fomit-frame-pointer | 20180530 | 20171218
224940 | opt | gcc -m64 -march=core-avx-i -Os -fomit-frame-pointer | 20180530 | 20171218
225210 | opt | gcc -m64 -march=corei7-avx -Os -fomit-frame-pointer | 20180530 | 20171218
227460 | opt | gcc -m64 -march=core-avx2 -O -fomit-frame-pointer | 20180530 | 20171218
227490 | opt | gcc -m64 -march=core-avx-i -O -fomit-frame-pointer | 20180530 | 20171218
228090 | opt | gcc -m64 -march=corei7-avx -O -fomit-frame-pointer | 20180530 | 20171218
228390 | prot | gcc -m64 -march=core-avx-i -O3 -fomit-frame-pointer | 20180530 | 20171218
228900 | prot | gcc -m64 -march=corei7-avx -O3 -fomit-frame-pointer | 20180530 | 20171218
229470 | prot | gcc -m64 -march=core-avx2 -O3 -fomit-frame-pointer | 20180530 | 20171218
237630 | opt | gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv | 20180530 | 20171218
238290 | opt | gcc -m64 -march=native -mtune=native -O2 -fomit-frame-pointer | 20180530 | 20171218
244380 | opt | gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv | 20180530 | 20171218
245160 | opt | gcc -m64 -march=native -mtune=native -O -fomit-frame-pointer | 20180530 | 20171218
245790 | opt | gcc -m64 -march=native -mtune=native -Os -fomit-frame-pointer | 20180530 | 20171218
245820 | opt | gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv | 20180530 | 20171218
246450 | prot | gcc -m64 -march=core-avx-i -O -fomit-frame-pointer | 20180530 | 20171218
246810 | prot | gcc -m64 -march=core-avx-i -O2 -fomit-frame-pointer | 20180530 | 20171218
247470 | prot | gcc -m64 -march=corei7-avx -O2 -fomit-frame-pointer | 20180530 | 20171218
247620 | prot | gcc -m64 -march=corei7-avx -O -fomit-frame-pointer | 20180530 | 20171218
247680 | prot | gcc -m64 -march=core-avx2 -O2 -fomit-frame-pointer | 20180530 | 20171218
248250 | prot | gcc -m64 -march=core-avx2 -O -fomit-frame-pointer | 20180530 | 20171218
254130 | prot | gcc -m64 -march=native -mtune=native -O3 -fomit-frame-pointer | 20180530 | 20171218
254490 | prot | gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv | 20180530 | 20171218
275460 | prot | gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv | 20180530 | 20171218
275640 | prot | gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv | 20180530 | 20171218
276450 | prot | gcc -m64 -march=native -mtune=native -O -fomit-frame-pointer | 20180530 | 20171218
277260 | prot | gcc -m64 -march=native -mtune=native -O2 -fomit-frame-pointer | 20180530 | 20171218

Test failure

Implementation: crypto_dh/gls254prot/prot
Compiler: clang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments
error 111

Number of similar (compiler,implementation) pairs: 9, namely:
Compiler Implementations
clang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments prot
clang -O3 -fwrapv -march=x86-64 -mcpu=core-avx2 -mavx2 -maes -mpclmul -fomit-frame-pointer -Qunused-arguments prot
clang -O3 -fwrapv -mavx -maes -mpclmul -fomit-frame-pointer -Qunused-arguments prot
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments prot
gcc -m64 -march=core-avx-i -Os -fomit-frame-pointer prot
gcc -m64 -march=core-avx2 -Os -fomit-frame-pointer prot
gcc -m64 -march=corei7-avx -Os -fomit-frame-pointer prot
gcc -m64 -march=native -mtune=native -Os -fomit-frame-pointer prot
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv prot

Compiler output

Implementation: crypto_dh/gls254prot/opt
Compiler: cc
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:41:0,
dh.c: from lib.h:2,
dh.c: from dh.c:2:
dh.c: smu.h: In function 'smu_5nf_dna_ltr':
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/smmintrin.h:268:1: error: inlining failed in call to always_inline '_mm_cmpeq_epi64': target specific option mismatch
dh.c: _mm_cmpeq_epi64 (__m128i __X, __m128i __Y)
dh.c: ^
dh.c: In file included from dh.c:8:0:
dh.c: smu.h:251:15: error: called from here
dh.c: msk1[7] = _mm_cmpeq_epi64(cmp[7], dig1);
dh.c: ^
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:41:0,
dh.c: from lib.h:2,
dh.c: from dh.c:2:
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/smmintrin.h:268:1: error: inlining failed in call to always_inline '_mm_cmpeq_epi64': target specific option mismatch
dh.c: _mm_cmpeq_epi64 (__m128i __X, __m128i __Y)
dh.c: ^
dh.c: In file included from dh.c:8:0:
dh.c: smu.h:250:15: error: called from here
dh.c: msk0[7] = _mm_cmpeq_epi64(cmp[7], dig0);
dh.c: ^
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:41:0,
dh.c: from lib.h:2,
dh.c: from dh.c:2:
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/smmintrin.h:268:1: error: inlining failed in call to always_inline '_mm_cmpeq_epi64': target specific option mismatch
dh.c: ...

Number of similar (compiler,implementation) pairs: 3, namely:
Compiler Implementations
cc opt
gcc opt
gcc -funroll-loops opt

Compiler output

Implementation: crypto_dh/gls254prot/prot
Compiler: cc
dh.c: In file included from dh.c:9:0:
dh.c: scmul.incl: In function 'scmul_rand_wreg':
dh.c: scmul.incl:273:17: warning: implicit declaration of function 'types_cond_copy2' [-Wimplicit-function-declaration]
dh.c: types_cond_copy2(tab0, tab4, tabx, n1, n2);
dh.c: ^
dh.c: scmul.incl:284:9: warning: implicit declaration of function 'types_cond_copy' [-Wimplicit-function-declaration]
dh.c: types_cond_copy(QX, tab0, o1);
dh.c: ^
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/immintrin.h:35:0,
dh.c: from gls254.h:20,
dh.c: from dh.c:3:
dh.c: low.incl: In function 'low_sq':
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/tmmintrin.h:136:1: error: inlining failed in call to always_inline '_mm_shuffle_epi8': target specific option mismatch
dh.c: _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
dh.c: ^
dh.c: In file included from dh.c:3:0:
dh.c: gls254.h:168:5: error: called from here
dh.c: t0 = _mm_shuffle_epi8(sq, t0);\
dh.c: ^
dh.c: low.incl:123:2: note: in expansion of macro 'SQUARE'
dh.c: SQUARE(m0, m1, sq, mask);
dh.c: ^
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/immintrin.h:35:0,
dh.c: from gls254.h:20,
dh.c: from dh.c:3:
dh.c: ...

Number of similar (compiler,implementation) pairs: 3, namely:
Compiler Implementations
cc prot
gcc prot
gcc -funroll-loops prot

Compiler output

Implementation: crypto_dh/gls254prot/opt
Compiler: clang -O3 -fomit-frame-pointer -Qunused-arguments
dh.c: In file included from dh.c:6:
dh.c: ./ffa.h:94:5: error: '__builtin_ia32_pclmulqdq128' needs target feature pclmul
dh.c: low_kts_stp(op00,op10,k00,k02,op01,op11,r00,r01,r02,0x00); /* a0xb0 */
dh.c: ^
dh.c: ./ffa.h:62:11: note: expanded from macro 'low_kts_stp'
dh.c: re0 = _mm_clmulepi64_si128(op0, op1, ord);\
dh.c: ^
dh.c: /usr/lib/llvm-3.8/bin/../lib/clang/3.8.0/include/__wmmintrin_pclmul.h:27:13: note: expanded from macro '_mm_clmulepi64_si128'
dh.c: ((__m128i)__builtin_ia32_pclmulqdq128((__v2di)(__m128i)(__X), \
dh.c: ^
dh.c: In file included from dh.c:6:
dh.c: ./ffa.h:94:5: error: '__builtin_ia32_pclmulqdq128' needs target feature pclmul
dh.c: ./ffa.h:63:11: note: expanded from macro 'low_kts_stp'
dh.c: re1 = _mm_clmulepi64_si128(op2, op3, ord);\
dh.c: ^
dh.c: /usr/lib/llvm-3.8/bin/../lib/clang/3.8.0/include/__wmmintrin_pclmul.h:27:13: note: expanded from macro '_mm_clmulepi64_si128'
dh.c: ((__m128i)__builtin_ia32_pclmulqdq128((__v2di)(__m128i)(__X), \
dh.c: ^
dh.c: In file included from dh.c:6:
dh.c: ./ffa.h:94:5: error: '__builtin_ia32_pclmulqdq128' needs target feature pclmul
dh.c: ./ffa.h:64:11: note: expanded from macro 'low_kts_stp'
dh.c: re2 = _mm_clmulepi64_si128(op4, op5, ord);\
dh.c: ^
dh.c: /usr/lib/llvm-3.8/bin/../lib/clang/3.8.0/include/__wmmintrin_pclmul.h:27:13: note: expanded from macro '_mm_clmulepi64_si128'
dh.c: ((__m128i)__builtin_ia32_pclmulqdq128((__v2di)(__m128i)(__X), \
dh.c: ...

Number of similar (compiler,implementation) pairs: 6, namely:
Compiler Implementations
clang -O3 -fomit-frame-pointer -Qunused-arguments opt
clang -O3 -fwrapv -mavx2 -fomit-frame-pointer -Qunused-arguments opt
clang -O3 -fwrapv -mavx -fomit-frame-pointer -Qunused-arguments opt
clang -mcpu=cortex-a8 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments opt
clang -mcpu=cortex-a9 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments opt
clang -mcpu=native -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments opt
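
The `needs target feature pclmul` diagnostics above come from compiler invocations that do not enable PCLMUL (no `-mpclmul` and no `-march` value that implies it). One common way to keep such a file compiling, sketched here as an assumption and not taken from the gls254prot sources, is to guard the intrinsic behind the predefined `__PCLMUL__` macro and fall back to portable code. The helper name `clmul64` is hypothetical, and the fallback is a plain shift-and-xor loop meant for illustration, not for speed.

```c
#include <stdint.h>

#if defined(__PCLMUL__) && defined(__x86_64__)
#include <emmintrin.h>  /* SSE2: _mm_set_epi64x, _mm_cvtsi128_si64, _mm_srli_si128 */
#include <wmmintrin.h>  /* PCLMUL: _mm_clmulepi64_si128 */
#endif

/* Hypothetical helper (not part of gls254prot): carry-less multiplication
   of two 64-bit polynomials over GF(2), giving a 128-bit product. */
static void clmul64(uint64_t a, uint64_t b, uint64_t *lo, uint64_t *hi)
{
#if defined(__PCLMUL__) && defined(__x86_64__)
    /* Only compiled when the command line enables PCLMUL, so the
       "target feature" / "option mismatch" errors cannot occur here. */
    __m128i r = _mm_clmulepi64_si128(_mm_set_epi64x(0, (long long)a),
                                     _mm_set_epi64x(0, (long long)b), 0x00);
    *lo = (uint64_t)_mm_cvtsi128_si64(r);
    *hi = (uint64_t)_mm_cvtsi128_si64(_mm_srli_si128(r, 8));
#else
    /* Portable fallback: add (xor) a shifted copy of a for every set bit of b. */
    uint64_t l = 0, h = 0;
    for (int i = 0; i < 64; i++) {
        uint64_t bit_mask = 0 - ((b >> i) & 1); /* all ones if bit i of b is set */
        l ^= (a << i) & bit_mask;
        if (i != 0)                              /* avoid an undefined shift by 64 */
            h ^= (a >> (64 - i)) & bit_mask;
    }
    *lo = l;
    *hi = h;
#endif
}
```

Both branches compute the same product, so a build without `-mpclmul` produces slower but working code instead of failing at compile time.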

Compiler output

Implementation: crypto_dh/gls254prot/prot
Compiler: clang -O3 -fomit-frame-pointer -Qunused-arguments
dh.c: In file included from dh.c:7:
dh.c: ./low.incl:26:2: error: '__builtin_ia32_pclmulqdq128' needs target feature pclmul
dh.c: MUL4(m1, m0, ma, mb);
dh.c: ^
dh.c: ./gls254.h:155:7: note: expanded from macro 'MUL4'
dh.c: r0 = _mm_clmulepi64_si128(ma, mb, 0x00);\
dh.c: ^
dh.c: /usr/lib/llvm-3.8/bin/../lib/clang/3.8.0/include/__wmmintrin_pclmul.h:27:13: note: expanded from macro '_mm_clmulepi64_si128'
dh.c: ((__m128i)__builtin_ia32_pclmulqdq128((__v2di)(__m128i)(__X), \
dh.c: ^
dh.c: In file included from dh.c:7:
dh.c: ./low.incl:26:2: error: '__builtin_ia32_pclmulqdq128' needs target feature pclmul
dh.c: ./gls254.h:156:7: note: expanded from macro 'MUL4'
dh.c: r1 = _mm_clmulepi64_si128(ma, mb, 0x11);\
dh.c: ^
dh.c: /usr/lib/llvm-3.8/bin/../lib/clang/3.8.0/include/__wmmintrin_pclmul.h:27:13: note: expanded from macro '_mm_clmulepi64_si128'
dh.c: ((__m128i)__builtin_ia32_pclmulqdq128((__v2di)(__m128i)(__X), \
dh.c: ^
dh.c: In file included from dh.c:7:
dh.c: ./low.incl:26:2: error: '__builtin_ia32_pclmulqdq128' needs target feature pclmul
dh.c: ./gls254.h:157:7: note: expanded from macro 'MUL4'
dh.c: t0 = _mm_clmulepi64_si128(t0, t0, 0x10);\
dh.c: ^
dh.c: /usr/lib/llvm-3.8/bin/../lib/clang/3.8.0/include/__wmmintrin_pclmul.h:27:13: note: expanded from macro '_mm_clmulepi64_si128'
dh.c: ((__m128i)__builtin_ia32_pclmulqdq128((__v2di)(__m128i)(__X), \
dh.c: ...

Number of similar (compiler,implementation) pairs: 6, namely:
Compiler Implementations
clang -O3 -fomit-frame-pointer -Qunused-arguments prot
clang -O3 -fwrapv -mavx2 -fomit-frame-pointer -Qunused-arguments prot
clang -O3 -fwrapv -mavx -fomit-frame-pointer -Qunused-arguments prot
clang -mcpu=cortex-a8 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments prot
clang -mcpu=cortex-a9 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments prot
clang -mcpu=native -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments prot

Compiler output

Implementation: crypto_dh/gls254prot/prot
Compiler: clang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments
dh.c: In file included from dh.c:9:
dh.c: ./scmul.incl:273:17: warning: implicit declaration of function 'types_cond_copy2' is invalid in C99 [-Wimplicit-function-declaration]
dh.c: types_cond_copy2(tab0, tab4, tabx, n1, n2);
dh.c: ^
dh.c: ./scmul.incl:284:9: warning: implicit declaration of function 'types_cond_copy' is invalid in C99 [-Wimplicit-function-declaration]
dh.c: types_cond_copy(QX, tab0, o1);
dh.c: ^
dh.c: 2 warnings generated.

Number of similar (compiler,implementation) pairs: 4, namely:
Compiler Implementations
clang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments prot
clang -O3 -fwrapv -march=x86-64 -mcpu=core-avx2 -mavx2 -maes -mpclmul -fomit-frame-pointer -Qunused-arguments prot
clang -O3 -fwrapv -mavx -maes -mpclmul -fomit-frame-pointer -Qunused-arguments prot
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments prot
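
The `-Wimplicit-function-declaration` warnings above mean that `types_cond_copy2` and `types_cond_copy` are called at a point where no declaration of them is visible. The usual remedy is to place a prototype before the first call. The sketch below shows that pattern with a hypothetical masked conditional copy; the names `cond_copy_u64` and `select_entry` and their signatures are assumptions for illustration, since this report does not show the real signatures.

```c
#include <stddef.h>
#include <stdint.h>

/* Prototype placed before the first use, so the compiler never has to
   guess an implicit declaration. Name and signature are hypothetical. */
static void cond_copy_u64(uint64_t *dst, const uint64_t *src, size_t n,
                          uint64_t flag);

/* Hypothetical caller standing in for code that selects a table entry. */
static void select_entry(uint64_t *out, const uint64_t *entry, size_t n,
                         uint64_t take)
{
    cond_copy_u64(out, entry, n, take);
}

/* Copies src into dst when the low bit of flag is 1 and leaves dst
   unchanged when it is 0, using a mask instead of a branch on flag. */
static void cond_copy_u64(uint64_t *dst, const uint64_t *src, size_t n,
                          uint64_t flag)
{
    uint64_t mask = 0 - (flag & 1); /* all ones if flag is 1, else zero */
    for (size_t i = 0; i < n; i++)
        dst[i] ^= mask & (dst[i] ^ src[i]);
}
```

With the prototype in place, a mismatch between the call and the definition becomes a compile-time error instead of a warning.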

Compiler output

Implementation: crypto_dh/gls254prot/opt
Compiler: gcc -O2 -fomit-frame-pointer
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:43:0,
dh.c: from lib.h:2,
dh.c: from dh.c:2:
dh.c: ffa.h: In function 'low_mul':
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline '_mm_clmulepi64_si128': target specific option mismatch
dh.c: _mm_clmulepi64_si128 (__m128i __X, __m128i __Y, const int __I)
dh.c: ^
dh.c: In file included from dh.c:6:0:
dh.c: ffa.h:64:9: error: called from here
dh.c: re2 = _mm_clmulepi64_si128(op4, op5, ord);\
dh.c: ^
dh.c: ffa.h:96:5: note: in expansion of macro 'low_kts_stp'
dh.c: low_kts_stp(op00,op10,k00,k02,op01,op11,r20,r21,r22,0x11); /* a1xb1 */
dh.c: ^
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:43:0,
dh.c: from lib.h:2,
dh.c: from dh.c:2:
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline '_mm_clmulepi64_si128': target specific option mismatch
dh.c: _mm_clmulepi64_si128 (__m128i __X, __m128i __Y, const int __I)
dh.c: ^
dh.c: In file included from dh.c:6:0:
dh.c: ffa.h:63:9: error: called from here
dh.c: re1 = _mm_clmulepi64_si128(op2, op3, ord);\
dh.c: ^
dh.c: ffa.h:96:5: note: in expansion of macro 'low_kts_stp'
dh.c: ...

Number of similar (compiler,implementation) pairs: 84, namely:
Compiler Implementations
gcc -O2 -fomit-frame-pointer opt
gcc -O3 -fomit-frame-pointer opt
gcc -O -fomit-frame-pointer opt
gcc -Os -fomit-frame-pointer opt
gcc -fno-schedule-insns -O2 -fomit-frame-pointer opt
gcc -fno-schedule-insns -O3 -fomit-frame-pointer opt
gcc -fno-schedule-insns -O -fomit-frame-pointer opt
gcc -fno-schedule-insns -Os -fomit-frame-pointer opt
gcc -funroll-loops -O2 -fomit-frame-pointer opt
gcc -funroll-loops -O3 -fomit-frame-pointer opt
gcc -funroll-loops -O -fomit-frame-pointer opt
gcc -funroll-loops -Os -fomit-frame-pointer opt
gcc -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer opt
gcc -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer opt
gcc -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer opt
gcc -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer opt
gcc -funroll-loops -m64 -O2 -fomit-frame-pointer opt
gcc -funroll-loops -m64 -O3 -fomit-frame-pointer opt
gcc -funroll-loops -m64 -O -fomit-frame-pointer opt
gcc -funroll-loops -m64 -Os -fomit-frame-pointer opt
gcc -funroll-loops -m64 -march=barcelona -O2 -fomit-frame-pointer opt
gcc -funroll-loops -m64 -march=barcelona -O3 -fomit-frame-pointer opt
gcc -funroll-loops -m64 -march=barcelona -O -fomit-frame-pointer opt
gcc -funroll-loops -m64 -march=barcelona -Os -fomit-frame-pointer opt
gcc -funroll-loops -m64 -march=k8 -O2 -fomit-frame-pointer opt
gcc -funroll-loops -m64 -march=k8 -O3 -fomit-frame-pointer opt
gcc -funroll-loops -m64 -march=k8 -O -fomit-frame-pointer opt
gcc -funroll-loops -m64 -march=k8 -Os -fomit-frame-pointer opt
gcc -funroll-loops -m64 -march=nocona -O2 -fomit-frame-pointer opt
gcc -funroll-loops -m64 -march=nocona -O3 -fomit-frame-pointer opt
gcc -funroll-loops -m64 -march=nocona -O -fomit-frame-pointer opt
gcc -funroll-loops -m64 -march=nocona -Os -fomit-frame-pointer opt
gcc -funroll-loops -march=barcelona -O2 -fomit-frame-pointer opt
gcc -funroll-loops -march=barcelona -O3 -fomit-frame-pointer opt
gcc -funroll-loops -march=barcelona -O -fomit-frame-pointer opt
gcc -funroll-loops -march=barcelona -Os -fomit-frame-pointer opt
gcc -funroll-loops -march=k8 -O2 -fomit-frame-pointer opt
gcc -funroll-loops -march=k8 -O3 -fomit-frame-pointer opt
gcc -funroll-loops -march=k8 -O -fomit-frame-pointer opt
gcc -funroll-loops -march=k8 -Os -fomit-frame-pointer opt
gcc -funroll-loops -march=nocona -O2 -fomit-frame-pointer opt
gcc -funroll-loops -march=nocona -O3 -fomit-frame-pointer opt
gcc -funroll-loops -march=nocona -O -fomit-frame-pointer opt
gcc -funroll-loops -march=nocona -Os -fomit-frame-pointer opt
gcc -m64 -O2 -fomit-frame-pointer opt
gcc -m64 -O3 -fomit-frame-pointer opt
gcc -m64 -O -fomit-frame-pointer opt
gcc -m64 -Os -fomit-frame-pointer opt
gcc -m64 -march=core2 -O2 -fomit-frame-pointer opt
gcc -m64 -march=core2 -O3 -fomit-frame-pointer opt
gcc -m64 -march=core2 -O -fomit-frame-pointer opt
gcc -m64 -march=core2 -Os -fomit-frame-pointer opt
gcc -m64 -march=core2 -msse4.1 -O2 -fomit-frame-pointer opt
gcc -m64 -march=core2 -msse4.1 -O3 -fomit-frame-pointer opt
gcc -m64 -march=core2 -msse4.1 -O -fomit-frame-pointer opt
gcc -m64 -march=core2 -msse4.1 -Os -fomit-frame-pointer opt
gcc -m64 -march=core2 -msse4 -O2 -fomit-frame-pointer opt
gcc -m64 -march=core2 -msse4 -O3 -fomit-frame-pointer opt
gcc -m64 -march=core2 -msse4 -O -fomit-frame-pointer opt
gcc -m64 -march=core2 -msse4 -Os -fomit-frame-pointer opt
gcc -m64 -march=corei7 -O2 -fomit-frame-pointer opt
gcc -m64 -march=corei7 -O3 -fomit-frame-pointer opt
gcc -m64 -march=corei7 -O -fomit-frame-pointer opt
gcc -m64 -march=corei7 -Os -fomit-frame-pointer opt
gcc -m64 -march=k8 -O2 -fomit-frame-pointer opt
gcc -m64 -march=k8 -O3 -fomit-frame-pointer opt
gcc -m64 -march=k8 -O -fomit-frame-pointer opt
gcc -m64 -march=k8 -Os -fomit-frame-pointer opt
gcc -m64 -march=nocona -O2 -fomit-frame-pointer opt
gcc -m64 -march=nocona -O3 -fomit-frame-pointer opt
gcc -m64 -march=nocona -O -fomit-frame-pointer opt
gcc -m64 -march=nocona -Os -fomit-frame-pointer opt
gcc -march=barcelona -O2 -fomit-frame-pointer opt
gcc -march=barcelona -O3 -fomit-frame-pointer opt
gcc -march=barcelona -O -fomit-frame-pointer opt
gcc -march=barcelona -Os -fomit-frame-pointer opt
gcc -march=k8 -O2 -fomit-frame-pointer opt
gcc -march=k8 -O3 -fomit-frame-pointer opt
gcc -march=k8 -O -fomit-frame-pointer opt
gcc -march=k8 -Os -fomit-frame-pointer opt
gcc -march=nocona -O2 -fomit-frame-pointer opt
gcc -march=nocona -O3 -fomit-frame-pointer opt
gcc -march=nocona -O -fomit-frame-pointer opt
gcc -march=nocona -Os -fomit-frame-pointer opt

Compiler output

Implementation: crypto_dh/gls254prot/prot
Compiler: gcc -O2 -fomit-frame-pointer
dh.c: In file included from dh.c:9:0:
dh.c: scmul.incl: In function 'scmul_rand_wreg':
dh.c: scmul.incl:273:17: warning: implicit declaration of function 'types_cond_copy2' [-Wimplicit-function-declaration]
dh.c: types_cond_copy2(tab0, tab4, tabx, n1, n2);
dh.c: ^
dh.c: scmul.incl:284:9: warning: implicit declaration of function 'types_cond_copy' [-Wimplicit-function-declaration]
dh.c: types_cond_copy(QX, tab0, o1);
dh.c: ^
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/immintrin.h:35:0,
dh.c: from gls254.h:20,
dh.c: from dh.c:3:
dh.c: low.incl: In function 'low_mul':
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/tmmintrin.h:185:1: error: inlining failed in call to always_inline '_mm_alignr_epi8': target specific option mismatch
dh.c: _mm_alignr_epi8(__m128i __X, __m128i __Y, const int __N)
dh.c: ^
dh.c: In file included from dh.c:3:0:
dh.c: gls254.h:132:7: error: called from here
dh.c: t0 = _mm_alignr_epi8(m1,m0,8);\
dh.c: ^
dh.c: low.incl:27:2: note: in expansion of macro 'REDUCE'
dh.c: REDUCE(t0, m1, m0);
dh.c: ^
dh.c: In file included from gls254.h:18:0,
dh.c: from dh.c:3:
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline '_mm_clmulepi64_si128': target specific option mismatch
dh.c: ...

Number of similar (compiler,implementation) pairs: 68, namely:
Compiler Implementations
gcc -O2 -fomit-frame-pointer prot
gcc -O3 -fomit-frame-pointer prot
gcc -O -fomit-frame-pointer prot
gcc -Os -fomit-frame-pointer prot
gcc -fno-schedule-insns -O2 -fomit-frame-pointer prot
gcc -fno-schedule-insns -O3 -fomit-frame-pointer prot
gcc -fno-schedule-insns -O -fomit-frame-pointer prot
gcc -fno-schedule-insns -Os -fomit-frame-pointer prot
gcc -funroll-loops -O2 -fomit-frame-pointer prot
gcc -funroll-loops -O3 -fomit-frame-pointer prot
gcc -funroll-loops -O -fomit-frame-pointer prot
gcc -funroll-loops -Os -fomit-frame-pointer prot
gcc -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer prot
gcc -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer prot
gcc -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer prot
gcc -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer prot
gcc -funroll-loops -m64 -O2 -fomit-frame-pointer prot
gcc -funroll-loops -m64 -O3 -fomit-frame-pointer prot
gcc -funroll-loops -m64 -O -fomit-frame-pointer prot
gcc -funroll-loops -m64 -Os -fomit-frame-pointer prot
gcc -funroll-loops -m64 -march=barcelona -O2 -fomit-frame-pointer prot
gcc -funroll-loops -m64 -march=barcelona -O3 -fomit-frame-pointer prot
gcc -funroll-loops -m64 -march=barcelona -O -fomit-frame-pointer prot
gcc -funroll-loops -m64 -march=barcelona -Os -fomit-frame-pointer prot
gcc -funroll-loops -m64 -march=k8 -O2 -fomit-frame-pointer prot
gcc -funroll-loops -m64 -march=k8 -O3 -fomit-frame-pointer prot
gcc -funroll-loops -m64 -march=k8 -O -fomit-frame-pointer prot
gcc -funroll-loops -m64 -march=k8 -Os -fomit-frame-pointer prot
gcc -funroll-loops -m64 -march=nocona -O2 -fomit-frame-pointer prot
gcc -funroll-loops -m64 -march=nocona -O3 -fomit-frame-pointer prot
gcc -funroll-loops -m64 -march=nocona -O -fomit-frame-pointer prot
gcc -funroll-loops -m64 -march=nocona -Os -fomit-frame-pointer prot
gcc -funroll-loops -march=barcelona -O2 -fomit-frame-pointer prot
gcc -funroll-loops -march=barcelona -O3 -fomit-frame-pointer prot
gcc -funroll-loops -march=barcelona -O -fomit-frame-pointer prot
gcc -funroll-loops -march=barcelona -Os -fomit-frame-pointer prot
gcc -funroll-loops -march=k8 -O2 -fomit-frame-pointer prot
gcc -funroll-loops -march=k8 -O3 -fomit-frame-pointer prot
gcc -funroll-loops -march=k8 -O -fomit-frame-pointer prot
gcc -funroll-loops -march=k8 -Os -fomit-frame-pointer prot
gcc -funroll-loops -march=nocona -O2 -fomit-frame-pointer prot
gcc -funroll-loops -march=nocona -O3 -fomit-frame-pointer prot
gcc -funroll-loops -march=nocona -O -fomit-frame-pointer prot
gcc -funroll-loops -march=nocona -Os -fomit-frame-pointer prot
gcc -m64 -O2 -fomit-frame-pointer prot
gcc -m64 -O3 -fomit-frame-pointer prot
gcc -m64 -O -fomit-frame-pointer prot
gcc -m64 -Os -fomit-frame-pointer prot
gcc -m64 -march=k8 -O2 -fomit-frame-pointer prot
gcc -m64 -march=k8 -O3 -fomit-frame-pointer prot
gcc -m64 -march=k8 -O -fomit-frame-pointer prot
gcc -m64 -march=k8 -Os -fomit-frame-pointer prot
gcc -m64 -march=nocona -O2 -fomit-frame-pointer prot
gcc -m64 -march=nocona -O3 -fomit-frame-pointer prot
gcc -m64 -march=nocona -O -fomit-frame-pointer prot
gcc -m64 -march=nocona -Os -fomit-frame-pointer prot
gcc -march=barcelona -O2 -fomit-frame-pointer prot
gcc -march=barcelona -O3 -fomit-frame-pointer prot
gcc -march=barcelona -O -fomit-frame-pointer prot
gcc -march=barcelona -Os -fomit-frame-pointer prot
gcc -march=k8 -O2 -fomit-frame-pointer prot
gcc -march=k8 -O3 -fomit-frame-pointer prot
gcc -march=k8 -O -fomit-frame-pointer prot
gcc -march=k8 -Os -fomit-frame-pointer prot
gcc -march=nocona -O2 -fomit-frame-pointer prot
gcc -march=nocona -O3 -fomit-frame-pointer prot
gcc -march=nocona -O -fomit-frame-pointer prot
gcc -march=nocona -Os -fomit-frame-pointer prot

Compiler output

Implementation: crypto_dh/gls254prot/opt
Compiler: gcc -m64 -march=barcelona -O2 -fomit-frame-pointer
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:43:0,
dh.c: from lib.h:2,
dh.c: from dh.c:2:
dh.c: ffa.h: In function 'low_mul':
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline '_mm_clmulepi64_si128': target specific option mismatch
dh.c: _mm_clmulepi64_si128 (__m128i __X, __m128i __Y, const int __I)
dh.c: ^
dh.c: In file included from dh.c:6:0:
dh.c: ffa.h:64:9: error: called from here
dh.c: re2 = _mm_clmulepi64_si128(op4, op5, ord);\
dh.c: ^
dh.c: ffa.h:96:5: note: in expansion of macro 'low_kts_stp'
dh.c: low_kts_stp(op00,op10,k00,k02,op01,op11,r20,r21,r22,0x11); /* a1xb1 */
dh.c: ^
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:43:0,
dh.c: from lib.h:2,
dh.c: from dh.c:2:
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline '_mm_clmulepi64_si128': target specific option mismatch
dh.c: _mm_clmulepi64_si128 (__m128i __X, __m128i __Y, const int __I)
dh.c: ^
dh.c: In file included from dh.c:6:0:
dh.c: ffa.h:63:9: error: called from here
dh.c: re1 = _mm_clmulepi64_si128(op2, op3, ord);\
dh.c: ^
dh.c: ffa.h:96:5: note: in expansion of macro 'low_kts_stp'
dh.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
Compiler Implementations
gcc -m64 -march=barcelona -O2 -fomit-frame-pointer opt
gcc -m64 -march=barcelona -O3 -fomit-frame-pointer opt
gcc -m64 -march=barcelona -O -fomit-frame-pointer opt
gcc -m64 -march=barcelona -Os -fomit-frame-pointer opt

Compiler output

Implementation: crypto_dh/gls254prot/prot
Compiler: gcc -m64 -march=barcelona -O2 -fomit-frame-pointer
dh.c: In file included from dh.c:9:0:
dh.c: scmul.incl: In function 'scmul_rand_wreg':
dh.c: scmul.incl:273:17: warning: implicit declaration of function 'types_cond_copy2' [-Wimplicit-function-declaration]
dh.c: types_cond_copy2(tab0, tab4, tabx, n1, n2);
dh.c: ^
dh.c: scmul.incl:284:9: warning: implicit declaration of function 'types_cond_copy' [-Wimplicit-function-declaration]
dh.c: types_cond_copy(QX, tab0, o1);
dh.c: ^
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/immintrin.h:35:0,
dh.c: from gls254.h:20,
dh.c: from dh.c:3:
dh.c: low.incl: In function 'low_mul':
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/tmmintrin.h:185:1: error: inlining failed in call to always_inline '_mm_alignr_epi8': target specific option mismatch
dh.c: _mm_alignr_epi8(__m128i __X, __m128i __Y, const int __N)
dh.c: ^
dh.c: In file included from dh.c:3:0:
dh.c: gls254.h:132:7: error: called from here
dh.c: t0 = _mm_alignr_epi8(m1,m0,8);\
dh.c: ^
dh.c: low.incl:27:2: note: in expansion of macro 'REDUCE'
dh.c: REDUCE(t0, m1, m0);
dh.c: ^
dh.c: In file included from gls254.h:18:0,
dh.c: from dh.c:3:
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline '_mm_clmulepi64_si128': target specific option mismatch
dh.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
Compiler Implementations
gcc -m64 -march=barcelona -O2 -fomit-frame-pointer prot
gcc -m64 -march=barcelona -O3 -fomit-frame-pointer prot
gcc -m64 -march=barcelona -O -fomit-frame-pointer prot
gcc -m64 -march=barcelona -Os -fomit-frame-pointer prot

Compiler output

Implementation: crypto_dh/gls254prot/prot
Compiler: gcc -m64 -march=core-avx-i -O2 -fomit-frame-pointer
dh.c: In file included from dh.c:9:0:
dh.c: scmul.incl: In function 'scmul_rand_wreg':
dh.c: scmul.incl:273:17: warning: implicit declaration of function 'types_cond_copy2' [-Wimplicit-function-declaration]
dh.c: types_cond_copy2(tab0, tab4, tabx, n1, n2);
dh.c: ^
dh.c: scmul.incl:284:9: warning: implicit declaration of function 'types_cond_copy' [-Wimplicit-function-declaration]
dh.c: types_cond_copy(QX, tab0, o1);
dh.c: ^

Number of similar (compiler,implementation) pairs: 20, namely:
Compiler Implementations
gcc -m64 -march=core-avx-i -O2 -fomit-frame-pointer prot
gcc -m64 -march=core-avx-i -O3 -fomit-frame-pointer prot
gcc -m64 -march=core-avx-i -O -fomit-frame-pointer prot
gcc -m64 -march=core-avx-i -Os -fomit-frame-pointer prot
gcc -m64 -march=core-avx2 -O2 -fomit-frame-pointer prot
gcc -m64 -march=core-avx2 -O3 -fomit-frame-pointer prot
gcc -m64 -march=core-avx2 -O -fomit-frame-pointer prot
gcc -m64 -march=core-avx2 -Os -fomit-frame-pointer prot
gcc -m64 -march=corei7-avx -O2 -fomit-frame-pointer prot
gcc -m64 -march=corei7-avx -O3 -fomit-frame-pointer prot
gcc -m64 -march=corei7-avx -O -fomit-frame-pointer prot
gcc -m64 -march=corei7-avx -Os -fomit-frame-pointer prot
gcc -m64 -march=native -mtune=native -O2 -fomit-frame-pointer prot
gcc -m64 -march=native -mtune=native -O3 -fomit-frame-pointer prot
gcc -m64 -march=native -mtune=native -O -fomit-frame-pointer prot
gcc -m64 -march=native -mtune=native -Os -fomit-frame-pointer prot
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv prot
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv prot
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv prot
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv prot

Compiler output

Implementation: crypto_dh/gls254prot/prot
Compiler: gcc -m64 -march=core2 -O2 -fomit-frame-pointer
dh.c: In file included from dh.c:9:0:
dh.c: scmul.incl: In function 'scmul_rand_wreg':
dh.c: scmul.incl:273:17: warning: implicit declaration of function 'types_cond_copy2' [-Wimplicit-function-declaration]
dh.c: types_cond_copy2(tab0, tab4, tabx, n1, n2);
dh.c: ^
dh.c: scmul.incl:284:9: warning: implicit declaration of function 'types_cond_copy' [-Wimplicit-function-declaration]
dh.c: types_cond_copy(QX, tab0, o1);
dh.c: ^
dh.c: In file included from gls254.h:18:0,
dh.c: from dh.c:3:
dh.c: low.incl: In function 'low_mul':
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline '_mm_clmulepi64_si128': target specific option mismatch
dh.c: _mm_clmulepi64_si128 (__m128i __X, __m128i __Y, const int __I)
dh.c: ^
dh.c: In file included from dh.c:3:0:
dh.c: gls254.h:157:5: error: called from here
dh.c: t0 = _mm_clmulepi64_si128(t0, t0, 0x10);\
dh.c: ^
dh.c: low.incl:26:2: note: in expansion of macro 'MUL4'
dh.c: MUL4(m1, m0, ma, mb);
dh.c: ^
dh.c: In file included from gls254.h:18:0,
dh.c: from dh.c:3:
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline '_mm_clmulepi64_si128': target specific option mismatch
dh.c: _mm_clmulepi64_si128 (__m128i __X, __m128i __Y, const int __I)
dh.c: ...

Number of similar (compiler,implementation) pairs: 16, namely:
Compiler Implementations
gcc -m64 -march=core2 -O2 -fomit-frame-pointer prot
gcc -m64 -march=core2 -O3 -fomit-frame-pointer prot
gcc -m64 -march=core2 -O -fomit-frame-pointer prot
gcc -m64 -march=core2 -Os -fomit-frame-pointer prot
gcc -m64 -march=core2 -msse4.1 -O2 -fomit-frame-pointer prot
gcc -m64 -march=core2 -msse4.1 -O3 -fomit-frame-pointer prot
gcc -m64 -march=core2 -msse4.1 -O -fomit-frame-pointer prot
gcc -m64 -march=core2 -msse4.1 -Os -fomit-frame-pointer prot
gcc -m64 -march=core2 -msse4 -O2 -fomit-frame-pointer prot
gcc -m64 -march=core2 -msse4 -O3 -fomit-frame-pointer prot
gcc -m64 -march=core2 -msse4 -O -fomit-frame-pointer prot
gcc -m64 -march=core2 -msse4 -Os -fomit-frame-pointer prot
gcc -m64 -march=corei7 -O2 -fomit-frame-pointer prot
gcc -m64 -march=corei7 -O3 -fomit-frame-pointer prot
gcc -m64 -march=corei7 -O -fomit-frame-pointer prot
gcc -m64 -march=corei7 -Os -fomit-frame-pointer prot