Implementation notes: amd64, cryptothinkx, crypto_dh/gls254prot

Computer: cryptothinkx
Architecture: amd64
CPU ID: GenuineIntel-00040651-bfebfbff
SUPERCOP version: 20170105
Operation: crypto_dh
Primitive: gls254prot
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
213951? ? ?? ? ?protgcc_-m64_-march=corei7-avx_-O3_-fomit-frame-pointer2017021420170105
214476? ? ?? ? ?protgcc_-m64_-march=core-avx2_-O3_-fomit-frame-pointer2017021420170105
221310? ? ?? ? ?protgcc_-m64_-march=native_-mtune=native_-O3_-fomit-frame-pointer2017021420170105
228021? ? ?? ? ?protgcc_-m64_-march=core-avx-i_-O3_-fomit-frame-pointer2017021420170105
229821? ? ?? ? ?protgcc_-m64_-march=core-avx2_-O2_-fomit-frame-pointer2017021420170105
230466? ? ?? ? ?protgcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv2017021420170105
231297? ? ?? ? ?protgcc_-m64_-march=core-avx2_-O_-fomit-frame-pointer2017021420170105
231909? ? ?? ? ?protgcc_-m64_-march=core-avx-i_-O_-fomit-frame-pointer2017021420170105
237699? ? ?? ? ?protgcc_-m64_-march=native_-mtune=native_-O2_-fomit-frame-pointer2017021420170105
238095? ? ?? ? ?protgcc_-m64_-march=corei7-avx_-O2_-fomit-frame-pointer2017021420170105
238638? ? ?? ? ?protgcc_-m64_-march=core-avx-i_-O2_-fomit-frame-pointer2017021420170105
238671? ? ?? ? ?protgcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv2017021420170105
245448? ? ?? ? ?protgcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv2017021420170105
259911? ? ?? ? ?protgcc_-m64_-march=corei7-avx_-O_-fomit-frame-pointer2017021420170105
274028? ? ?? ? ?protgcc_-m64_-march=native_-mtune=native_-O_-fomit-frame-pointer2017021420170105

Checksum failure

Implementation: crypto_dh/gls254prot/opt
Compiler: clang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments
f5c30b272eb62f8e5228048e5e1de6dd2170295d40e4a9f96b93b7dad84775dd
Number of similar (compiler,implementation) pairs: 24, namely:
CompilerImplementations
clang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments opt
clang -O3 -fwrapv -march=x86-64 -mcpu=core-avx2 -mavx2 -maes -mpclmul -fomit-frame-pointer -Qunused-arguments opt
clang -O3 -fwrapv -mavx -maes -mpclmul -fomit-frame-pointer -Qunused-arguments opt
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments opt
gcc -m64 -march=core-avx-i -O2 -fomit-frame-pointer opt
gcc -m64 -march=core-avx-i -O3 -fomit-frame-pointer opt
gcc -m64 -march=core-avx-i -O -fomit-frame-pointer opt
gcc -m64 -march=core-avx-i -Os -fomit-frame-pointer opt
gcc -m64 -march=core-avx2 -O2 -fomit-frame-pointer opt
gcc -m64 -march=core-avx2 -O3 -fomit-frame-pointer opt
gcc -m64 -march=core-avx2 -O -fomit-frame-pointer opt
gcc -m64 -march=core-avx2 -Os -fomit-frame-pointer opt
gcc -m64 -march=corei7-avx -O2 -fomit-frame-pointer opt
gcc -m64 -march=corei7-avx -O3 -fomit-frame-pointer opt
gcc -m64 -march=corei7-avx -O -fomit-frame-pointer opt
gcc -m64 -march=corei7-avx -Os -fomit-frame-pointer opt
gcc -m64 -march=native -mtune=native -O2 -fomit-frame-pointer opt
gcc -m64 -march=native -mtune=native -O3 -fomit-frame-pointer opt
gcc -m64 -march=native -mtune=native -O -fomit-frame-pointer opt
gcc -m64 -march=native -mtune=native -Os -fomit-frame-pointer opt
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv opt
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv opt
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv opt
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv opt

Test failure

Implementation: crypto_dh/gls254prot/prot
Compiler: clang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments
error 111

Number of similar (compiler,implementation) pairs: 9, namely:
CompilerImplementations
clang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments prot
clang -O3 -fwrapv -march=x86-64 -mcpu=core-avx2 -mavx2 -maes -mpclmul -fomit-frame-pointer -Qunused-arguments prot
clang -O3 -fwrapv -mavx -maes -mpclmul -fomit-frame-pointer -Qunused-arguments prot
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments prot
gcc -m64 -march=core-avx-i -Os -fomit-frame-pointer prot
gcc -m64 -march=core-avx2 -Os -fomit-frame-pointer prot
gcc -m64 -march=corei7-avx -Os -fomit-frame-pointer prot
gcc -m64 -march=native -mtune=native -Os -fomit-frame-pointer prot
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv prot

Compiler output

Implementation: crypto_dh/gls254prot/opt
Compiler: cc
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:41:0,
dh.c: from lib.h:2,
dh.c: from dh.c:2:
dh.c: smu.h: In function ‘smu_5nf_dna_ltr’:
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/smmintrin.h:268:1: error: inlining failed in call to always_inline ‘_mm_cmpeq_epi64’: target specific option mismatch
dh.c: _mm_cmpeq_epi64 (__m128i __X, __m128i __Y)
dh.c: ^
dh.c: In file included from dh.c:8:0:
dh.c: smu.h:251:15: error: called from here
dh.c: msk1[7] = _mm_cmpeq_epi64(cmp[7], dig1);
dh.c: ...
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:41:0,
dh.c: from lib.h:2,
dh.c: from dh.c:2:
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/smmintrin.h:268:1: error: inlining failed in call to always_inline ‘_mm_cmpeq_epi64’: target specific option mismatch
dh.c: _mm_cmpeq_epi64 (__m128i __X, __m128i __Y)
dh.c: ^
dh.c: In file included from dh.c:8:0:
dh.c: smu.h:287:19: error: called from here
dh.c: msk0[0] = _mm_cmpeq_epi64(cmp[0], dig0);
dh.c: ^

Number of similar (compiler,implementation) pairs: 3, namely:
CompilerImplementations
cc opt
gcc opt
gcc -funroll-loops opt

Compiler output

Implementation: crypto_dh/gls254prot/prot
Compiler: cc
dh.c: In file included from dh.c:9:0:
dh.c: scmul.incl: In function ‘scmul_rand_wreg’:
dh.c: scmul.incl:273:17: warning: implicit declaration of function ‘types_cond_copy2’ [-Wimplicit-function-declaration]
dh.c: types_cond_copy2(tab0, tab4, tabx, n1, n2);
dh.c: ^
dh.c: scmul.incl:284:9: warning: implicit declaration of function ‘types_cond_copy’ [-Wimplicit-function-declaration]
dh.c: types_cond_copy(QX, tab0, o1);
dh.c: ^
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/immintrin.h:35:0,
dh.c: from gls254.h:20,
dh.c: ...
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/tmmintrin.h:136:1: error: inlining failed in call to always_inline ‘_mm_shuffle_epi8’: target specific option mismatch
dh.c: _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
dh.c: ^
dh.c: In file included from dh.c:3:0:
dh.c: gls254.h:168:5: error: called from here
dh.c: t0 = _mm_shuffle_epi8(sq, t0); ^
dh.c: low.incl:123:2: note: in expansion of macro ‘SQUARE’
dh.c: SQUARE(m0, m1, sq, mask);
dh.c: ^

Number of similar (compiler,implementation) pairs: 3, namely:
CompilerImplementations
cc prot
gcc prot
gcc -funroll-loops prot

Compiler output

Implementation: crypto_dh/gls254prot/opt
Compiler: clang -O3 -fomit-frame-pointer -Qunused-arguments
dh.c: In file included from dh.c:6:
dh.c: ./ffa.h:94:5: error: '__builtin_ia32_pclmulqdq128' needs target feature pclmul
dh.c: low_kts_stp(op00,op10,k00,k02,op01,op11,r00,r01,r02,0x00); /* a0xb0 */
dh.c: ^
dh.c: ./ffa.h:62:11: note: expanded from macro 'low_kts_stp'
dh.c: re0 = _mm_clmulepi64_si128(op0, op1, ord); ^
dh.c: /usr/lib/llvm-3.8/bin/../lib/clang/3.8.0/include/__wmmintrin_pclmul.h:27:13: note: expanded from macro '_mm_clmulepi64_si128'
dh.c: ((__m128i)__builtin_ia32_pclmulqdq128((__v2di)(__m128i)(__X), ^
dh.c: ...
dh.c: ^
dh.c: In file included from dh.c:6:
dh.c: ./ffa.h:96:5: error: '__builtin_ia32_pclmulqdq128' needs target feature pclmul
dh.c: ./ffa.h:64:11: note: expanded from macro 'low_kts_stp'
dh.c: re2 = _mm_clmulepi64_si128(op4, op5, ord); ^
dh.c: /usr/lib/llvm-3.8/bin/../lib/clang/3.8.0/include/__wmmintrin_pclmul.h:27:13: note: expanded from macro '_mm_clmulepi64_si128'
dh.c: ((__m128i)__builtin_ia32_pclmulqdq128((__v2di)(__m128i)(__X), ^
dh.c: 9 errors generated.

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
clang -O3 -fomit-frame-pointer -Qunused-arguments opt
clang -O3 -fwrapv -mavx2 -fomit-frame-pointer -Qunused-arguments opt
clang -O3 -fwrapv -mavx -fomit-frame-pointer -Qunused-arguments opt
clang -mcpu=cortex-a8 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments opt
clang -mcpu=cortex-a9 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments opt
clang -mcpu=native -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments opt

Compiler output

Implementation: crypto_dh/gls254prot/prot
Compiler: clang -O3 -fomit-frame-pointer -Qunused-arguments
dh.c: In file included from dh.c:7:
dh.c: ./low.incl:26:2: error: '__builtin_ia32_pclmulqdq128' needs target feature pclmul
dh.c: MUL4(m1, m0, ma, mb);
dh.c: ^
dh.c: ./gls254.h:155:7: note: expanded from macro 'MUL4'
dh.c: r0 = _mm_clmulepi64_si128(ma, mb, 0x00); ^
dh.c: /usr/lib/llvm-3.8/bin/../lib/clang/3.8.0/include/__wmmintrin_pclmul.h:27:13: note: expanded from macro '_mm_clmulepi64_si128'
dh.c: ((__m128i)__builtin_ia32_pclmulqdq128((__v2di)(__m128i)(__X), ^
dh.c: ...
dh.c: (__m128i)__builtin_ia32_palignr128((__v16qi)(__m128i)(a), ^
dh.c: In file included from dh.c:9:
dh.c: ./scmul.incl:273:17: warning: implicit declaration of function 'types_cond_copy2' is invalid in C99 [-Wimplicit-function-declaration]
dh.c: types_cond_copy2(tab0, tab4, tabx, n1, n2);
dh.c: ^
dh.c: ./scmul.incl:284:9: warning: implicit declaration of function 'types_cond_copy' is invalid in C99 [-Wimplicit-function-declaration]
dh.c: types_cond_copy(QX, tab0, o1);
dh.c: ^
dh.c: 2 warnings and 4 errors generated.

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
clang -O3 -fomit-frame-pointer -Qunused-arguments prot
clang -mcpu=cortex-a8 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments prot
clang -mcpu=cortex-a9 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments prot
clang -mcpu=native -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments prot

Compiler output

Implementation: crypto_dh/gls254prot/prot
Compiler: clang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments
dh.c: In file included from dh.c:9:
dh.c: ./scmul.incl:273:17: warning: implicit declaration of function 'types_cond_copy2' is invalid in C99 [-Wimplicit-function-declaration]
dh.c: types_cond_copy2(tab0, tab4, tabx, n1, n2);
dh.c: ^
dh.c: ./scmul.incl:284:9: warning: implicit declaration of function 'types_cond_copy' is invalid in C99 [-Wimplicit-function-declaration]
dh.c: types_cond_copy(QX, tab0, o1);
dh.c: ^
dh.c: 2 warnings generated.

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
clang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments prot
clang -O3 -fwrapv -march=x86-64 -mcpu=core-avx2 -mavx2 -maes -mpclmul -fomit-frame-pointer -Qunused-arguments prot
clang -O3 -fwrapv -mavx -maes -mpclmul -fomit-frame-pointer -Qunused-arguments prot
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments prot

Compiler output

Implementation: crypto_dh/gls254prot/prot
Compiler: clang -O3 -fwrapv -mavx2 -fomit-frame-pointer -Qunused-arguments
dh.c: In file included from dh.c:7:
dh.c: ./low.incl:26:2: error: '__builtin_ia32_pclmulqdq128' needs target feature pclmul
dh.c: MUL4(m1, m0, ma, mb);
dh.c: ^
dh.c: ./gls254.h:155:7: note: expanded from macro 'MUL4'
dh.c: r0 = _mm_clmulepi64_si128(ma, mb, 0x00); ^
dh.c: /usr/lib/llvm-3.8/bin/../lib/clang/3.8.0/include/__wmmintrin_pclmul.h:27:13: note: expanded from macro '_mm_clmulepi64_si128'
dh.c: ((__m128i)__builtin_ia32_pclmulqdq128((__v2di)(__m128i)(__X), ^
dh.c: ...
dh.c: ((__m128i)__builtin_ia32_pclmulqdq128((__v2di)(__m128i)(__X), ^
dh.c: In file included from dh.c:9:
dh.c: ./scmul.incl:273:17: warning: implicit declaration of function 'types_cond_copy2' is invalid in C99 [-Wimplicit-function-declaration]
dh.c: types_cond_copy2(tab0, tab4, tabx, n1, n2);
dh.c: ^
dh.c: ./scmul.incl:284:9: warning: implicit declaration of function 'types_cond_copy' is invalid in C99 [-Wimplicit-function-declaration]
dh.c: types_cond_copy(QX, tab0, o1);
dh.c: ^
dh.c: 2 warnings and 3 errors generated.

Number of similar (compiler,implementation) pairs: 2, namely:
CompilerImplementations
clang -O3 -fwrapv -mavx2 -fomit-frame-pointer -Qunused-arguments prot
clang -O3 -fwrapv -mavx -fomit-frame-pointer -Qunused-arguments prot

Compiler output

Implementation: crypto_dh/gls254prot/opt
Compiler: gcc -O2 -fomit-frame-pointer
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:43:0,
dh.c: from lib.h:2,
dh.c: from dh.c:2:
dh.c: ffa.h: In function ‘low_mul’:
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline ‘_mm_clmulepi64_si128’: target specific option mismatch
dh.c: _mm_clmulepi64_si128 (__m128i __X, __m128i __Y, const int __I)
dh.c: ^
dh.c: In file included from dh.c:6:0:
dh.c: ffa.h:64:9: error: called from here
dh.c: re2 = _mm_clmulepi64_si128(op4, op5, ord);...
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline ‘_mm_clmulepi64_si128’: target specific option mismatch
dh.c: _mm_clmulepi64_si128 (__m128i __X, __m128i __Y, const int __I)
dh.c: ^
dh.c: In file included from dh.c:6:0:
dh.c: ffa.h:62:9: error: called from here
dh.c: re0 = _mm_clmulepi64_si128(op0, op1, ord); ^
dh.c: ffa.h:96:5: note: in expansion of macro ‘low_kts_stp’
dh.c: low_kts_stp(op00,op10,k00,k02,op01,op11,r20,r21,r22,0x11); /* a1xb1 */
dh.c: ^

Number of similar (compiler,implementation) pairs: 84, namely:
CompilerImplementations
gcc -O2 -fomit-frame-pointer opt
gcc -O3 -fomit-frame-pointer opt
gcc -O -fomit-frame-pointer opt
gcc -Os -fomit-frame-pointer opt
gcc -fno-schedule-insns -O2 -fomit-frame-pointer opt
gcc -fno-schedule-insns -O3 -fomit-frame-pointer opt
gcc -fno-schedule-insns -O -fomit-frame-pointer opt
gcc -fno-schedule-insns -Os -fomit-frame-pointer opt
gcc -funroll-loops -O2 -fomit-frame-pointer opt
gcc -funroll-loops -O3 -fomit-frame-pointer opt
gcc -funroll-loops -O -fomit-frame-pointer opt
gcc -funroll-loops -Os -fomit-frame-pointer opt
gcc -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer opt
gcc -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer opt
gcc -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer opt
gcc -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer opt
gcc -funroll-loops -m64 -O2 -fomit-frame-pointer opt
gcc -funroll-loops -m64 -O3 -fomit-frame-pointer opt
gcc -funroll-loops -m64 -O -fomit-frame-pointer opt
gcc -funroll-loops -m64 -Os -fomit-frame-pointer opt
gcc -funroll-loops -m64 -march=barcelona -O2 -fomit-frame-pointer opt
gcc -funroll-loops -m64 -march=barcelona -O3 -fomit-frame-pointer opt
gcc -funroll-loops -m64 -march=barcelona -O -fomit-frame-pointer opt
gcc -funroll-loops -m64 -march=barcelona -Os -fomit-frame-pointer opt
gcc -funroll-loops -m64 -march=k8 -O2 -fomit-frame-pointer opt
gcc -funroll-loops -m64 -march=k8 -O3 -fomit-frame-pointer opt
gcc -funroll-loops -m64 -march=k8 -O -fomit-frame-pointer opt
gcc -funroll-loops -m64 -march=k8 -Os -fomit-frame-pointer opt
gcc -funroll-loops -m64 -march=nocona -O2 -fomit-frame-pointer opt
gcc -funroll-loops -m64 -march=nocona -O3 -fomit-frame-pointer opt
gcc -funroll-loops -m64 -march=nocona -O -fomit-frame-pointer opt
gcc -funroll-loops -m64 -march=nocona -Os -fomit-frame-pointer opt
gcc -funroll-loops -march=barcelona -O2 -fomit-frame-pointer opt
gcc -funroll-loops -march=barcelona -O3 -fomit-frame-pointer opt
gcc -funroll-loops -march=barcelona -O -fomit-frame-pointer opt
gcc -funroll-loops -march=barcelona -Os -fomit-frame-pointer opt
gcc -funroll-loops -march=k8 -O2 -fomit-frame-pointer opt
gcc -funroll-loops -march=k8 -O3 -fomit-frame-pointer opt
gcc -funroll-loops -march=k8 -O -fomit-frame-pointer opt
gcc -funroll-loops -march=k8 -Os -fomit-frame-pointer opt
gcc -funroll-loops -march=nocona -O2 -fomit-frame-pointer opt
gcc -funroll-loops -march=nocona -O3 -fomit-frame-pointer opt
gcc -funroll-loops -march=nocona -O -fomit-frame-pointer opt
gcc -funroll-loops -march=nocona -Os -fomit-frame-pointer opt
gcc -m64 -O2 -fomit-frame-pointer opt
gcc -m64 -O3 -fomit-frame-pointer opt
gcc -m64 -O -fomit-frame-pointer opt
gcc -m64 -Os -fomit-frame-pointer opt
gcc -m64 -march=core2 -O2 -fomit-frame-pointer opt
gcc -m64 -march=core2 -O3 -fomit-frame-pointer opt
gcc -m64 -march=core2 -O -fomit-frame-pointer opt
gcc -m64 -march=core2 -Os -fomit-frame-pointer opt
gcc -m64 -march=core2 -msse4.1 -O2 -fomit-frame-pointer opt
gcc -m64 -march=core2 -msse4.1 -O3 -fomit-frame-pointer opt
gcc -m64 -march=core2 -msse4.1 -O -fomit-frame-pointer opt
gcc -m64 -march=core2 -msse4.1 -Os -fomit-frame-pointer opt
gcc -m64 -march=core2 -msse4 -O2 -fomit-frame-pointer opt
gcc -m64 -march=core2 -msse4 -O3 -fomit-frame-pointer opt
gcc -m64 -march=core2 -msse4 -O -fomit-frame-pointer opt
gcc -m64 -march=core2 -msse4 -Os -fomit-frame-pointer opt
gcc -m64 -march=corei7 -O2 -fomit-frame-pointer opt
gcc -m64 -march=corei7 -O3 -fomit-frame-pointer opt
gcc -m64 -march=corei7 -O -fomit-frame-pointer opt
gcc -m64 -march=corei7 -Os -fomit-frame-pointer opt
gcc -m64 -march=k8 -O2 -fomit-frame-pointer opt
gcc -m64 -march=k8 -O3 -fomit-frame-pointer opt
gcc -m64 -march=k8 -O -fomit-frame-pointer opt
gcc -m64 -march=k8 -Os -fomit-frame-pointer opt
gcc -m64 -march=nocona -O2 -fomit-frame-pointer opt
gcc -m64 -march=nocona -O3 -fomit-frame-pointer opt
gcc -m64 -march=nocona -O -fomit-frame-pointer opt
gcc -m64 -march=nocona -Os -fomit-frame-pointer opt
gcc -march=barcelona -O2 -fomit-frame-pointer opt
gcc -march=barcelona -O3 -fomit-frame-pointer opt
gcc -march=barcelona -O -fomit-frame-pointer opt
gcc -march=barcelona -Os -fomit-frame-pointer opt
gcc -march=k8 -O2 -fomit-frame-pointer opt
gcc -march=k8 -O3 -fomit-frame-pointer opt
gcc -march=k8 -O -fomit-frame-pointer opt
gcc -march=k8 -Os -fomit-frame-pointer opt
gcc -march=nocona -O2 -fomit-frame-pointer opt
gcc -march=nocona -O3 -fomit-frame-pointer opt
gcc -march=nocona -O -fomit-frame-pointer opt
gcc -march=nocona -Os -fomit-frame-pointer opt

Compiler output

Implementation: crypto_dh/gls254prot/prot
Compiler: gcc -O2 -fomit-frame-pointer
dh.c: In file included from dh.c:9:0:
dh.c: scmul.incl: In function ‘scmul_rand_wreg’:
dh.c: scmul.incl:273:17: warning: implicit declaration of function ‘types_cond_copy2’ [-Wimplicit-function-declaration]
dh.c: types_cond_copy2(tab0, tab4, tabx, n1, n2);
dh.c: ^
dh.c: scmul.incl:284:9: warning: implicit declaration of function ‘types_cond_copy’ [-Wimplicit-function-declaration]
dh.c: types_cond_copy(QX, tab0, o1);
dh.c: ^
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/immintrin.h:35:0,
dh.c: from gls254.h:20,
dh.c: ...
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/tmmintrin.h:185:1: error: inlining failed in call to always_inline ‘_mm_alignr_epi8’: target specific option mismatch
dh.c: _mm_alignr_epi8(__m128i __X, __m128i __Y, const int __N)
dh.c: ^
dh.c: In file included from dh.c:3:0:
dh.c: gls254.h:132:7: error: called from here
dh.c: t0 = _mm_alignr_epi8(m1,m0,8); ^
dh.c: low.incl:27:2: note: in expansion of macro ‘REDUCE’
dh.c: REDUCE(t0, m1, m0);
dh.c: ^

Number of similar (compiler,implementation) pairs: 68, namely:
CompilerImplementations
gcc -O2 -fomit-frame-pointer prot
gcc -O3 -fomit-frame-pointer prot
gcc -O -fomit-frame-pointer prot
gcc -Os -fomit-frame-pointer prot
gcc -fno-schedule-insns -O2 -fomit-frame-pointer prot
gcc -fno-schedule-insns -O3 -fomit-frame-pointer prot
gcc -fno-schedule-insns -O -fomit-frame-pointer prot
gcc -fno-schedule-insns -Os -fomit-frame-pointer prot
gcc -funroll-loops -O2 -fomit-frame-pointer prot
gcc -funroll-loops -O3 -fomit-frame-pointer prot
gcc -funroll-loops -O -fomit-frame-pointer prot
gcc -funroll-loops -Os -fomit-frame-pointer prot
gcc -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer prot
gcc -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer prot
gcc -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer prot
gcc -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer prot
gcc -funroll-loops -m64 -O2 -fomit-frame-pointer prot
gcc -funroll-loops -m64 -O3 -fomit-frame-pointer prot
gcc -funroll-loops -m64 -O -fomit-frame-pointer prot
gcc -funroll-loops -m64 -Os -fomit-frame-pointer prot
gcc -funroll-loops -m64 -march=barcelona -O2 -fomit-frame-pointer prot
gcc -funroll-loops -m64 -march=barcelona -O3 -fomit-frame-pointer prot
gcc -funroll-loops -m64 -march=barcelona -O -fomit-frame-pointer prot
gcc -funroll-loops -m64 -march=barcelona -Os -fomit-frame-pointer prot
gcc -funroll-loops -m64 -march=k8 -O2 -fomit-frame-pointer prot
gcc -funroll-loops -m64 -march=k8 -O3 -fomit-frame-pointer prot
gcc -funroll-loops -m64 -march=k8 -O -fomit-frame-pointer prot
gcc -funroll-loops -m64 -march=k8 -Os -fomit-frame-pointer prot
gcc -funroll-loops -m64 -march=nocona -O2 -fomit-frame-pointer prot
gcc -funroll-loops -m64 -march=nocona -O3 -fomit-frame-pointer prot
gcc -funroll-loops -m64 -march=nocona -O -fomit-frame-pointer prot
gcc -funroll-loops -m64 -march=nocona -Os -fomit-frame-pointer prot
gcc -funroll-loops -march=barcelona -O2 -fomit-frame-pointer prot
gcc -funroll-loops -march=barcelona -O3 -fomit-frame-pointer prot
gcc -funroll-loops -march=barcelona -O -fomit-frame-pointer prot
gcc -funroll-loops -march=barcelona -Os -fomit-frame-pointer prot
gcc -funroll-loops -march=k8 -O2 -fomit-frame-pointer prot
gcc -funroll-loops -march=k8 -O3 -fomit-frame-pointer prot
gcc -funroll-loops -march=k8 -O -fomit-frame-pointer prot
gcc -funroll-loops -march=k8 -Os -fomit-frame-pointer prot
gcc -funroll-loops -march=nocona -O2 -fomit-frame-pointer prot
gcc -funroll-loops -march=nocona -O3 -fomit-frame-pointer prot
gcc -funroll-loops -march=nocona -O -fomit-frame-pointer prot
gcc -funroll-loops -march=nocona -Os -fomit-frame-pointer prot
gcc -m64 -O2 -fomit-frame-pointer prot
gcc -m64 -O3 -fomit-frame-pointer prot
gcc -m64 -O -fomit-frame-pointer prot
gcc -m64 -Os -fomit-frame-pointer prot
gcc -m64 -march=k8 -O2 -fomit-frame-pointer prot
gcc -m64 -march=k8 -O3 -fomit-frame-pointer prot
gcc -m64 -march=k8 -O -fomit-frame-pointer prot
gcc -m64 -march=k8 -Os -fomit-frame-pointer prot
gcc -m64 -march=nocona -O2 -fomit-frame-pointer prot
gcc -m64 -march=nocona -O3 -fomit-frame-pointer prot
gcc -m64 -march=nocona -O -fomit-frame-pointer prot
gcc -m64 -march=nocona -Os -fomit-frame-pointer prot
gcc -march=barcelona -O2 -fomit-frame-pointer prot
gcc -march=barcelona -O3 -fomit-frame-pointer prot
gcc -march=barcelona -O -fomit-frame-pointer prot
gcc -march=barcelona -Os -fomit-frame-pointer prot
gcc -march=k8 -O2 -fomit-frame-pointer prot
gcc -march=k8 -O3 -fomit-frame-pointer prot
gcc -march=k8 -O -fomit-frame-pointer prot
gcc -march=k8 -Os -fomit-frame-pointer prot
gcc -march=nocona -O2 -fomit-frame-pointer prot
gcc -march=nocona -O3 -fomit-frame-pointer prot
gcc -march=nocona -O -fomit-frame-pointer prot
gcc -march=nocona -Os -fomit-frame-pointer prot

Compiler output

Implementation: crypto_dh/gls254prot/opt
Compiler: gcc -m64 -march=barcelona -O2 -fomit-frame-pointer
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:43:0,
dh.c: from lib.h:2,
dh.c: from dh.c:2:
dh.c: ffa.h: In function ‘low_mul’:
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline ‘_mm_clmulepi64_si128’: target specific option mismatch
dh.c: _mm_clmulepi64_si128 (__m128i __X, __m128i __Y, const int __I)
dh.c: ^
dh.c: In file included from dh.c:6:0:
dh.c: ffa.h:64:9: error: called from here
dh.c: re2 = _mm_clmulepi64_si128(op4, op5, ord);...
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline ‘_mm_clmulepi64_si128’: target specific option mismatch
dh.c: _mm_clmulepi64_si128 (__m128i __X, __m128i __Y, const int __I)
dh.c: ^
dh.c: In file included from dh.c:6:0:
dh.c: ffa.h:62:9: error: called from here
dh.c: re0 = _mm_clmulepi64_si128(op0, op1, ord); ^
dh.c: ffa.h:96:5: note: in expansion of macro ‘low_kts_stp’
dh.c: low_kts_stp(op00,op10,k00,k02,op01,op11,r20,r21,r22,0x11); /* a1xb1 */
dh.c: ^
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:43:0,
dh.c: from lib.h:2,
dh.c: from dh.c:2:
dh.c: ffa.h: In function ‘low_mul’:
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline ‘_mm_clmulepi64_si128’: target specific option mismatch
dh.c: _mm_clmulepi64_si128 (__m128i __X, __m128i __Y, const int __I)
dh.c: ^
dh.c: In file included from dh.c:6:0:
dh.c: ffa.h:64:9: error: called from here
dh.c: re2 = _mm_clmulepi64_si128(op4, op5, ord);...
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline ‘_mm_clmulepi64_si128’: target specific option mismatch
dh.c: _mm_clmulepi64_si128 (__m128i __X, __m128i __Y, const int __I)
dh.c: ^
dh.c: In file included from dh.c:6:0:
dh.c: ffa.h:62:9: error: called from here
dh.c: re0 = _mm_clmulepi64_si128(op0, op1, ord); ^
dh.c: ffa.h:96:5: note: in expansion of macro ‘low_kts_stp’
dh.c: low_kts_stp(op00,op10,k00,k02,op01,op11,r20,r21,r22,0x11); /* a1xb1 */
dh.c: ^

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -m64 -march=barcelona -O2 -fomit-frame-pointer opt
gcc -m64 -march=barcelona -O3 -fomit-frame-pointer opt
gcc -m64 -march=barcelona -O -fomit-frame-pointer opt
gcc -m64 -march=barcelona -Os -fomit-frame-pointer opt

Compiler output

Implementation: crypto_dh/gls254prot/prot
Compiler: gcc -m64 -march=barcelona -O2 -fomit-frame-pointer
dh.c: In file included from dh.c:9:0:
dh.c: scmul.incl: In function ‘scmul_rand_wreg’:
dh.c: scmul.incl:273:17: warning: implicit declaration of function ‘types_cond_copy2’ [-Wimplicit-function-declaration]
dh.c: types_cond_copy2(tab0, tab4, tabx, n1, n2);
dh.c: ^
dh.c: scmul.incl:284:9: warning: implicit declaration of function ‘types_cond_copy’ [-Wimplicit-function-declaration]
dh.c: types_cond_copy(QX, tab0, o1);
dh.c: ^
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/immintrin.h:35:0,
dh.c: from gls254.h:20,
dh.c: ...
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/tmmintrin.h:185:1: error: inlining failed in call to always_inline ‘_mm_alignr_epi8’: target specific option mismatch
dh.c: _mm_alignr_epi8(__m128i __X, __m128i __Y, const int __N)
dh.c: ^
dh.c: In file included from dh.c:3:0:
dh.c: gls254.h:132:7: error: called from here
dh.c: t0 = _mm_alignr_epi8(m1,m0,8); ^
dh.c: low.incl:27:2: note: in expansion of macro ‘REDUCE’
dh.c: REDUCE(t0, m1, m0);
dh.c: ^
dh.c: In file included from dh.c:9:0:
dh.c: scmul.incl: In function ‘scmul_rand_wreg’:
dh.c: scmul.incl:273:17: warning: implicit declaration of function ‘types_cond_copy2’ [-Wimplicit-function-declaration]
dh.c: types_cond_copy2(tab0, tab4, tabx, n1, n2);
dh.c: ^
dh.c: scmul.incl:284:9: warning: implicit declaration of function ‘types_cond_copy’ [-Wimplicit-function-declaration]
dh.c: types_cond_copy(QX, tab0, o1);
dh.c: ^
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/immintrin.h:35:0,
dh.c: from gls254.h:20,
dh.c: ...
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/tmmintrin.h:185:1: error: inlining failed in call to always_inline ‘_mm_alignr_epi8’: target specific option mismatch
dh.c: _mm_alignr_epi8(__m128i __X, __m128i __Y, const int __N)
dh.c: ^
dh.c: In file included from dh.c:3:0:
dh.c: gls254.h:132:7: error: called from here
dh.c: t0 = _mm_alignr_epi8(m1,m0,8); ^
dh.c: low.incl:27:2: note: in expansion of macro ‘REDUCE’
dh.c: REDUCE(t0, m1, m0);
dh.c: ^

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -m64 -march=barcelona -O2 -fomit-frame-pointer prot
gcc -m64 -march=barcelona -O3 -fomit-frame-pointer prot
gcc -m64 -march=barcelona -O -fomit-frame-pointer prot
gcc -m64 -march=barcelona -Os -fomit-frame-pointer prot

Compiler output

Implementation: crypto_dh/gls254prot/prot
Compiler: gcc -m64 -march=core-avx-i -O2 -fomit-frame-pointer
dh.c: In file included from dh.c:9:0:
dh.c: scmul.incl: In function ‘scmul_rand_wreg’:
dh.c: scmul.incl:273:17: warning: implicit declaration of function ‘types_cond_copy2’ [-Wimplicit-function-declaration]
dh.c: types_cond_copy2(tab0, tab4, tabx, n1, n2);
dh.c: ^
dh.c: scmul.incl:284:9: warning: implicit declaration of function ‘types_cond_copy’ [-Wimplicit-function-declaration]
dh.c: types_cond_copy(QX, tab0, o1);
dh.c: ^

Number of similar (compiler,implementation) pairs: 20, namely:
CompilerImplementations
gcc -m64 -march=core-avx-i -O2 -fomit-frame-pointer prot
gcc -m64 -march=core-avx-i -O3 -fomit-frame-pointer prot
gcc -m64 -march=core-avx-i -O -fomit-frame-pointer prot
gcc -m64 -march=core-avx-i -Os -fomit-frame-pointer prot
gcc -m64 -march=core-avx2 -O2 -fomit-frame-pointer prot
gcc -m64 -march=core-avx2 -O3 -fomit-frame-pointer prot
gcc -m64 -march=core-avx2 -O -fomit-frame-pointer prot
gcc -m64 -march=core-avx2 -Os -fomit-frame-pointer prot
gcc -m64 -march=corei7-avx -O2 -fomit-frame-pointer prot
gcc -m64 -march=corei7-avx -O3 -fomit-frame-pointer prot
gcc -m64 -march=corei7-avx -O -fomit-frame-pointer prot
gcc -m64 -march=corei7-avx -Os -fomit-frame-pointer prot
gcc -m64 -march=native -mtune=native -O2 -fomit-frame-pointer prot
gcc -m64 -march=native -mtune=native -O3 -fomit-frame-pointer prot
gcc -m64 -march=native -mtune=native -O -fomit-frame-pointer prot
gcc -m64 -march=native -mtune=native -Os -fomit-frame-pointer prot
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv prot
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv prot
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv prot
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv prot

Compiler output

Implementation: crypto_dh/gls254prot/prot
Compiler: gcc -m64 -march=core2 -O2 -fomit-frame-pointer
dh.c: In file included from dh.c:9:0:
dh.c: scmul.incl: In function ‘scmul_rand_wreg’:
dh.c: scmul.incl:273:17: warning: implicit declaration of function ‘types_cond_copy2’ [-Wimplicit-function-declaration]
dh.c: types_cond_copy2(tab0, tab4, tabx, n1, n2);
dh.c: ^
dh.c: scmul.incl:284:9: warning: implicit declaration of function ‘types_cond_copy’ [-Wimplicit-function-declaration]
dh.c: types_cond_copy(QX, tab0, o1);
dh.c: ^
dh.c: In file included from gls254.h:18:0,
dh.c: from dh.c:3:
dh.c: ...
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline ‘_mm_clmulepi64_si128’: target specific option mismatch
dh.c: _mm_clmulepi64_si128 (__m128i __X, __m128i __Y, const int __I)
dh.c: ^
dh.c: In file included from dh.c:3:0:
dh.c: gls254.h:155:5: error: called from here
dh.c: r0 = _mm_clmulepi64_si128(ma, mb, 0x00); ^
dh.c: low.incl:26:2: note: in expansion of macro ‘MUL4’
dh.c: MUL4(m1, m0, ma, mb);
dh.c: ^

Number of similar (compiler,implementation) pairs: 16, namely:
CompilerImplementations
gcc -m64 -march=core2 -O2 -fomit-frame-pointer prot
gcc -m64 -march=core2 -O3 -fomit-frame-pointer prot
gcc -m64 -march=core2 -O -fomit-frame-pointer prot
gcc -m64 -march=core2 -Os -fomit-frame-pointer prot
gcc -m64 -march=core2 -msse4.1 -O2 -fomit-frame-pointer prot
gcc -m64 -march=core2 -msse4.1 -O3 -fomit-frame-pointer prot
gcc -m64 -march=core2 -msse4.1 -O -fomit-frame-pointer prot
gcc -m64 -march=core2 -msse4.1 -Os -fomit-frame-pointer prot
gcc -m64 -march=core2 -msse4 -O2 -fomit-frame-pointer prot
gcc -m64 -march=core2 -msse4 -O3 -fomit-frame-pointer prot
gcc -m64 -march=core2 -msse4 -O -fomit-frame-pointer prot
gcc -m64 -march=core2 -msse4 -Os -fomit-frame-pointer prot
gcc -m64 -march=corei7 -O2 -fomit-frame-pointer prot
gcc -m64 -march=corei7 -O3 -fomit-frame-pointer prot
gcc -m64 -march=corei7 -O -fomit-frame-pointer prot
gcc -m64 -march=corei7 -Os -fomit-frame-pointer prot