Implementation notes: amd64, calvin, crypto_dh/gls254prot

Computer: calvin
Architecture: amd64
CPU ID: AuthenticAMD-00600f20-178bfbff
SUPERCOP version: 20171218
Operation: crypto_dh
Primitive: gls254prot
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
420630? ? ?? ? ?optclang_-O3_-fwrapv_-mavx_-maes_-mpclmul_-fomit-frame-pointer_-Qunused-arguments2017111320171020
423455? ? ?? ? ?optclang_-O3_-fwrapv_-march=native_-fomit-frame-pointer_-Qunused-arguments2017111320171020
423502? ? ?? ? ?optclang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments2017111320171020
434668? ? ?? ? ?optgcc_-m64_-march=core-avx-i_-O3_-fomit-frame-pointer2017111320171020
434669? ? ?? ? ?optgcc_-m64_-march=corei7-avx_-O3_-fomit-frame-pointer2017111320171020
438656? ? ?? ? ?optgcc_-m64_-march=native_-mtune=native_-O3_-fomit-frame-pointer2017111320171020
438870? ? ?? ? ?optgcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv2017111320171020
455945? ? ?? ? ?optgcc_-m64_-march=core-avx-i_-O2_-fomit-frame-pointer2017111320171020
456719? ? ?? ? ?optgcc_-m64_-march=corei7-avx_-O2_-fomit-frame-pointer2017111320171020
460444? ? ?? ? ?protgcc_-m64_-march=core-avx-i_-O3_-fomit-frame-pointer2017111320171020
460469? ? ?? ? ?protgcc_-m64_-march=corei7-avx_-O3_-fomit-frame-pointer2017111320171020
466383? ? ?? ? ?protgcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv2017111320171020
466487? ? ?? ? ?protgcc_-m64_-march=native_-mtune=native_-O3_-fomit-frame-pointer2017111320171020
470207? ? ?? ? ?optgcc_-m64_-march=native_-mtune=native_-O2_-fomit-frame-pointer2017111320171020
470280? ? ?? ? ?optgcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv2017111320171020
473586? ? ?? ? ?optgcc_-m64_-march=native_-mtune=native_-O_-fomit-frame-pointer2017111320171020
473590? ? ?? ? ?optgcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv2017111320171020
474061? ? ?? ? ?optgcc_-m64_-march=core-avx-i_-Os_-fomit-frame-pointer2017111320171020
474079? ? ?? ? ?optgcc_-m64_-march=corei7-avx_-Os_-fomit-frame-pointer2017111320171020
476552? ? ?? ? ?optgcc_-m64_-march=core-avx-i_-O_-fomit-frame-pointer2017111320171020
476554? ? ?? ? ?optgcc_-m64_-march=corei7-avx_-O_-fomit-frame-pointer2017111320171020
479697? ? ?? ? ?optgcc_-m64_-march=native_-mtune=native_-Os_-fomit-frame-pointer2017111320171020
479701? ? ?? ? ?optgcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv2017111320171020
480805? ? ?? ? ?protgcc_-m64_-march=core-avx-i_-O2_-fomit-frame-pointer2017111320171020
486304? ? ?? ? ?protgcc_-m64_-march=native_-mtune=native_-O_-fomit-frame-pointer2017111320171020
486524? ? ?? ? ?protgcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv2017111320171020
486724? ? ?? ? ?protgcc_-m64_-march=corei7-avx_-O2_-fomit-frame-pointer2017111320171020
492428? ? ?? ? ?protgcc_-m64_-march=core-avx-i_-O_-fomit-frame-pointer2017111320171020
492438? ? ?? ? ?protgcc_-m64_-march=corei7-avx_-O_-fomit-frame-pointer2017111320171020
493153? ? ?? ? ?protgcc_-m64_-march=native_-mtune=native_-O2_-fomit-frame-pointer2017111320171020
493404? ? ?? ? ?protgcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv2017111320171020

Test failure

Implementation: crypto_dh/gls254prot/prot
Compiler: clang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments
error 111

Number of similar (compiler,implementation) pairs: 13, namely:
CompilerImplementations
clang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments prot
clang -O3 -fwrapv -mavx -maes -mpclmul -fomit-frame-pointer -Qunused-arguments prot
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments prot
gcc -m64 -march=core-avx-i -Os -fomit-frame-pointer prot
gcc -m64 -march=core-avx2 -O2 -fomit-frame-pointer opt prot
gcc -m64 -march=core-avx2 -O -fomit-frame-pointer opt prot
gcc -m64 -march=core-avx2 -Os -fomit-frame-pointer opt prot
gcc -m64 -march=corei7-avx -Os -fomit-frame-pointer prot
gcc -m64 -march=native -mtune=native -Os -fomit-frame-pointer prot
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv prot

Compiler output

Implementation: crypto_dh/gls254prot/opt
Compiler: cc
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:41:0,
dh.c: from lib.h:2,
dh.c: from dh.c:2:
dh.c: smu.h: In function 'smu_5nf_dna_ltr':
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/smmintrin.h:268:1: error: inlining failed in call to always_inline '_mm_cmpeq_epi64': target specific option mismatch
dh.c: _mm_cmpeq_epi64 (__m128i __X, __m128i __Y)
dh.c: ^
dh.c: In file included from dh.c:8:0:
dh.c: smu.h:251:15: error: called from here
dh.c: msk1[7] = _mm_cmpeq_epi64(cmp[7], dig1);
dh.c: ^
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:41:0,
dh.c: from lib.h:2,
dh.c: from dh.c:2:
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/smmintrin.h:268:1: error: inlining failed in call to always_inline '_mm_cmpeq_epi64': target specific option mismatch
dh.c: _mm_cmpeq_epi64 (__m128i __X, __m128i __Y)
dh.c: ^
dh.c: In file included from dh.c:8:0:
dh.c: smu.h:250:15: error: called from here
dh.c: msk0[7] = _mm_cmpeq_epi64(cmp[7], dig0);
dh.c: ^
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:41:0,
dh.c: from lib.h:2,
dh.c: from dh.c:2:
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/smmintrin.h:268:1: error: inlining failed in call to always_inline '_mm_cmpeq_epi64': target specific option mismatch
dh.c: ...

Number of similar (compiler,implementation) pairs: 3, namely:
CompilerImplementations
cc opt
gcc opt
gcc -funroll-loops opt

Compiler output

Implementation: crypto_dh/gls254prot/prot
Compiler: cc
dh.c: In file included from dh.c:9:0:
dh.c: scmul.incl: In function 'scmul_rand_wreg':
dh.c: scmul.incl:273:17: warning: implicit declaration of function 'types_cond_copy2' [-Wimplicit-function-declaration]
dh.c: types_cond_copy2(tab0, tab4, tabx, n1, n2);
dh.c: ^
dh.c: scmul.incl:284:9: warning: implicit declaration of function 'types_cond_copy' [-Wimplicit-function-declaration]
dh.c: types_cond_copy(QX, tab0, o1);
dh.c: ^
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/immintrin.h:35:0,
dh.c: from gls254.h:20,
dh.c: from dh.c:3:
dh.c: low.incl: In function 'low_sq':
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/tmmintrin.h:136:1: error: inlining failed in call to always_inline '_mm_shuffle_epi8': target specific option mismatch
dh.c: _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
dh.c: ^
dh.c: In file included from dh.c:3:0:
dh.c: gls254.h:168:5: error: called from here
dh.c: t0 = _mm_shuffle_epi8(sq, t0);\
dh.c: ^
dh.c: low.incl:123:2: note: in expansion of macro 'SQUARE'
dh.c: SQUARE(m0, m1, sq, mask);
dh.c: ^
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/immintrin.h:35:0,
dh.c: from gls254.h:20,
dh.c: from dh.c:3:
dh.c: ...

Number of similar (compiler,implementation) pairs: 3, namely:
CompilerImplementations
cc prot
gcc prot
gcc -funroll-loops prot

Compiler output

Implementation: crypto_dh/gls254prot/opt
Compiler: clang -O3 -fomit-frame-pointer -Qunused-arguments
dh.c: In file included from dh.c:6:
dh.c: ./ffa.h:94:5: error: '__builtin_ia32_pclmulqdq128' needs target feature pclmul
dh.c: low_kts_stp(op00,op10,k00,k02,op01,op11,r00,r01,r02,0x00); /* a0xb0 */
dh.c: ^
dh.c: ./ffa.h:62:11: note: expanded from macro 'low_kts_stp'
dh.c: re0 = _mm_clmulepi64_si128(op0, op1, ord);\
dh.c: ^
dh.c: /usr/lib/llvm-3.8/bin/../lib/clang/3.8.0/include/__wmmintrin_pclmul.h:27:13: note: expanded from macro '_mm_clmulepi64_si128'
dh.c: ((__m128i)__builtin_ia32_pclmulqdq128((__v2di)(__m128i)(__X), \
dh.c: ^
dh.c: In file included from dh.c:6:
dh.c: ./ffa.h:94:5: error: '__builtin_ia32_pclmulqdq128' needs target feature pclmul
dh.c: ./ffa.h:63:11: note: expanded from macro 'low_kts_stp'
dh.c: re1 = _mm_clmulepi64_si128(op2, op3, ord);\
dh.c: ^
dh.c: /usr/lib/llvm-3.8/bin/../lib/clang/3.8.0/include/__wmmintrin_pclmul.h:27:13: note: expanded from macro '_mm_clmulepi64_si128'
dh.c: ((__m128i)__builtin_ia32_pclmulqdq128((__v2di)(__m128i)(__X), \
dh.c: ^
dh.c: In file included from dh.c:6:
dh.c: ./ffa.h:94:5: error: '__builtin_ia32_pclmulqdq128' needs target feature pclmul
dh.c: ./ffa.h:64:11: note: expanded from macro 'low_kts_stp'
dh.c: re2 = _mm_clmulepi64_si128(op4, op5, ord);\
dh.c: ^
dh.c: /usr/lib/llvm-3.8/bin/../lib/clang/3.8.0/include/__wmmintrin_pclmul.h:27:13: note: expanded from macro '_mm_clmulepi64_si128'
dh.c: ((__m128i)__builtin_ia32_pclmulqdq128((__v2di)(__m128i)(__X), \
dh.c: ...

Number of similar (compiler,implementation) pairs: 5, namely:
CompilerImplementations
clang -O3 -fomit-frame-pointer -Qunused-arguments opt
clang -O3 -fwrapv -mavx -fomit-frame-pointer -Qunused-arguments opt
clang -mcpu=cortex-a8 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments opt
clang -mcpu=cortex-a9 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments opt
clang -mcpu=native -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments opt

Compiler output

Implementation: crypto_dh/gls254prot/prot
Compiler: clang -O3 -fomit-frame-pointer -Qunused-arguments
dh.c: In file included from dh.c:7:
dh.c: ./low.incl:26:2: error: '__builtin_ia32_pclmulqdq128' needs target feature pclmul
dh.c: MUL4(m1, m0, ma, mb);
dh.c: ^
dh.c: ./gls254.h:155:7: note: expanded from macro 'MUL4'
dh.c: r0 = _mm_clmulepi64_si128(ma, mb, 0x00);\
dh.c: ^
dh.c: /usr/lib/llvm-3.8/bin/../lib/clang/3.8.0/include/__wmmintrin_pclmul.h:27:13: note: expanded from macro '_mm_clmulepi64_si128'
dh.c: ((__m128i)__builtin_ia32_pclmulqdq128((__v2di)(__m128i)(__X), \
dh.c: ^
dh.c: In file included from dh.c:7:
dh.c: ./low.incl:26:2: error: '__builtin_ia32_pclmulqdq128' needs target feature pclmul
dh.c: ./gls254.h:156:7: note: expanded from macro 'MUL4'
dh.c: r1 = _mm_clmulepi64_si128(ma, mb, 0x11);\
dh.c: ^
dh.c: /usr/lib/llvm-3.8/bin/../lib/clang/3.8.0/include/__wmmintrin_pclmul.h:27:13: note: expanded from macro '_mm_clmulepi64_si128'
dh.c: ((__m128i)__builtin_ia32_pclmulqdq128((__v2di)(__m128i)(__X), \
dh.c: ^
dh.c: In file included from dh.c:7:
dh.c: ./low.incl:26:2: error: '__builtin_ia32_pclmulqdq128' needs target feature pclmul
dh.c: ./gls254.h:157:7: note: expanded from macro 'MUL4'
dh.c: t0 = _mm_clmulepi64_si128(t0, t0, 0x10);\
dh.c: ^
dh.c: /usr/lib/llvm-3.8/bin/../lib/clang/3.8.0/include/__wmmintrin_pclmul.h:27:13: note: expanded from macro '_mm_clmulepi64_si128'
dh.c: ((__m128i)__builtin_ia32_pclmulqdq128((__v2di)(__m128i)(__X), \
dh.c: ...

Number of similar (compiler,implementation) pairs: 5, namely:
CompilerImplementations
clang -O3 -fomit-frame-pointer -Qunused-arguments prot
clang -O3 -fwrapv -mavx -fomit-frame-pointer -Qunused-arguments prot
clang -mcpu=cortex-a8 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments prot
clang -mcpu=cortex-a9 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments prot
clang -mcpu=native -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments prot

Compiler output

Implementation: crypto_dh/gls254prot/prot
Compiler: clang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments
dh.c: In file included from dh.c:9:
dh.c: ./scmul.incl:273:17: warning: implicit declaration of function 'types_cond_copy2' is invalid in C99 [-Wimplicit-function-declaration]
dh.c: types_cond_copy2(tab0, tab4, tabx, n1, n2);
dh.c: ^
dh.c: ./scmul.incl:284:9: warning: implicit declaration of function 'types_cond_copy' is invalid in C99 [-Wimplicit-function-declaration]
dh.c: types_cond_copy(QX, tab0, o1);
dh.c: ^
dh.c: 2 warnings generated.

Number of similar (compiler,implementation) pairs: 3, namely:
CompilerImplementations
clang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments prot
clang -O3 -fwrapv -mavx -maes -mpclmul -fomit-frame-pointer -Qunused-arguments prot
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments prot

Compiler output

Implementation: crypto_dh/gls254prot/opt
Compiler: gcc -O2 -fomit-frame-pointer
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:43:0,
dh.c: from lib.h:2,
dh.c: from dh.c:2:
dh.c: ffa.h: In function 'low_mul':
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline '_mm_clmulepi64_si128': target specific option mismatch
dh.c: _mm_clmulepi64_si128 (__m128i __X, __m128i __Y, const int __I)
dh.c: ^
dh.c: In file included from dh.c:6:0:
dh.c: ffa.h:64:9: error: called from here
dh.c: re2 = _mm_clmulepi64_si128(op4, op5, ord);\
dh.c: ^
dh.c: ffa.h:96:5: note: in expansion of macro 'low_kts_stp'
dh.c: low_kts_stp(op00,op10,k00,k02,op01,op11,r20,r21,r22,0x11); /* a1xb1 */
dh.c: ^
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:43:0,
dh.c: from lib.h:2,
dh.c: from dh.c:2:
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline '_mm_clmulepi64_si128': target specific option mismatch
dh.c: _mm_clmulepi64_si128 (__m128i __X, __m128i __Y, const int __I)
dh.c: ^
dh.c: In file included from dh.c:6:0:
dh.c: ffa.h:63:9: error: called from here
dh.c: re1 = _mm_clmulepi64_si128(op2, op3, ord);\
dh.c: ^
dh.c: ffa.h:96:5: note: in expansion of macro 'low_kts_stp'
dh.c: ...

Number of similar (compiler,implementation) pairs: 84, namely:
CompilerImplementations
gcc -O2 -fomit-frame-pointer opt
gcc -O3 -fomit-frame-pointer opt
gcc -O -fomit-frame-pointer opt
gcc -Os -fomit-frame-pointer opt
gcc -fno-schedule-insns -O2 -fomit-frame-pointer opt
gcc -fno-schedule-insns -O3 -fomit-frame-pointer opt
gcc -fno-schedule-insns -O -fomit-frame-pointer opt
gcc -fno-schedule-insns -Os -fomit-frame-pointer opt
gcc -funroll-loops -O2 -fomit-frame-pointer opt
gcc -funroll-loops -O3 -fomit-frame-pointer opt
gcc -funroll-loops -O -fomit-frame-pointer opt
gcc -funroll-loops -Os -fomit-frame-pointer opt
gcc -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer opt
gcc -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer opt
gcc -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer opt
gcc -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer opt
gcc -funroll-loops -m64 -O2 -fomit-frame-pointer opt
gcc -funroll-loops -m64 -O3 -fomit-frame-pointer opt
gcc -funroll-loops -m64 -O -fomit-frame-pointer opt
gcc -funroll-loops -m64 -Os -fomit-frame-pointer opt
gcc -funroll-loops -m64 -march=barcelona -O2 -fomit-frame-pointer opt
gcc -funroll-loops -m64 -march=barcelona -O3 -fomit-frame-pointer opt
gcc -funroll-loops -m64 -march=barcelona -O -fomit-frame-pointer opt
gcc -funroll-loops -m64 -march=barcelona -Os -fomit-frame-pointer opt
gcc -funroll-loops -m64 -march=k8 -O2 -fomit-frame-pointer opt
gcc -funroll-loops -m64 -march=k8 -O3 -fomit-frame-pointer opt
gcc -funroll-loops -m64 -march=k8 -O -fomit-frame-pointer opt
gcc -funroll-loops -m64 -march=k8 -Os -fomit-frame-pointer opt
gcc -funroll-loops -m64 -march=nocona -O2 -fomit-frame-pointer opt
gcc -funroll-loops -m64 -march=nocona -O3 -fomit-frame-pointer opt
gcc -funroll-loops -m64 -march=nocona -O -fomit-frame-pointer opt
gcc -funroll-loops -m64 -march=nocona -Os -fomit-frame-pointer opt
gcc -funroll-loops -march=barcelona -O2 -fomit-frame-pointer opt
gcc -funroll-loops -march=barcelona -O3 -fomit-frame-pointer opt
gcc -funroll-loops -march=barcelona -O -fomit-frame-pointer opt
gcc -funroll-loops -march=barcelona -Os -fomit-frame-pointer opt
gcc -funroll-loops -march=k8 -O2 -fomit-frame-pointer opt
gcc -funroll-loops -march=k8 -O3 -fomit-frame-pointer opt
gcc -funroll-loops -march=k8 -O -fomit-frame-pointer opt
gcc -funroll-loops -march=k8 -Os -fomit-frame-pointer opt
gcc -funroll-loops -march=nocona -O2 -fomit-frame-pointer opt
gcc -funroll-loops -march=nocona -O3 -fomit-frame-pointer opt
gcc -funroll-loops -march=nocona -O -fomit-frame-pointer opt
gcc -funroll-loops -march=nocona -Os -fomit-frame-pointer opt
gcc -m64 -O2 -fomit-frame-pointer opt
gcc -m64 -O3 -fomit-frame-pointer opt
gcc -m64 -O -fomit-frame-pointer opt
gcc -m64 -Os -fomit-frame-pointer opt
gcc -m64 -march=core2 -O2 -fomit-frame-pointer opt
gcc -m64 -march=core2 -O3 -fomit-frame-pointer opt
gcc -m64 -march=core2 -O -fomit-frame-pointer opt
gcc -m64 -march=core2 -Os -fomit-frame-pointer opt
gcc -m64 -march=core2 -msse4.1 -O2 -fomit-frame-pointer opt
gcc -m64 -march=core2 -msse4.1 -O3 -fomit-frame-pointer opt
gcc -m64 -march=core2 -msse4.1 -O -fomit-frame-pointer opt
gcc -m64 -march=core2 -msse4.1 -Os -fomit-frame-pointer opt
gcc -m64 -march=core2 -msse4 -O2 -fomit-frame-pointer opt
gcc -m64 -march=core2 -msse4 -O3 -fomit-frame-pointer opt
gcc -m64 -march=core2 -msse4 -O -fomit-frame-pointer opt
gcc -m64 -march=core2 -msse4 -Os -fomit-frame-pointer opt
gcc -m64 -march=corei7 -O2 -fomit-frame-pointer opt
gcc -m64 -march=corei7 -O3 -fomit-frame-pointer opt
gcc -m64 -march=corei7 -O -fomit-frame-pointer opt
gcc -m64 -march=corei7 -Os -fomit-frame-pointer opt
gcc -m64 -march=k8 -O2 -fomit-frame-pointer opt
gcc -m64 -march=k8 -O3 -fomit-frame-pointer opt
gcc -m64 -march=k8 -O -fomit-frame-pointer opt
gcc -m64 -march=k8 -Os -fomit-frame-pointer opt
gcc -m64 -march=nocona -O2 -fomit-frame-pointer opt
gcc -m64 -march=nocona -O3 -fomit-frame-pointer opt
gcc -m64 -march=nocona -O -fomit-frame-pointer opt
gcc -m64 -march=nocona -Os -fomit-frame-pointer opt
gcc -march=barcelona -O2 -fomit-frame-pointer opt
gcc -march=barcelona -O3 -fomit-frame-pointer opt
gcc -march=barcelona -O -fomit-frame-pointer opt
gcc -march=barcelona -Os -fomit-frame-pointer opt
gcc -march=k8 -O2 -fomit-frame-pointer opt
gcc -march=k8 -O3 -fomit-frame-pointer opt
gcc -march=k8 -O -fomit-frame-pointer opt
gcc -march=k8 -Os -fomit-frame-pointer opt
gcc -march=nocona -O2 -fomit-frame-pointer opt
gcc -march=nocona -O3 -fomit-frame-pointer opt
gcc -march=nocona -O -fomit-frame-pointer opt
gcc -march=nocona -Os -fomit-frame-pointer opt

Compiler output

Implementation: crypto_dh/gls254prot/prot
Compiler: gcc -O2 -fomit-frame-pointer
dh.c: In file included from dh.c:9:0:
dh.c: scmul.incl: In function 'scmul_rand_wreg':
dh.c: scmul.incl:273:17: warning: implicit declaration of function 'types_cond_copy2' [-Wimplicit-function-declaration]
dh.c: types_cond_copy2(tab0, tab4, tabx, n1, n2);
dh.c: ^
dh.c: scmul.incl:284:9: warning: implicit declaration of function 'types_cond_copy' [-Wimplicit-function-declaration]
dh.c: types_cond_copy(QX, tab0, o1);
dh.c: ^
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/immintrin.h:35:0,
dh.c: from gls254.h:20,
dh.c: from dh.c:3:
dh.c: low.incl: In function 'low_mul':
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/tmmintrin.h:185:1: error: inlining failed in call to always_inline '_mm_alignr_epi8': target specific option mismatch
dh.c: _mm_alignr_epi8(__m128i __X, __m128i __Y, const int __N)
dh.c: ^
dh.c: In file included from dh.c:3:0:
dh.c: gls254.h:132:7: error: called from here
dh.c: t0 = _mm_alignr_epi8(m1,m0,8);\
dh.c: ^
dh.c: low.incl:27:2: note: in expansion of macro 'REDUCE'
dh.c: REDUCE(t0, m1, m0);
dh.c: ^
dh.c: In file included from gls254.h:18:0,
dh.c: from dh.c:3:
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline '_mm_clmulepi64_si128': target specific option mismatch
dh.c: ...

Number of similar (compiler,implementation) pairs: 68, namely:
CompilerImplementations
gcc -O2 -fomit-frame-pointer prot
gcc -O3 -fomit-frame-pointer prot
gcc -O -fomit-frame-pointer prot
gcc -Os -fomit-frame-pointer prot
gcc -fno-schedule-insns -O2 -fomit-frame-pointer prot
gcc -fno-schedule-insns -O3 -fomit-frame-pointer prot
gcc -fno-schedule-insns -O -fomit-frame-pointer prot
gcc -fno-schedule-insns -Os -fomit-frame-pointer prot
gcc -funroll-loops -O2 -fomit-frame-pointer prot
gcc -funroll-loops -O3 -fomit-frame-pointer prot
gcc -funroll-loops -O -fomit-frame-pointer prot
gcc -funroll-loops -Os -fomit-frame-pointer prot
gcc -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer prot
gcc -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer prot
gcc -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer prot
gcc -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer prot
gcc -funroll-loops -m64 -O2 -fomit-frame-pointer prot
gcc -funroll-loops -m64 -O3 -fomit-frame-pointer prot
gcc -funroll-loops -m64 -O -fomit-frame-pointer prot
gcc -funroll-loops -m64 -Os -fomit-frame-pointer prot
gcc -funroll-loops -m64 -march=barcelona -O2 -fomit-frame-pointer prot
gcc -funroll-loops -m64 -march=barcelona -O3 -fomit-frame-pointer prot
gcc -funroll-loops -m64 -march=barcelona -O -fomit-frame-pointer prot
gcc -funroll-loops -m64 -march=barcelona -Os -fomit-frame-pointer prot
gcc -funroll-loops -m64 -march=k8 -O2 -fomit-frame-pointer prot
gcc -funroll-loops -m64 -march=k8 -O3 -fomit-frame-pointer prot
gcc -funroll-loops -m64 -march=k8 -O -fomit-frame-pointer prot
gcc -funroll-loops -m64 -march=k8 -Os -fomit-frame-pointer prot
gcc -funroll-loops -m64 -march=nocona -O2 -fomit-frame-pointer prot
gcc -funroll-loops -m64 -march=nocona -O3 -fomit-frame-pointer prot
gcc -funroll-loops -m64 -march=nocona -O -fomit-frame-pointer prot
gcc -funroll-loops -m64 -march=nocona -Os -fomit-frame-pointer prot
gcc -funroll-loops -march=barcelona -O2 -fomit-frame-pointer prot
gcc -funroll-loops -march=barcelona -O3 -fomit-frame-pointer prot
gcc -funroll-loops -march=barcelona -O -fomit-frame-pointer prot
gcc -funroll-loops -march=barcelona -Os -fomit-frame-pointer prot
gcc -funroll-loops -march=k8 -O2 -fomit-frame-pointer prot
gcc -funroll-loops -march=k8 -O3 -fomit-frame-pointer prot
gcc -funroll-loops -march=k8 -O -fomit-frame-pointer prot
gcc -funroll-loops -march=k8 -Os -fomit-frame-pointer prot
gcc -funroll-loops -march=nocona -O2 -fomit-frame-pointer prot
gcc -funroll-loops -march=nocona -O3 -fomit-frame-pointer prot
gcc -funroll-loops -march=nocona -O -fomit-frame-pointer prot
gcc -funroll-loops -march=nocona -Os -fomit-frame-pointer prot
gcc -m64 -O2 -fomit-frame-pointer prot
gcc -m64 -O3 -fomit-frame-pointer prot
gcc -m64 -O -fomit-frame-pointer prot
gcc -m64 -Os -fomit-frame-pointer prot
gcc -m64 -march=k8 -O2 -fomit-frame-pointer prot
gcc -m64 -march=k8 -O3 -fomit-frame-pointer prot
gcc -m64 -march=k8 -O -fomit-frame-pointer prot
gcc -m64 -march=k8 -Os -fomit-frame-pointer prot
gcc -m64 -march=nocona -O2 -fomit-frame-pointer prot
gcc -m64 -march=nocona -O3 -fomit-frame-pointer prot
gcc -m64 -march=nocona -O -fomit-frame-pointer prot
gcc -m64 -march=nocona -Os -fomit-frame-pointer prot
gcc -march=barcelona -O2 -fomit-frame-pointer prot
gcc -march=barcelona -O3 -fomit-frame-pointer prot
gcc -march=barcelona -O -fomit-frame-pointer prot
gcc -march=barcelona -Os -fomit-frame-pointer prot
gcc -march=k8 -O2 -fomit-frame-pointer prot
gcc -march=k8 -O3 -fomit-frame-pointer prot
gcc -march=k8 -O -fomit-frame-pointer prot
gcc -march=k8 -Os -fomit-frame-pointer prot
gcc -march=nocona -O2 -fomit-frame-pointer prot
gcc -march=nocona -O3 -fomit-frame-pointer prot
gcc -march=nocona -O -fomit-frame-pointer prot
gcc -march=nocona -Os -fomit-frame-pointer prot

Compiler output

Implementation: crypto_dh/gls254prot/opt
Compiler: gcc -m64 -march=barcelona -O2 -fomit-frame-pointer
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:43:0,
dh.c: from lib.h:2,
dh.c: from dh.c:2:
dh.c: ffa.h: In function 'low_mul':
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline '_mm_clmulepi64_si128': target specific option mismatch
dh.c: _mm_clmulepi64_si128 (__m128i __X, __m128i __Y, const int __I)
dh.c: ^
dh.c: In file included from dh.c:6:0:
dh.c: ffa.h:64:9: error: called from here
dh.c: re2 = _mm_clmulepi64_si128(op4, op5, ord);\
dh.c: ^
dh.c: ffa.h:96:5: note: in expansion of macro 'low_kts_stp'
dh.c: low_kts_stp(op00,op10,k00,k02,op01,op11,r20,r21,r22,0x11); /* a1xb1 */
dh.c: ^
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:43:0,
dh.c: from lib.h:2,
dh.c: from dh.c:2:
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline '_mm_clmulepi64_si128': target specific option mismatch
dh.c: _mm_clmulepi64_si128 (__m128i __X, __m128i __Y, const int __I)
dh.c: ^
dh.c: In file included from dh.c:6:0:
dh.c: ffa.h:63:9: error: called from here
dh.c: re1 = _mm_clmulepi64_si128(op2, op3, ord);\
dh.c: ^
dh.c: ffa.h:96:5: note: in expansion of macro 'low_kts_stp'
dh.c: ...
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:43:0,
dh.c: from lib.h:2,
dh.c: from dh.c:2:
dh.c: ffa.h: In function 'low_mul':
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline '_mm_clmulepi64_si128': target specific option mismatch
dh.c: _mm_clmulepi64_si128 (__m128i __X, __m128i __Y, const int __I)
dh.c: ^
dh.c: In file included from dh.c:6:0:
dh.c: ffa.h:64:9: error: called from here
dh.c: re2 = _mm_clmulepi64_si128(op4, op5, ord);\
dh.c: ^
dh.c: ffa.h:96:5: note: in expansion of macro 'low_kts_stp'
dh.c: low_kts_stp(op00,op10,k00,k02,op01,op11,r20,r21,r22,0x11); /* a1xb1 */
dh.c: ^
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:43:0,
dh.c: from lib.h:2,
dh.c: from dh.c:2:
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline '_mm_clmulepi64_si128': target specific option mismatch
dh.c: _mm_clmulepi64_si128 (__m128i __X, __m128i __Y, const int __I)
dh.c: ^
dh.c: In file included from dh.c:6:0:
dh.c: ffa.h:63:9: error: called from here
dh.c: re1 = _mm_clmulepi64_si128(op2, op3, ord);\
dh.c: ^
dh.c: ffa.h:96:5: note: in expansion of macro 'low_kts_stp'
dh.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -m64 -march=barcelona -O2 -fomit-frame-pointer opt
gcc -m64 -march=barcelona -O3 -fomit-frame-pointer opt
gcc -m64 -march=barcelona -O -fomit-frame-pointer opt
gcc -m64 -march=barcelona -Os -fomit-frame-pointer opt

Compiler output

Implementation: crypto_dh/gls254prot/prot
Compiler: gcc -m64 -march=barcelona -O2 -fomit-frame-pointer
dh.c: In file included from dh.c:9:0:
dh.c: scmul.incl: In function 'scmul_rand_wreg':
dh.c: scmul.incl:273:17: warning: implicit declaration of function 'types_cond_copy2' [-Wimplicit-function-declaration]
dh.c: types_cond_copy2(tab0, tab4, tabx, n1, n2);
dh.c: ^
dh.c: scmul.incl:284:9: warning: implicit declaration of function 'types_cond_copy' [-Wimplicit-function-declaration]
dh.c: types_cond_copy(QX, tab0, o1);
dh.c: ^
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/immintrin.h:35:0,
dh.c: from gls254.h:20,
dh.c: from dh.c:3:
dh.c: low.incl: In function 'low_mul':
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/tmmintrin.h:185:1: error: inlining failed in call to always_inline '_mm_alignr_epi8': target specific option mismatch
dh.c: _mm_alignr_epi8(__m128i __X, __m128i __Y, const int __N)
dh.c: ^
dh.c: In file included from dh.c:3:0:
dh.c: gls254.h:132:7: error: called from here
dh.c: t0 = _mm_alignr_epi8(m1,m0,8);\
dh.c: ^
dh.c: low.incl:27:2: note: in expansion of macro 'REDUCE'
dh.c: REDUCE(t0, m1, m0);
dh.c: ^
dh.c: In file included from gls254.h:18:0,
dh.c: from dh.c:3:
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline '_mm_clmulepi64_si128': target specific option mismatch
dh.c: ...
dh.c: In file included from dh.c:9:0:
dh.c: scmul.incl: In function 'scmul_rand_wreg':
dh.c: scmul.incl:273:17: warning: implicit declaration of function 'types_cond_copy2' [-Wimplicit-function-declaration]
dh.c: types_cond_copy2(tab0, tab4, tabx, n1, n2);
dh.c: ^
dh.c: scmul.incl:284:9: warning: implicit declaration of function 'types_cond_copy' [-Wimplicit-function-declaration]
dh.c: types_cond_copy(QX, tab0, o1);
dh.c: ^
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/immintrin.h:35:0,
dh.c: from gls254.h:20,
dh.c: from dh.c:3:
dh.c: low.incl: In function 'low_mul':
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/tmmintrin.h:185:1: error: inlining failed in call to always_inline '_mm_alignr_epi8': target specific option mismatch
dh.c: _mm_alignr_epi8(__m128i __X, __m128i __Y, const int __N)
dh.c: ^
dh.c: In file included from dh.c:3:0:
dh.c: gls254.h:132:7: error: called from here
dh.c: t0 = _mm_alignr_epi8(m1,m0,8);\
dh.c: ^
dh.c: low.incl:27:2: note: in expansion of macro 'REDUCE'
dh.c: REDUCE(t0, m1, m0);
dh.c: ^
dh.c: In file included from gls254.h:18:0,
dh.c: from dh.c:3:
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline '_mm_clmulepi64_si128': target specific option mismatch
dh.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -m64 -march=barcelona -O2 -fomit-frame-pointer prot
gcc -m64 -march=barcelona -O3 -fomit-frame-pointer prot
gcc -m64 -march=barcelona -O -fomit-frame-pointer prot
gcc -m64 -march=barcelona -Os -fomit-frame-pointer prot

Compiler output

Implementation: crypto_dh/gls254prot/prot
Compiler: gcc -m64 -march=core-avx-i -O2 -fomit-frame-pointer
dh.c: In file included from dh.c:9:0:
dh.c: scmul.incl: In function 'scmul_rand_wreg':
dh.c: scmul.incl:273:17: warning: implicit declaration of function 'types_cond_copy2' [-Wimplicit-function-declaration]
dh.c: types_cond_copy2(tab0, tab4, tabx, n1, n2);
dh.c: ^
dh.c: scmul.incl:284:9: warning: implicit declaration of function 'types_cond_copy' [-Wimplicit-function-declaration]
dh.c: types_cond_copy(QX, tab0, o1);
dh.c: ^

Number of similar (compiler,implementation) pairs: 19, namely:
CompilerImplementations
gcc -m64 -march=core-avx-i -O2 -fomit-frame-pointer prot
gcc -m64 -march=core-avx-i -O3 -fomit-frame-pointer prot
gcc -m64 -march=core-avx-i -O -fomit-frame-pointer prot
gcc -m64 -march=core-avx-i -Os -fomit-frame-pointer prot
gcc -m64 -march=core-avx2 -O2 -fomit-frame-pointer prot
gcc -m64 -march=core-avx2 -O -fomit-frame-pointer prot
gcc -m64 -march=core-avx2 -Os -fomit-frame-pointer prot
gcc -m64 -march=corei7-avx -O2 -fomit-frame-pointer prot
gcc -m64 -march=corei7-avx -O3 -fomit-frame-pointer prot
gcc -m64 -march=corei7-avx -O -fomit-frame-pointer prot
gcc -m64 -march=corei7-avx -Os -fomit-frame-pointer prot
gcc -m64 -march=native -mtune=native -O2 -fomit-frame-pointer prot
gcc -m64 -march=native -mtune=native -O3 -fomit-frame-pointer prot
gcc -m64 -march=native -mtune=native -O -fomit-frame-pointer prot
gcc -m64 -march=native -mtune=native -Os -fomit-frame-pointer prot
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv prot
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv prot
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv prot
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv prot

Compiler output

Implementation: crypto_dh/gls254prot/prot
Compiler: gcc -m64 -march=core2 -O2 -fomit-frame-pointer
dh.c: In file included from dh.c:9:0:
dh.c: scmul.incl: In function 'scmul_rand_wreg':
dh.c: scmul.incl:273:17: warning: implicit declaration of function 'types_cond_copy2' [-Wimplicit-function-declaration]
dh.c: types_cond_copy2(tab0, tab4, tabx, n1, n2);
dh.c: ^
dh.c: scmul.incl:284:9: warning: implicit declaration of function 'types_cond_copy' [-Wimplicit-function-declaration]
dh.c: types_cond_copy(QX, tab0, o1);
dh.c: ^
dh.c: In file included from gls254.h:18:0,
dh.c: from dh.c:3:
dh.c: low.incl: In function 'low_mul':
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline '_mm_clmulepi64_si128': target specific option mismatch
dh.c: _mm_clmulepi64_si128 (__m128i __X, __m128i __Y, const int __I)
dh.c: ^
dh.c: In file included from dh.c:3:0:
dh.c: gls254.h:157:5: error: called from here
dh.c: t0 = _mm_clmulepi64_si128(t0, t0, 0x10);\
dh.c: ^
dh.c: low.incl:26:2: note: in expansion of macro 'MUL4'
dh.c: MUL4(m1, m0, ma, mb);
dh.c: ^
dh.c: In file included from gls254.h:18:0,
dh.c: from dh.c:3:
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline '_mm_clmulepi64_si128': target specific option mismatch
dh.c: _mm_clmulepi64_si128 (__m128i __X, __m128i __Y, const int __I)
dh.c: ...

Number of similar (compiler,implementation) pairs: 16, namely:
CompilerImplementations
gcc -m64 -march=core2 -O2 -fomit-frame-pointer prot
gcc -m64 -march=core2 -O3 -fomit-frame-pointer prot
gcc -m64 -march=core2 -O -fomit-frame-pointer prot
gcc -m64 -march=core2 -Os -fomit-frame-pointer prot
gcc -m64 -march=core2 -msse4.1 -O2 -fomit-frame-pointer prot
gcc -m64 -march=core2 -msse4.1 -O3 -fomit-frame-pointer prot
gcc -m64 -march=core2 -msse4.1 -O -fomit-frame-pointer prot
gcc -m64 -march=core2 -msse4.1 -Os -fomit-frame-pointer prot
gcc -m64 -march=core2 -msse4 -O2 -fomit-frame-pointer prot
gcc -m64 -march=core2 -msse4 -O3 -fomit-frame-pointer prot
gcc -m64 -march=core2 -msse4 -O -fomit-frame-pointer prot
gcc -m64 -march=core2 -msse4 -Os -fomit-frame-pointer prot
gcc -m64 -march=corei7 -O2 -fomit-frame-pointer prot
gcc -m64 -march=corei7 -O3 -fomit-frame-pointer prot
gcc -m64 -march=corei7 -O -fomit-frame-pointer prot
gcc -m64 -march=corei7 -Os -fomit-frame-pointer prot