Implementation notes: amd64, cryptothinkx, crypto_dh/k298

Computer: cryptothinkx
Architecture: amd64
CPU ID: GenuineIntel-00040651-bfebfbff
SUPERCOP version: 20170105
Operation: crypto_dh
Primitive: k298
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
263481? ? ?? ? ?refgcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv2017021520170105
263931? ? ?? ? ?refgcc_-m64_-march=native_-mtune=native_-O3_-fomit-frame-pointer2017021520170105
263967? ? ?? ? ?refgcc_-m64_-march=core-avx2_-O3_-fomit-frame-pointer2017021520170105
264897? ? ?? ? ?refgcc_-m64_-march=corei7-avx_-O3_-fomit-frame-pointer2017021520170105
265065? ? ?? ? ?refgcc_-m64_-march=core-avx-i_-O3_-fomit-frame-pointer2017021520170105
276636? ? ?? ? ?refgcc_-m64_-march=native_-mtune=native_-O2_-fomit-frame-pointer2017021520170105
277101? ? ?? ? ?refgcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv2017021520170105
278103? ? ?? ? ?refgcc_-m64_-march=core-avx2_-O2_-fomit-frame-pointer2017021520170105
279093? ? ?? ? ?refgcc_-m64_-march=core-avx-i_-O2_-fomit-frame-pointer2017021520170105
280314? ? ?? ? ?refgcc_-m64_-march=core-avx2_-Os_-fomit-frame-pointer2017021520170105
281106? ? ?? ? ?refgcc_-m64_-march=core-avx-i_-Os_-fomit-frame-pointer2017021520170105
288792? ? ?? ? ?refgcc_-m64_-march=corei7-avx_-O2_-fomit-frame-pointer2017021520170105
289935? ? ?? ? ?refgcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv2017021520170105
290076? ? ?? ? ?refgcc_-m64_-march=native_-mtune=native_-O_-fomit-frame-pointer2017021520170105
290517? ? ?? ? ?refgcc_-m64_-march=corei7-avx_-O_-fomit-frame-pointer2017021520170105
299607? ? ?? ? ?refgcc_-m64_-march=native_-mtune=native_-Os_-fomit-frame-pointer2017021520170105
310407? ? ?? ? ?refgcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv2017021520170105
311019? ? ?? ? ?refgcc_-m64_-march=corei7-avx_-Os_-fomit-frame-pointer2017021520170105
322248? ? ?? ? ?refgcc_-m64_-march=core-avx2_-O_-fomit-frame-pointer2017021520170105
333856? ? ?? ? ?refgcc_-m64_-march=core-avx-i_-O_-fomit-frame-pointer2017021520170105

Test failure

Implementation: crypto_dh/k298/ref
Compiler: clang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments
error 111

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
clang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments ref
clang -O3 -fwrapv -march=x86-64 -mcpu=core-avx2 -mavx2 -maes -mpclmul -fomit-frame-pointer -Qunused-arguments ref
clang -O3 -fwrapv -mavx -maes -mpclmul -fomit-frame-pointer -Qunused-arguments ref
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments ref

Compiler output

Implementation: crypto_dh/k298/ref
Compiler: cc
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:41:0,
dh.c: from lib.h:2,
dh.c: from dh.c:2:
dh.c: smu.h: In function ‘smu_3nf_ltr’:
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/smmintrin.h:268:1: error: inlining failed in call to always_inline ‘_mm_cmpeq_epi64’: target specific option mismatch
dh.c: _mm_cmpeq_epi64 (__m128i __X, __m128i __Y)
dh.c: ^
dh.c: In file included from dh.c:8:0:
dh.c: smu.h:337:19: error: called from here
dh.c: mask_lps[7] = _mm_cmpeq_epi64(digits[7], dig_sse);
dh.c: ...
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:41:0,
dh.c: from lib.h:2,
dh.c: from dh.c:2:
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/smmintrin.h:268:1: error: inlining failed in call to always_inline ‘_mm_cmpeq_epi64’: target specific option mismatch
dh.c: _mm_cmpeq_epi64 (__m128i __X, __m128i __Y)
dh.c: ^
dh.c: In file included from dh.c:8:0:
dh.c: smu.h:365:23: error: called from here
dh.c: mask_lps[0] = _mm_cmpeq_epi64(digits[0], dig_sse);
dh.c: ^

Number of similar (compiler,implementation) pairs: 3, namely:
CompilerImplementations
cc ref
gcc ref
gcc -funroll-loops ref

Compiler output

Implementation: crypto_dh/k298/ref
Compiler: clang -O3 -fomit-frame-pointer -Qunused-arguments
dh.c: In file included from dh.c:6:
dh.c: ./ffa.h:47:5: error: '__builtin_ia32_pclmulqdq128' needs target feature pclmul
dh.c: ffa_red_149_stp(a_00, a_01, tp_0, tp_1, tp_2, p_149_0, p_149_1);
dh.c: ^
dh.c: ./ffa.h:18:12: note: expanded from macro 'ffa_red_149_stp'
dh.c: tp_2 = _mm_clmulepi64_si128(p_149_0, tp_0, 0x00); ^
dh.c: /usr/lib/llvm-3.8/bin/../lib/clang/3.8.0/include/__wmmintrin_pclmul.h:27:13: note: expanded from macro '_mm_clmulepi64_si128'
dh.c: ((__m128i)__builtin_ia32_pclmulqdq128((__v2di)(__m128i)(__X), ^
dh.c: ...
dh.c: ffa_red_149_stp(a_00, a_01, tp_0, tp_1, tp_2, p_149_0, p_149_1);
dh.c: ^
dh.c: ./ffa.h:18:12: note: expanded from macro 'ffa_red_149_stp'
dh.c: tp_2 = _mm_clmulepi64_si128(p_149_0, tp_0, 0x00); ^
dh.c: /usr/lib/llvm-3.8/bin/../lib/clang/3.8.0/include/__wmmintrin_pclmul.h:27:13: note: expanded from macro '_mm_clmulepi64_si128'
dh.c: ((__m128i)__builtin_ia32_pclmulqdq128((__v2di)(__m128i)(__X), ^
dh.c: fatal error: too many errors emitted, stopping now [-ferror-limit=]
dh.c: 20 errors generated.

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
clang -O3 -fomit-frame-pointer -Qunused-arguments ref
clang -O3 -fwrapv -mavx2 -fomit-frame-pointer -Qunused-arguments ref
clang -O3 -fwrapv -mavx -fomit-frame-pointer -Qunused-arguments ref
clang -mcpu=cortex-a8 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments ref
clang -mcpu=cortex-a9 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments ref
clang -mcpu=native -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments ref

Compiler output

Implementation: crypto_dh/k298/ref
Compiler: gcc -O2 -fomit-frame-pointer
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:43:0,
dh.c: from lib.h:2,
dh.c: from dh.c:2:
dh.c: ffa.h: In function ‘ffa_red_149’:
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline ‘_mm_clmulepi64_si128’: target specific option mismatch
dh.c: _mm_clmulepi64_si128 (__m128i __X, __m128i __Y, const int __I)
dh.c: ^
dh.c: In file included from dh.c:6:0:
dh.c: ffa.h:20:10: error: called from here
dh.c: tp_0 = _mm_clmulepi64_si128(p_149_1, tp_0, 0x00);...
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline ‘_mm_clmulepi64_si128’: target specific option mismatch
dh.c: _mm_clmulepi64_si128 (__m128i __X, __m128i __Y, const int __I)
dh.c: ^
dh.c: In file included from dh.c:6:0:
dh.c: ffa.h:18:10: error: called from here
dh.c: tp_2 = _mm_clmulepi64_si128(p_149_0, tp_0, 0x00); ^
dh.c: ffa.h:82:5: note: in expansion of macro ‘ffa_red_149_stp’
dh.c: ffa_red_149_stp(b_00, b_01, tp_0, tp_1, tp_2, p_149_0, p_149_1);
dh.c: ^

Number of similar (compiler,implementation) pairs: 84, namely:
CompilerImplementations
gcc -O2 -fomit-frame-pointer ref
gcc -O3 -fomit-frame-pointer ref
gcc -O -fomit-frame-pointer ref
gcc -Os -fomit-frame-pointer ref
gcc -fno-schedule-insns -O2 -fomit-frame-pointer ref
gcc -fno-schedule-insns -O3 -fomit-frame-pointer ref
gcc -fno-schedule-insns -O -fomit-frame-pointer ref
gcc -fno-schedule-insns -Os -fomit-frame-pointer ref
gcc -funroll-loops -O2 -fomit-frame-pointer ref
gcc -funroll-loops -O3 -fomit-frame-pointer ref
gcc -funroll-loops -O -fomit-frame-pointer ref
gcc -funroll-loops -Os -fomit-frame-pointer ref
gcc -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer ref
gcc -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer ref
gcc -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer ref
gcc -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer ref
gcc -funroll-loops -m64 -O2 -fomit-frame-pointer ref
gcc -funroll-loops -m64 -O3 -fomit-frame-pointer ref
gcc -funroll-loops -m64 -O -fomit-frame-pointer ref
gcc -funroll-loops -m64 -Os -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=barcelona -O2 -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=barcelona -O3 -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=barcelona -O -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=barcelona -Os -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=k8 -O2 -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=k8 -O3 -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=k8 -O -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=k8 -Os -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=nocona -O2 -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=nocona -O3 -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=nocona -O -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=nocona -Os -fomit-frame-pointer ref
gcc -funroll-loops -march=barcelona -O2 -fomit-frame-pointer ref
gcc -funroll-loops -march=barcelona -O3 -fomit-frame-pointer ref
gcc -funroll-loops -march=barcelona -O -fomit-frame-pointer ref
gcc -funroll-loops -march=barcelona -Os -fomit-frame-pointer ref
gcc -funroll-loops -march=k8 -O2 -fomit-frame-pointer ref
gcc -funroll-loops -march=k8 -O3 -fomit-frame-pointer ref
gcc -funroll-loops -march=k8 -O -fomit-frame-pointer ref
gcc -funroll-loops -march=k8 -Os -fomit-frame-pointer ref
gcc -funroll-loops -march=nocona -O2 -fomit-frame-pointer ref
gcc -funroll-loops -march=nocona -O3 -fomit-frame-pointer ref
gcc -funroll-loops -march=nocona -O -fomit-frame-pointer ref
gcc -funroll-loops -march=nocona -Os -fomit-frame-pointer ref
gcc -m64 -O2 -fomit-frame-pointer ref
gcc -m64 -O3 -fomit-frame-pointer ref
gcc -m64 -O -fomit-frame-pointer ref
gcc -m64 -Os -fomit-frame-pointer ref
gcc -m64 -march=core2 -O2 -fomit-frame-pointer ref
gcc -m64 -march=core2 -O3 -fomit-frame-pointer ref
gcc -m64 -march=core2 -O -fomit-frame-pointer ref
gcc -m64 -march=core2 -Os -fomit-frame-pointer ref
gcc -m64 -march=core2 -msse4.1 -O2 -fomit-frame-pointer ref
gcc -m64 -march=core2 -msse4.1 -O3 -fomit-frame-pointer ref
gcc -m64 -march=core2 -msse4.1 -O -fomit-frame-pointer ref
gcc -m64 -march=core2 -msse4.1 -Os -fomit-frame-pointer ref
gcc -m64 -march=core2 -msse4 -O2 -fomit-frame-pointer ref
gcc -m64 -march=core2 -msse4 -O3 -fomit-frame-pointer ref
gcc -m64 -march=core2 -msse4 -O -fomit-frame-pointer ref
gcc -m64 -march=core2 -msse4 -Os -fomit-frame-pointer ref
gcc -m64 -march=corei7 -O2 -fomit-frame-pointer ref
gcc -m64 -march=corei7 -O3 -fomit-frame-pointer ref
gcc -m64 -march=corei7 -O -fomit-frame-pointer ref
gcc -m64 -march=corei7 -Os -fomit-frame-pointer ref
gcc -m64 -march=k8 -O2 -fomit-frame-pointer ref
gcc -m64 -march=k8 -O3 -fomit-frame-pointer ref
gcc -m64 -march=k8 -O -fomit-frame-pointer ref
gcc -m64 -march=k8 -Os -fomit-frame-pointer ref
gcc -m64 -march=nocona -O2 -fomit-frame-pointer ref
gcc -m64 -march=nocona -O3 -fomit-frame-pointer ref
gcc -m64 -march=nocona -O -fomit-frame-pointer ref
gcc -m64 -march=nocona -Os -fomit-frame-pointer ref
gcc -march=barcelona -O2 -fomit-frame-pointer ref
gcc -march=barcelona -O3 -fomit-frame-pointer ref
gcc -march=barcelona -O -fomit-frame-pointer ref
gcc -march=barcelona -Os -fomit-frame-pointer ref
gcc -march=k8 -O2 -fomit-frame-pointer ref
gcc -march=k8 -O3 -fomit-frame-pointer ref
gcc -march=k8 -O -fomit-frame-pointer ref
gcc -march=k8 -Os -fomit-frame-pointer ref
gcc -march=nocona -O2 -fomit-frame-pointer ref
gcc -march=nocona -O3 -fomit-frame-pointer ref
gcc -march=nocona -O -fomit-frame-pointer ref
gcc -march=nocona -Os -fomit-frame-pointer ref

Compiler output

Implementation: crypto_dh/k298/ref
Compiler: gcc -m64 -march=barcelona -O2 -fomit-frame-pointer
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:43:0,
dh.c: from lib.h:2,
dh.c: from dh.c:2:
dh.c: ffa.h: In function ‘ffa_red_149’:
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline ‘_mm_clmulepi64_si128’: target specific option mismatch
dh.c: _mm_clmulepi64_si128 (__m128i __X, __m128i __Y, const int __I)
dh.c: ^
dh.c: In file included from dh.c:6:0:
dh.c: ffa.h:20:10: error: called from here
dh.c: tp_0 = _mm_clmulepi64_si128(p_149_1, tp_0, 0x00);...
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline ‘_mm_clmulepi64_si128’: target specific option mismatch
dh.c: _mm_clmulepi64_si128 (__m128i __X, __m128i __Y, const int __I)
dh.c: ^
dh.c: In file included from dh.c:6:0:
dh.c: ffa.h:18:10: error: called from here
dh.c: tp_2 = _mm_clmulepi64_si128(p_149_0, tp_0, 0x00); ^
dh.c: ffa.h:82:5: note: in expansion of macro ‘ffa_red_149_stp’
dh.c: ffa_red_149_stp(b_00, b_01, tp_0, tp_1, tp_2, p_149_0, p_149_1);
dh.c: ^
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:43:0,
dh.c: from lib.h:2,
dh.c: from dh.c:2:
dh.c: ffa.h: In function ‘ffa_red_149’:
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline ‘_mm_clmulepi64_si128’: target specific option mismatch
dh.c: _mm_clmulepi64_si128 (__m128i __X, __m128i __Y, const int __I)
dh.c: ^
dh.c: In file included from dh.c:6:0:
dh.c: ffa.h:20:10: error: called from here
dh.c: tp_0 = _mm_clmulepi64_si128(p_149_1, tp_0, 0x00);...
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline ‘_mm_clmulepi64_si128’: target specific option mismatch
dh.c: _mm_clmulepi64_si128 (__m128i __X, __m128i __Y, const int __I)
dh.c: ^
dh.c: In file included from dh.c:6:0:
dh.c: ffa.h:18:10: error: called from here
dh.c: tp_2 = _mm_clmulepi64_si128(p_149_0, tp_0, 0x00); ^
dh.c: ffa.h:82:5: note: in expansion of macro ‘ffa_red_149_stp’
dh.c: ffa_red_149_stp(b_00, b_01, tp_0, tp_1, tp_2, p_149_0, p_149_1);
dh.c: ^

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -m64 -march=barcelona -O2 -fomit-frame-pointer ref
gcc -m64 -march=barcelona -O3 -fomit-frame-pointer ref
gcc -m64 -march=barcelona -O -fomit-frame-pointer ref
gcc -m64 -march=barcelona -Os -fomit-frame-pointer ref