Implementation notes: amd64, rumba7, crypto_dh/k277mon

Computer: rumba7
Architecture: amd64
CPU ID: AuthenticAMD-00800f11-178bfbff
SUPERCOP version: 20171218
Operation: crypto_dh
Primitive: k277mon
TimeImplementationCompilerBenchmark dateSUPERCOP version
341580refgcc -m64 -march=core-avx2 -O3 -fomit-frame-pointer2018053020171218
341970refgcc -m64 -march=core-avx-i -O3 -fomit-frame-pointer2018053020171218
342060refgcc -m64 -march=corei7-avx -O3 -fomit-frame-pointer2018053020171218
353520refclang -O3 -fwrapv -mavx -maes -mpclmul -fomit-frame-pointer -Qunused-arguments2018053020171218
355140refclang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments2018053020171218
356250refclang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments2018053020171218
356610refclang -O3 -fwrapv -march=x86-64 -mcpu=core-avx2 -mavx2 -maes -mpclmul -fomit-frame-pointer -Qunused-arguments2018053020171218
374070refgcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv2018053020171218
374640refgcc -m64 -march=native -mtune=native -O3 -fomit-frame-pointer2018053020171218
399180refgcc -m64 -march=core-avx2 -O2 -fomit-frame-pointer2018053020171218
399270refgcc -m64 -march=core-avx-i -O2 -fomit-frame-pointer2018053020171218
400380refgcc -m64 -march=corei7-avx -O2 -fomit-frame-pointer2018053020171218
425790refgcc -m64 -march=core-avx-i -O -fomit-frame-pointer2018053020171218
426750refgcc -m64 -march=core-avx2 -O -fomit-frame-pointer2018053020171218
427740refgcc -m64 -march=core-avx2 -Os -fomit-frame-pointer2018053020171218
428160refgcc -m64 -march=core-avx-i -Os -fomit-frame-pointer2018053020171218
428220refgcc -m64 -march=corei7-avx -Os -fomit-frame-pointer2018053020171218
428340refgcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv2018053020171218
428520refgcc -m64 -march=corei7-avx -O -fomit-frame-pointer2018053020171218
429630refgcc -m64 -march=native -mtune=native -O2 -fomit-frame-pointer2018053020171218
441270refgcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv2018053020171218
441390refgcc -m64 -march=native -mtune=native -Os -fomit-frame-pointer2018053020171218
448530refgcc -m64 -march=native -mtune=native -O -fomit-frame-pointer2018053020171218
450180refgcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv2018053020171218

Compiler output

Implementation: crypto_dh/k277mon/ref
Compiler: cc
dh.c: In file included from dh.c:6:0:
dh.c: ffa.h: In function 'ffa_red_277':
dh.c: ffa.h:123:10: error: '__builtin_ia32_palignr128' needs isa option -m32 -mssse3
dh.c: a_01 = _mm_alignr_epi8(a_02, a_01, 8);
dh.c: ^
dh.c: ffa.h:90:10: error: '__builtin_ia32_pclmulqdq128' needs isa option -m32 -mpclmul
dh.c: m_00 = _mm_clmulepi64_si128(mul, P_00, 0x00);\
dh.c: ^
dh.c: ffa.h:110:5: note: in expansion of macro 'ffa_red_277_stp'
dh.c: ffa_red_277_stp(A##_00,A##_01,A##_02,P##_00,P##_01,P##_02,\
dh.c: ^
dh.c: ffa.h:132:5: note: in expansion of macro 'FFA_RED_277_STP'
dh.c: FFA_RED_277_STP(a,P,m,t_00,mul);
dh.c: ^
dh.c: ffa.h:91:10: error: '__builtin_ia32_pclmulqdq128' needs isa option -m32 -mpclmul
dh.c: m_01 = _mm_clmulepi64_si128(mul, P_00, 0x10);\
dh.c: ^
dh.c: ffa.h:110:5: note: in expansion of macro 'ffa_red_277_stp'
dh.c: ffa_red_277_stp(A##_00,A##_01,A##_02,P##_00,P##_01,P##_02,\
dh.c: ^
dh.c: ffa.h:132:5: note: in expansion of macro 'FFA_RED_277_STP'
dh.c: FFA_RED_277_STP(a,P,m,t_00,mul);
dh.c: ^
dh.c: ffa.h:92:10: error: '__builtin_ia32_pclmulqdq128' needs isa option -m32 -mpclmul
dh.c: m_02 = _mm_clmulepi64_si128(mul, P_01, 0x00);\
dh.c: ...

Number of similar (compiler,implementation) pairs: 3, namely:
CompilerImplementations
cc ref
gcc ref
gcc -funroll-loops ref

Compiler output

Implementation: crypto_dh/k277mon/ref
Compiler: clang -O3 -fomit-frame-pointer -Qunused-arguments
dh.c: In file included from dh.c:6:
dh.c: ./ffa.h:123:12: error: '__builtin_ia32_palignr128' needs target feature ssse3
dh.c: a_01 = _mm_alignr_epi8(a_02, a_01, 8);
dh.c: ^
dh.c: /usr/lib/llvm-3.8/bin/../lib/clang/3.8.0/include/tmmintrin.h:69:12: note: expanded from macro '_mm_alignr_epi8'
dh.c: (__m128i)__builtin_ia32_palignr128((__v16qi)(__m128i)(a), \
dh.c: ^
dh.c: In file included from dh.c:6:
dh.c: ./ffa.h:132:5: error: '__builtin_ia32_pclmulqdq128' needs target feature pclmul
dh.c: FFA_RED_277_STP(a,P,m,t_00,mul);
dh.c: ^
dh.c: ./ffa.h:110:5: note: expanded from macro 'FFA_RED_277_STP'
dh.c: ffa_red_277_stp(A##_00,A##_01,A##_02,P##_00,P##_01,P##_02,\
dh.c: ^
dh.c: ./ffa.h:90:12: note: expanded from macro 'ffa_red_277_stp'
dh.c: m_00 = _mm_clmulepi64_si128(mul, P_00, 0x00);\
dh.c: ^
dh.c: /usr/lib/llvm-3.8/bin/../lib/clang/3.8.0/include/__wmmintrin_pclmul.h:27:13: note: expanded from macro '_mm_clmulepi64_si128'
dh.c: ((__m128i)__builtin_ia32_pclmulqdq128((__v2di)(__m128i)(__X), \
dh.c: ^
dh.c: In file included from dh.c:6:
dh.c: ./ffa.h:132:5: error: '__builtin_ia32_pclmulqdq128' needs target feature pclmul
dh.c: ./ffa.h:110:5: note: expanded from macro 'FFA_RED_277_STP'
dh.c: ffa_red_277_stp(A##_00,A##_01,A##_02,P##_00,P##_01,P##_02,\
dh.c: ^
dh.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
clang -O3 -fomit-frame-pointer -Qunused-arguments ref
clang -mcpu=cortex-a8 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments ref
clang -mcpu=cortex-a9 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments ref
clang -mcpu=native -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments ref

Compiler output

Implementation: crypto_dh/k277mon/ref
Compiler: clang -O3 -fwrapv -mavx2 -fomit-frame-pointer -Qunused-arguments
dh.c: In file included from dh.c:6:
dh.c: ./ffa.h:132:5: error: '__builtin_ia32_pclmulqdq128' needs target feature pclmul
dh.c: FFA_RED_277_STP(a,P,m,t_00,mul);
dh.c: ^
dh.c: ./ffa.h:110:5: note: expanded from macro 'FFA_RED_277_STP'
dh.c: ffa_red_277_stp(A##_00,A##_01,A##_02,P##_00,P##_01,P##_02,\
dh.c: ^
dh.c: ./ffa.h:90:12: note: expanded from macro 'ffa_red_277_stp'
dh.c: m_00 = _mm_clmulepi64_si128(mul, P_00, 0x00);\
dh.c: ^
dh.c: /usr/lib/llvm-3.8/bin/../lib/clang/3.8.0/include/__wmmintrin_pclmul.h:27:13: note: expanded from macro '_mm_clmulepi64_si128'
dh.c: ((__m128i)__builtin_ia32_pclmulqdq128((__v2di)(__m128i)(__X), \
dh.c: ^
dh.c: In file included from dh.c:6:
dh.c: ./ffa.h:132:5: error: '__builtin_ia32_pclmulqdq128' needs target feature pclmul
dh.c: ./ffa.h:110:5: note: expanded from macro 'FFA_RED_277_STP'
dh.c: ffa_red_277_stp(A##_00,A##_01,A##_02,P##_00,P##_01,P##_02,\
dh.c: ^
dh.c: ./ffa.h:91:12: note: expanded from macro 'ffa_red_277_stp'
dh.c: m_01 = _mm_clmulepi64_si128(mul, P_00, 0x10);\
dh.c: ^
dh.c: /usr/lib/llvm-3.8/bin/../lib/clang/3.8.0/include/__wmmintrin_pclmul.h:27:13: note: expanded from macro '_mm_clmulepi64_si128'
dh.c: ((__m128i)__builtin_ia32_pclmulqdq128((__v2di)(__m128i)(__X), \
dh.c: ^
dh.c: In file included from dh.c:6:
dh.c: ...

Number of similar (compiler,implementation) pairs: 2, namely:
CompilerImplementations
clang -O3 -fwrapv -mavx2 -fomit-frame-pointer -Qunused-arguments ref
clang -O3 -fwrapv -mavx -fomit-frame-pointer -Qunused-arguments ref

Compiler output

Implementation: crypto_dh/k277mon/ref
Compiler: gcc -O2 -fomit-frame-pointer
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:37:0,
dh.c: from lib.h:2,
dh.c: from dh.c:2:
dh.c: ffa.h: In function 'ffa_red_277':
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/tmmintrin.h:185:1: error: inlining failed in call to always_inline '_mm_alignr_epi8': target specific option mismatch
dh.c: _mm_alignr_epi8(__m128i __X, __m128i __Y, const int __N)
dh.c: ^
dh.c: In file included from dh.c:6:0:
dh.c: ffa.h:148:13: error: called from here
dh.c: *c_02 = _mm_alignr_epi8(a_02, a_01, 8);
dh.c: ^
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:37:0,
dh.c: from lib.h:2,
dh.c: from dh.c:2:
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/tmmintrin.h:185:1: error: inlining failed in call to always_inline '_mm_alignr_epi8': target specific option mismatch
dh.c: _mm_alignr_epi8(__m128i __X, __m128i __Y, const int __N)
dh.c: ^
dh.c: In file included from dh.c:6:0:
dh.c: ffa.h:147:13: error: called from here
dh.c: *c_01 = _mm_alignr_epi8(a_01, a_00, 8);
dh.c: ^
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:37:0,
dh.c: from lib.h:2,
dh.c: from dh.c:2:
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/tmmintrin.h:185:1: error: inlining failed in call to always_inline '_mm_alignr_epi8': target specific option mismatch
dh.c: ...

Number of similar (compiler,implementation) pairs: 68, namely:
CompilerImplementations
gcc -O2 -fomit-frame-pointer ref
gcc -O3 -fomit-frame-pointer ref
gcc -O -fomit-frame-pointer ref
gcc -Os -fomit-frame-pointer ref
gcc -fno-schedule-insns -O2 -fomit-frame-pointer ref
gcc -fno-schedule-insns -O3 -fomit-frame-pointer ref
gcc -fno-schedule-insns -O -fomit-frame-pointer ref
gcc -fno-schedule-insns -Os -fomit-frame-pointer ref
gcc -funroll-loops -O2 -fomit-frame-pointer ref
gcc -funroll-loops -O3 -fomit-frame-pointer ref
gcc -funroll-loops -O -fomit-frame-pointer ref
gcc -funroll-loops -Os -fomit-frame-pointer ref
gcc -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer ref
gcc -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer ref
gcc -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer ref
gcc -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer ref
gcc -funroll-loops -m64 -O2 -fomit-frame-pointer ref
gcc -funroll-loops -m64 -O3 -fomit-frame-pointer ref
gcc -funroll-loops -m64 -O -fomit-frame-pointer ref
gcc -funroll-loops -m64 -Os -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=barcelona -O2 -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=barcelona -O3 -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=barcelona -O -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=barcelona -Os -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=k8 -O2 -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=k8 -O3 -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=k8 -O -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=k8 -Os -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=nocona -O2 -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=nocona -O3 -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=nocona -O -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=nocona -Os -fomit-frame-pointer ref
gcc -funroll-loops -march=barcelona -O2 -fomit-frame-pointer ref
gcc -funroll-loops -march=barcelona -O3 -fomit-frame-pointer ref
gcc -funroll-loops -march=barcelona -O -fomit-frame-pointer ref
gcc -funroll-loops -march=barcelona -Os -fomit-frame-pointer ref
gcc -funroll-loops -march=k8 -O2 -fomit-frame-pointer ref
gcc -funroll-loops -march=k8 -O3 -fomit-frame-pointer ref
gcc -funroll-loops -march=k8 -O -fomit-frame-pointer ref
gcc -funroll-loops -march=k8 -Os -fomit-frame-pointer ref
gcc -funroll-loops -march=nocona -O2 -fomit-frame-pointer ref
gcc -funroll-loops -march=nocona -O3 -fomit-frame-pointer ref
gcc -funroll-loops -march=nocona -O -fomit-frame-pointer ref
gcc -funroll-loops -march=nocona -Os -fomit-frame-pointer ref
gcc -m64 -O2 -fomit-frame-pointer ref
gcc -m64 -O3 -fomit-frame-pointer ref
gcc -m64 -O -fomit-frame-pointer ref
gcc -m64 -Os -fomit-frame-pointer ref
gcc -m64 -march=k8 -O2 -fomit-frame-pointer ref
gcc -m64 -march=k8 -O3 -fomit-frame-pointer ref
gcc -m64 -march=k8 -O -fomit-frame-pointer ref
gcc -m64 -march=k8 -Os -fomit-frame-pointer ref
gcc -m64 -march=nocona -O2 -fomit-frame-pointer ref
gcc -m64 -march=nocona -O3 -fomit-frame-pointer ref
gcc -m64 -march=nocona -O -fomit-frame-pointer ref
gcc -m64 -march=nocona -Os -fomit-frame-pointer ref
gcc -march=barcelona -O2 -fomit-frame-pointer ref
gcc -march=barcelona -O3 -fomit-frame-pointer ref
gcc -march=barcelona -O -fomit-frame-pointer ref
gcc -march=barcelona -Os -fomit-frame-pointer ref
gcc -march=k8 -O2 -fomit-frame-pointer ref
gcc -march=k8 -O3 -fomit-frame-pointer ref
gcc -march=k8 -O -fomit-frame-pointer ref
gcc -march=k8 -Os -fomit-frame-pointer ref
gcc -march=nocona -O2 -fomit-frame-pointer ref
gcc -march=nocona -O3 -fomit-frame-pointer ref
gcc -march=nocona -O -fomit-frame-pointer ref
gcc -march=nocona -Os -fomit-frame-pointer ref

Compiler output

Implementation: crypto_dh/k277mon/ref
Compiler: gcc -m64 -march=barcelona -O2 -fomit-frame-pointer
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:37:0,
dh.c: from lib.h:2,
dh.c: from dh.c:2:
dh.c: ffa.h: In function 'ffa_red_277':
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/tmmintrin.h:185:1: error: inlining failed in call to always_inline '_mm_alignr_epi8': target specific option mismatch
dh.c: _mm_alignr_epi8(__m128i __X, __m128i __Y, const int __N)
dh.c: ^
dh.c: In file included from dh.c:6:0:
dh.c: ffa.h:148:13: error: called from here
dh.c: *c_02 = _mm_alignr_epi8(a_02, a_01, 8);
dh.c: ^
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:37:0,
dh.c: from lib.h:2,
dh.c: from dh.c:2:
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/tmmintrin.h:185:1: error: inlining failed in call to always_inline '_mm_alignr_epi8': target specific option mismatch
dh.c: _mm_alignr_epi8(__m128i __X, __m128i __Y, const int __N)
dh.c: ^
dh.c: In file included from dh.c:6:0:
dh.c: ffa.h:147:13: error: called from here
dh.c: *c_01 = _mm_alignr_epi8(a_01, a_00, 8);
dh.c: ^
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:37:0,
dh.c: from lib.h:2,
dh.c: from dh.c:2:
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/tmmintrin.h:185:1: error: inlining failed in call to always_inline '_mm_alignr_epi8': target specific option mismatch
dh.c: ...
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:37:0,
dh.c: from lib.h:2,
dh.c: from dh.c:2:
dh.c: ffa.h: In function 'ffa_red_277':
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/tmmintrin.h:185:1: error: inlining failed in call to always_inline '_mm_alignr_epi8': target specific option mismatch
dh.c: _mm_alignr_epi8(__m128i __X, __m128i __Y, const int __N)
dh.c: ^
dh.c: In file included from dh.c:6:0:
dh.c: ffa.h:148:13: error: called from here
dh.c: *c_02 = _mm_alignr_epi8(a_02, a_01, 8);
dh.c: ^
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:37:0,
dh.c: from lib.h:2,
dh.c: from dh.c:2:
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/tmmintrin.h:185:1: error: inlining failed in call to always_inline '_mm_alignr_epi8': target specific option mismatch
dh.c: _mm_alignr_epi8(__m128i __X, __m128i __Y, const int __N)
dh.c: ^
dh.c: In file included from dh.c:6:0:
dh.c: ffa.h:147:13: error: called from here
dh.c: *c_01 = _mm_alignr_epi8(a_01, a_00, 8);
dh.c: ^
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:37:0,
dh.c: from lib.h:2,
dh.c: from dh.c:2:
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/tmmintrin.h:185:1: error: inlining failed in call to always_inline '_mm_alignr_epi8': target specific option mismatch
dh.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -m64 -march=barcelona -O2 -fomit-frame-pointer ref
gcc -m64 -march=barcelona -O3 -fomit-frame-pointer ref
gcc -m64 -march=barcelona -O -fomit-frame-pointer ref
gcc -m64 -march=barcelona -Os -fomit-frame-pointer ref

Compiler output

Implementation: crypto_dh/k277mon/ref
Compiler: gcc -m64 -march=core2 -O2 -fomit-frame-pointer
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:43:0,
dh.c: from lib.h:2,
dh.c: from dh.c:2:
dh.c: ffa.h: In function 'ffa_red_277':
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline '_mm_clmulepi64_si128': target specific option mismatch
dh.c: _mm_clmulepi64_si128 (__m128i __X, __m128i __Y, const int __I)
dh.c: ^
dh.c: In file included from dh.c:6:0:
dh.c: ffa.h:94:10: error: called from here
dh.c: m_04 = _mm_clmulepi64_si128(mul, P_02, 0x00);\
dh.c: ^
dh.c: ffa.h:110:5: note: in expansion of macro 'ffa_red_277_stp'
dh.c: ffa_red_277_stp(A##_00,A##_01,A##_02,P##_00,P##_01,P##_02,\
dh.c: ^
dh.c: ffa.h:142:5: note: in expansion of macro 'FFA_RED_277_STP'
dh.c: FFA_RED_277_STP(a,P,m,t_00,mul);
dh.c: ^
dh.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/x86intrin.h:43:0,
dh.c: from lib.h:2,
dh.c: from dh.c:2:
dh.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:116:1: error: inlining failed in call to always_inline '_mm_clmulepi64_si128': target specific option mismatch
dh.c: _mm_clmulepi64_si128 (__m128i __X, __m128i __Y, const int __I)
dh.c: ^
dh.c: In file included from dh.c:6:0:
dh.c: ffa.h:93:10: error: called from here
dh.c: ...

Number of similar (compiler,implementation) pairs: 16, namely:
CompilerImplementations
gcc -m64 -march=core2 -O2 -fomit-frame-pointer ref
gcc -m64 -march=core2 -O3 -fomit-frame-pointer ref
gcc -m64 -march=core2 -O -fomit-frame-pointer ref
gcc -m64 -march=core2 -Os -fomit-frame-pointer ref
gcc -m64 -march=core2 -msse4.1 -O2 -fomit-frame-pointer ref
gcc -m64 -march=core2 -msse4.1 -O3 -fomit-frame-pointer ref
gcc -m64 -march=core2 -msse4.1 -O -fomit-frame-pointer ref
gcc -m64 -march=core2 -msse4.1 -Os -fomit-frame-pointer ref
gcc -m64 -march=core2 -msse4 -O2 -fomit-frame-pointer ref
gcc -m64 -march=core2 -msse4 -O3 -fomit-frame-pointer ref
gcc -m64 -march=core2 -msse4 -O -fomit-frame-pointer ref
gcc -m64 -march=core2 -msse4 -Os -fomit-frame-pointer ref
gcc -m64 -march=corei7 -O2 -fomit-frame-pointer ref
gcc -m64 -march=corei7 -O3 -fomit-frame-pointer ref
gcc -m64 -march=corei7 -O -fomit-frame-pointer ref
gcc -m64 -march=corei7 -Os -fomit-frame-pointer ref