Implementation notes: amd64, gsm2, crypto_dh/k277taa

Computer: gsm2
Architecture: amd64
CPU ID: GenuineIntel-000306e4-bfebfbff
SUPERCOP version: 20170725
Operation: crypto_dh
Primitive: k277taa
Time     Object size  Test size  Implementation  Compiler                                                          Benchmark date  SUPERCOP version
731632   ? ? ?        ? ? ?      ref             gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv   20170806        20170725
1075628  ? ? ?        ? ? ?      ref             gcc_-m64_-march=native_-mtune=native_-O3_-fomit-frame-pointer     20170806        20170725
1099700  ? ? ?        ? ? ?      ref             gcc_-m64_-march=core-avx-i_-O2_-fomit-frame-pointer               20170806        20170725
1108656  ? ? ?        ? ? ?      ref             gcc_-m64_-march=corei7-avx_-Os_-fomit-frame-pointer               20170806        20170725
1116712  ? ? ?        ? ? ?      ref             gcc_-m64_-march=core-avx-i_-O_-fomit-frame-pointer                20170806        20170725
1126252  ? ? ?        ? ? ?      ref             gcc_-m64_-march=core-avx-i_-O3_-fomit-frame-pointer               20170806        20170725
1131664  ? ? ?        ? ? ?      ref             gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv  20170806        20170725
1153368  ? ? ?        ? ? ?      ref             gcc_-m64_-march=corei7-avx_-O2_-fomit-frame-pointer               20170806        20170725
1164800  ? ? ?        ? ? ?      ref             gcc_-m64_-march=corei7-avx_-O3_-fomit-frame-pointer               20170806        20170725
1177860  ? ? ?        ? ? ?      ref             gcc_-m64_-march=native_-mtune=native_-Os_-fomit-frame-pointer     20170806        20170725
1177868  ? ? ?        ? ? ?      ref             gcc_-m64_-march=native_-mtune=native_-O_-fomit-frame-pointer      20170806        20170725
1198252  ? ? ?        ? ? ?      ref             gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv  20170806        20170725
1212648  ? ? ?        ? ? ?      ref             gcc_-m64_-march=native_-mtune=native_-O2_-fomit-frame-pointer     20170806        20170725
1220028  ? ? ?        ? ? ?      ref             gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv  20170806        20170725
1224656  ? ? ?        ? ? ?      ref             gcc_-m64_-march=corei7-avx_-O_-fomit-frame-pointer                20170806        20170725
1245764  ? ? ?        ? ? ?      ref             gcc_-m64_-march=core-avx-i_-Os_-fomit-frame-pointer               20170806        20170725

Test failure

Implementation: crypto_dh/k277taa/ref
Compiler: gcc -m64 -march=core-avx2 -O2 -fomit-frame-pointer
error 111

Number of similar (compiler,implementation) pairs: 3, namely:
Compiler  Implementations
gcc -m64 -march=core-avx2 -O2 -fomit-frame-pointer ref
gcc -m64 -march=core-avx2 -O -fomit-frame-pointer ref
gcc -m64 -march=core-avx2 -Os -fomit-frame-pointer ref
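
Note on the failure above: the report gives no cause for error 111. One possible explanation, stated here as an assumption and not confirmed by the log, is that the binary was built for an instruction set the host does not implement. The CPU ID 000306e4 decodes to family 6, model 62 (Ivy Bridge-EP), a generation that predates AVX2, while the failing builds all use -march=core-avx2. The sketch below shows how a host could be probed at run time with the GCC/Clang builtin __builtin_cpu_supports. The function names host_supports_avx2 and host_supports_pclmul are hypothetical and are not part of SUPERCOP or k277taa.

```c
#include <stdio.h>

/* Hypothetical helpers: return 1 if the running CPU reports the feature,
 * 0 if it does not, and -1 if the check is unavailable (non-x86 target
 * or a compiler without the GCC-style builtins). */
static int host_supports_avx2(void) {
#if defined(__GNUC__) && (defined(__x86_64__) || defined(__i386__))
    __builtin_cpu_init();                       /* populate CPU feature data */
    return __builtin_cpu_supports("avx2") ? 1 : 0;
#else
    return -1;
#endif
}

static int host_supports_pclmul(void) {
#if defined(__GNUC__) && (defined(__x86_64__) || defined(__i386__))
    __builtin_cpu_init();
    return __builtin_cpu_supports("pclmul") ? 1 : 0;
#else
    return -1;
#endif
}

/* Print a one-line summary of the two features relevant to this report. */
static void print_host_features(void) {
    printf("avx2=%d pclmul=%d\n", host_supports_avx2(), host_supports_pclmul());
}
```

On a host like gsm2 one would expect print_host_features to report pclmul as present and avx2 as absent, but that expectation follows from the CPU model, not from anything measured in this report.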

Compiler output

Implementation: crypto_dh/k277taa/ref
Compiler: cc
dh.c: In file included from dh.c:6:0:
dh.c: ffa.h: In function 'ffa_red_277':
dh.c: ffa.h:123:10: error: incompatible types when assigning to type '__m128i' from type 'int'
dh.c: a_01 = _mm_alignr_epi8(a_02, a_01, 8);
dh.c: ^
dh.c: ffa.h:90:10: error: incompatible types when assigning to type '__m128i' from type 'int'
dh.c: m_00 = _mm_clmulepi64_si128(mul, P_00, 0x00);\
dh.c: ^
dh.c: ffa.h:110:5: note: in expansion of macro 'ffa_red_277_stp'
dh.c: ffa_red_277_stp(A##_00,A##_01,A##_02,P##_00,P##_01,P##_02,\
dh.c: ^
dh.c: ffa.h:132:5: note: in expansion of macro 'FFA_RED_277_STP'
dh.c: FFA_RED_277_STP(a,P,m,t_00,mul);
dh.c: ^
dh.c: ffa.h:91:10: error: incompatible types when assigning to type '__m128i' from type 'int'
dh.c: m_01 = _mm_clmulepi64_si128(mul, P_00, 0x10);\
dh.c: ^
dh.c: ffa.h:110:5: note: in expansion of macro 'ffa_red_277_stp'
dh.c: ffa_red_277_stp(A##_00,A##_01,A##_02,P##_00,P##_01,P##_02,\
dh.c: ^
dh.c: ffa.h:132:5: note: in expansion of macro 'FFA_RED_277_STP'
dh.c: FFA_RED_277_STP(a,P,m,t_00,mul);
dh.c: ^
dh.c: ffa.h:92:10: error: incompatible types when assigning to type '__m128i' from type 'int'
dh.c: m_02 = _mm_clmulepi64_si128(mul, P_01, 0x00);\
dh.c: ...

Number of similar (compiler,implementation) pairs: 71, namely:
Compiler  Implementations
cc ref
gcc ref
gcc -O2 -fomit-frame-pointer ref
gcc -O3 -fomit-frame-pointer ref
gcc -O -fomit-frame-pointer ref
gcc -Os -fomit-frame-pointer ref
gcc -fno-schedule-insns -O2 -fomit-frame-pointer ref
gcc -fno-schedule-insns -O3 -fomit-frame-pointer ref
gcc -fno-schedule-insns -O -fomit-frame-pointer ref
gcc -fno-schedule-insns -Os -fomit-frame-pointer ref
gcc -funroll-loops ref
gcc -funroll-loops -O2 -fomit-frame-pointer ref
gcc -funroll-loops -O3 -fomit-frame-pointer ref
gcc -funroll-loops -O -fomit-frame-pointer ref
gcc -funroll-loops -Os -fomit-frame-pointer ref
gcc -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer ref
gcc -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer ref
gcc -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer ref
gcc -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer ref
gcc -funroll-loops -m64 -O2 -fomit-frame-pointer ref
gcc -funroll-loops -m64 -O3 -fomit-frame-pointer ref
gcc -funroll-loops -m64 -O -fomit-frame-pointer ref
gcc -funroll-loops -m64 -Os -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=barcelona -O2 -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=barcelona -O3 -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=barcelona -O -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=barcelona -Os -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=k8 -O2 -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=k8 -O3 -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=k8 -O -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=k8 -Os -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=nocona -O2 -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=nocona -O3 -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=nocona -O -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=nocona -Os -fomit-frame-pointer ref
gcc -funroll-loops -march=barcelona -O2 -fomit-frame-pointer ref
gcc -funroll-loops -march=barcelona -O3 -fomit-frame-pointer ref
gcc -funroll-loops -march=barcelona -O -fomit-frame-pointer ref
gcc -funroll-loops -march=barcelona -Os -fomit-frame-pointer ref
gcc -funroll-loops -march=k8 -O2 -fomit-frame-pointer ref
gcc -funroll-loops -march=k8 -O3 -fomit-frame-pointer ref
gcc -funroll-loops -march=k8 -O -fomit-frame-pointer ref
gcc -funroll-loops -march=k8 -Os -fomit-frame-pointer ref
gcc -funroll-loops -march=nocona -O2 -fomit-frame-pointer ref
gcc -funroll-loops -march=nocona -O3 -fomit-frame-pointer ref
gcc -funroll-loops -march=nocona -O -fomit-frame-pointer ref
gcc -funroll-loops -march=nocona -Os -fomit-frame-pointer ref
gcc -m64 -O2 -fomit-frame-pointer ref
gcc -m64 -O3 -fomit-frame-pointer ref
gcc -m64 -O -fomit-frame-pointer ref
gcc -m64 -Os -fomit-frame-pointer ref
gcc -m64 -march=k8 -O2 -fomit-frame-pointer ref
gcc -m64 -march=k8 -O3 -fomit-frame-pointer ref
gcc -m64 -march=k8 -O -fomit-frame-pointer ref
gcc -m64 -march=k8 -Os -fomit-frame-pointer ref
gcc -m64 -march=nocona -O2 -fomit-frame-pointer ref
gcc -m64 -march=nocona -O3 -fomit-frame-pointer ref
gcc -m64 -march=nocona -O -fomit-frame-pointer ref
gcc -m64 -march=nocona -Os -fomit-frame-pointer ref
gcc -march=barcelona -O2 -fomit-frame-pointer ref
gcc -march=barcelona -O3 -fomit-frame-pointer ref
gcc -march=barcelona -O -fomit-frame-pointer ref
gcc -march=barcelona -Os -fomit-frame-pointer ref
gcc -march=k8 -O2 -fomit-frame-pointer ref
gcc -march=k8 -O3 -fomit-frame-pointer ref
gcc -march=k8 -O -fomit-frame-pointer ref
gcc -march=k8 -Os -fomit-frame-pointer ref
gcc -march=nocona -O2 -fomit-frame-pointer ref
gcc -march=nocona -O3 -fomit-frame-pointer ref
gcc -march=nocona -O -fomit-frame-pointer ref
gcc -march=nocona -Os -fomit-frame-pointer ref
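
Note on the errors above: "incompatible types when assigning to type '__m128i' from type 'int'" is what GCC reports when an intrinsic has no visible declaration and is therefore treated as an implicitly declared function returning int. That is consistent with _mm_clmulepi64_si128 (PCLMULQDQ) and _mm_alignr_epi8 (SSSE3) not being enabled by any of the compiler options in this list. The following is a minimal sketch of guarding the carry-less multiply on the predefined macro __PCLMUL__, with a portable fallback. The names clmul_result, clmul64 and clmul64_portable are hypothetical; this is not how ffa.h is written.

```c
#include <stdint.h>

/* Use the intrinsic only when the compiler was told PCLMULQDQ is available
 * (for example via -mpclmul or a suitable -march value). */
#if defined(__PCLMUL__) && defined(__SSE2__)
#include <immintrin.h>
#define HAVE_PCLMUL 1
#else
#define HAVE_PCLMUL 0
#endif

/* 128-bit product of two 64-bit binary polynomials. */
typedef struct { uint64_t lo, hi; } clmul_result;

/* Portable shift-and-xor carry-less multiply. The mask avoids a branch on
 * the bits of b; the loop is not tuned for speed. */
static clmul_result clmul64_portable(uint64_t a, uint64_t b) {
    clmul_result r = {0, 0};
    for (int i = 0; i < 64; i++) {
        uint64_t mask = (uint64_t)0 - ((b >> i) & 1);   /* all ones if bit i set */
        r.lo ^= (a << i) & mask;
        r.hi ^= ((a >> 1) >> (63 - i)) & mask;          /* a >> (64 - i), safe for i = 0 */
    }
    return r;
}

static clmul_result clmul64(uint64_t a, uint64_t b) {
#if HAVE_PCLMUL
    __m128i x = _mm_set_epi64x(0, (long long)a);
    __m128i y = _mm_set_epi64x(0, (long long)b);
    __m128i p = _mm_clmulepi64_si128(x, y, 0x00);       /* low half times low half */
    uint64_t out[2];
    _mm_storeu_si128((__m128i *)out, p);
    clmul_result r = { out[0], out[1] };
    return r;
#else
    return clmul64_portable(a, b);
#endif
}
```

With plain cc or -march=k8 the fallback branch is compiled; with -march=native on a PCLMULQDQ-capable host the intrinsic branch is compiled. Both branches should return the same product.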

Compiler output

Implementation: crypto_dh/k277taa/ref
Compiler: gcc -m64 -march=barcelona -O2 -fomit-frame-pointer
dh.c: In file included from dh.c:6:0:
dh.c: ffa.h: In function 'ffa_red_277':
dh.c: ffa.h:123:10: error: incompatible types when assigning to type '__m128i' from type 'int'
dh.c: a_01 = _mm_alignr_epi8(a_02, a_01, 8);
dh.c: ^
dh.c: ffa.h:90:10: error: incompatible types when assigning to type '__m128i' from type 'int'
dh.c: m_00 = _mm_clmulepi64_si128(mul, P_00, 0x00);\
dh.c: ^
dh.c: ffa.h:110:5: note: in expansion of macro 'ffa_red_277_stp'
dh.c: ffa_red_277_stp(A##_00,A##_01,A##_02,P##_00,P##_01,P##_02,\
dh.c: ^
dh.c: ffa.h:132:5: note: in expansion of macro 'FFA_RED_277_STP'
dh.c: FFA_RED_277_STP(a,P,m,t_00,mul);
dh.c: ^
dh.c: ffa.h:91:10: error: incompatible types when assigning to type '__m128i' from type 'int'
dh.c: m_01 = _mm_clmulepi64_si128(mul, P_00, 0x10);\
dh.c: ^
dh.c: ffa.h:110:5: note: in expansion of macro 'ffa_red_277_stp'
dh.c: ffa_red_277_stp(A##_00,A##_01,A##_02,P##_00,P##_01,P##_02,\
dh.c: ^
dh.c: ffa.h:132:5: note: in expansion of macro 'FFA_RED_277_STP'
dh.c: FFA_RED_277_STP(a,P,m,t_00,mul);
dh.c: ^
dh.c: ffa.h:92:10: error: incompatible types when assigning to type '__m128i' from type 'int'
dh.c: m_02 = _mm_clmulepi64_si128(mul, P_01, 0x00);\
dh.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
Compiler  Implementations
gcc -m64 -march=barcelona -O2 -fomit-frame-pointer ref
gcc -m64 -march=barcelona -O3 -fomit-frame-pointer ref
gcc -m64 -march=barcelona -O -fomit-frame-pointer ref
gcc -m64 -march=barcelona -Os -fomit-frame-pointer ref
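
Note on the _mm_alignr_epi8 error above: that intrinsic belongs to SSSE3, which the barcelona target does not enable. For the specific call in the log, _mm_alignr_epi8(a_02, a_01, 8), the result is the high 64 bits of a_01 in the low half and the low 64 bits of a_02 in the high half, which SSE2 byte shifts can also produce. The sketch below illustrates that equivalence under the assumption that only the 8-byte case is needed. The type u128 and the function alignr8 are hypothetical names and do not appear in ffa.h.

```c
#include <stdint.h>

#if defined(__SSE2__)
#include <emmintrin.h>   /* SSE2: _mm_slli_si128, _mm_srli_si128 */
#endif
#if defined(__SSSE3__)
#include <tmmintrin.h>   /* SSSE3: _mm_alignr_epi8 */
#endif

/* 128-bit value as two 64-bit words; w[0] holds the low 64 bits. */
typedef struct { uint64_t w[2]; } u128;

/* Equivalent of _mm_alignr_epi8(hi, lo, 8): concatenate hi:lo, shift right
 * by 8 bytes, keep the low 128 bits. */
static u128 alignr8(u128 hi, u128 lo) {
    u128 r;
#if defined(__SSSE3__)
    __m128i h = _mm_loadu_si128((const __m128i *)hi.w);
    __m128i l = _mm_loadu_si128((const __m128i *)lo.w);
    _mm_storeu_si128((__m128i *)r.w, _mm_alignr_epi8(h, l, 8));
#elif defined(__SSE2__)
    __m128i h = _mm_loadu_si128((const __m128i *)hi.w);
    __m128i l = _mm_loadu_si128((const __m128i *)lo.w);
    /* (hi << 64) | (lo >> 64) using whole-register byte shifts */
    __m128i v = _mm_or_si128(_mm_slli_si128(h, 8), _mm_srli_si128(l, 8));
    _mm_storeu_si128((__m128i *)r.w, v);
#else
    r.w[0] = lo.w[1];    /* high word of lo becomes the low word */
    r.w[1] = hi.w[0];    /* low word of hi becomes the high word */
#endif
    return r;
}
```

All three branches compute the same value, so the choice only affects which instructions the compiler may emit for a given -march setting.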

Compiler output

Implementation: crypto_dh/k277taa/ref
Compiler: gcc -m64 -march=core2 -O2 -fomit-frame-pointer
dh.c: In file included from dh.c:6:0:
dh.c: ffa.h: In function 'ffa_red_277':
dh.c: ffa.h:90:10: error: incompatible types when assigning to type '__m128i' from type 'int'
dh.c: m_00 = _mm_clmulepi64_si128(mul, P_00, 0x00);\
dh.c: ^
dh.c: ffa.h:110:5: note: in expansion of macro 'ffa_red_277_stp'
dh.c: ffa_red_277_stp(A##_00,A##_01,A##_02,P##_00,P##_01,P##_02,\
dh.c: ^
dh.c: ffa.h:132:5: note: in expansion of macro 'FFA_RED_277_STP'
dh.c: FFA_RED_277_STP(a,P,m,t_00,mul);
dh.c: ^
dh.c: ffa.h:91:10: error: incompatible types when assigning to type '__m128i' from type 'int'
dh.c: m_01 = _mm_clmulepi64_si128(mul, P_00, 0x10);\
dh.c: ^
dh.c: ffa.h:110:5: note: in expansion of macro 'ffa_red_277_stp'
dh.c: ffa_red_277_stp(A##_00,A##_01,A##_02,P##_00,P##_01,P##_02,\
dh.c: ^
dh.c: ffa.h:132:5: note: in expansion of macro 'FFA_RED_277_STP'
dh.c: FFA_RED_277_STP(a,P,m,t_00,mul);
dh.c: ^
dh.c: ffa.h:92:10: error: incompatible types when assigning to type '__m128i' from type 'int'
dh.c: m_02 = _mm_clmulepi64_si128(mul, P_01, 0x00);\
dh.c: ^
dh.c: ffa.h:110:5: note: in expansion of macro 'ffa_red_277_stp'
dh.c: ffa_red_277_stp(A##_00,A##_01,A##_02,P##_00,P##_01,P##_02,\
dh.c: ...

Number of similar (compiler,implementation) pairs: 16, namely:
Compiler  Implementations
gcc -m64 -march=core2 -O2 -fomit-frame-pointer ref
gcc -m64 -march=core2 -O3 -fomit-frame-pointer ref
gcc -m64 -march=core2 -O -fomit-frame-pointer ref
gcc -m64 -march=core2 -Os -fomit-frame-pointer ref
gcc -m64 -march=core2 -msse4.1 -O2 -fomit-frame-pointer ref
gcc -m64 -march=core2 -msse4.1 -O3 -fomit-frame-pointer ref
gcc -m64 -march=core2 -msse4.1 -O -fomit-frame-pointer ref
gcc -m64 -march=core2 -msse4.1 -Os -fomit-frame-pointer ref
gcc -m64 -march=core2 -msse4 -O2 -fomit-frame-pointer ref
gcc -m64 -march=core2 -msse4 -O3 -fomit-frame-pointer ref
gcc -m64 -march=core2 -msse4 -O -fomit-frame-pointer ref
gcc -m64 -march=core2 -msse4 -Os -fomit-frame-pointer ref
gcc -m64 -march=corei7 -O2 -fomit-frame-pointer ref
gcc -m64 -march=corei7 -O3 -fomit-frame-pointer ref
gcc -m64 -march=corei7 -O -fomit-frame-pointer ref
gcc -m64 -march=corei7 -Os -fomit-frame-pointer ref