Implementation notes: amd64, hydra3, crypto_dh/surf2113

Computer: hydra3
Architecture: amd64
CPU ID: AuthenticAMD-00100fa0-178bfbff
SUPERCOP version: 20171218
Operation: crypto_dh
Primitive: surf2113
TimeImplementationCompilerBenchmark dateSUPERCOP version
2820621mpfqgcc -funroll-loops -march=barcelona -O2 -fomit-frame-pointer2017111720171020
2822605mpfqgcc -funroll-loops -m64 -march=barcelona -O2 -fomit-frame-pointer2017111720171020
2825605mpfqgcc -funroll-loops -m64 -march=barcelona -O3 -fomit-frame-pointer2017111720171020
2829864mpfqgcc -funroll-loops -march=barcelona -O3 -fomit-frame-pointer2017111720171020
2835580mpfqgcc -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer2017111720171020
2838689mpfqgcc -funroll-loops -m64 -O2 -fomit-frame-pointer2017111720171020
2839116mpfqgcc -funroll-loops -O2 -fomit-frame-pointer2017111720171020
2846975mpfqgcc -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer2017111720171020
2849045mpfqgcc -funroll-loops -O3 -fomit-frame-pointer2017111720171020
2849187mpfqgcc -funroll-loops -m64 -O3 -fomit-frame-pointer2017111720171020
2859411mpfqgcc -funroll-loops -m64 -march=k8 -O2 -fomit-frame-pointer2017111720171020
2861985mpfqgcc -funroll-loops -march=k8 -O2 -fomit-frame-pointer2017111720171020
2865403mpfqgcc -funroll-loops -march=k8 -O3 -fomit-frame-pointer2017111720171020
2871999mpfqgcc -funroll-loops -m64 -march=k8 -O3 -fomit-frame-pointer2017111720171020
2992163mpfqgcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv2017111720171020
2992259mpfqgcc -m64 -march=barcelona -O3 -fomit-frame-pointer2017111720171020
2992570mpfqgcc -march=barcelona -O3 -fomit-frame-pointer2017111720171020
2994724mpfqgcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv2017111720171020
2996262mpfqgcc -m64 -march=native -mtune=native -O3 -fomit-frame-pointer2017111720171020
2997322mpfqgcc -m64 -march=barcelona -O2 -fomit-frame-pointer2017111720171020
2997617mpfqgcc -march=barcelona -O2 -fomit-frame-pointer2017111720171020
2999306mpfqgcc -m64 -march=native -mtune=native -O2 -fomit-frame-pointer2017111720171020
3011459mpfqgcc -funroll-loops -m64 -march=barcelona -O -fomit-frame-pointer2017111720171020
3016408mpfqgcc -funroll-loops -march=barcelona -O -fomit-frame-pointer2017111720171020
3020953mpfqgcc -funroll-loops -march=k8 -O -fomit-frame-pointer2017111720171020
3024094mpfqgcc -funroll-loops -m64 -march=k8 -O -fomit-frame-pointer2017111720171020
3033579mpfqgcc -funroll-loops -O -fomit-frame-pointer2017111720171020
3034751mpfqgcc -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer2017111720171020
3035614mpfqgcc -funroll-loops -m64 -O -fomit-frame-pointer2017111720171020
3039612mpfqgcc -m64 -march=barcelona -O -fomit-frame-pointer2017111720171020
3040984mpfqgcc -m64 -march=native -mtune=native -O -fomit-frame-pointer2017111720171020
3042848mpfqgcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv2017111720171020
3043362mpfqgcc -march=barcelona -O -fomit-frame-pointer2017111720171020
3050109mpfqgcc -fno-schedule-insns -O2 -fomit-frame-pointer2017111720171020
3050144mpfqgcc -m64 -march=k8 -O3 -fomit-frame-pointer2017111720171020
3050894mpfqgcc -O2 -fomit-frame-pointer2017111720171020
3051043mpfqgcc -march=k8 -O3 -fomit-frame-pointer2017111720171020
3051381mpfqgcc -m64 -O2 -fomit-frame-pointer2017111720171020
3051510mpfqgcc -m64 -march=k8 -O2 -fomit-frame-pointer2017111720171020
3051848mpfqgcc -m64 -march=k8 -O -fomit-frame-pointer2017111720171020
3052221mpfqgcc -O3 -fomit-frame-pointer2017111720171020
3052496mpfqgcc -m64 -march=core2 -O2 -fomit-frame-pointer2017111720171020
3053065mpfqgcc -fno-schedule-insns -O3 -fomit-frame-pointer2017111720171020
3053296mpfqgcc -march=k8 -O2 -fomit-frame-pointer2017111720171020
3053866mpfqgcc -march=k8 -O -fomit-frame-pointer2017111720171020
3056565mpfqgcc -m64 -O3 -fomit-frame-pointer2017111720171020
3056900mpfqgcc -funroll-loops -march=nocona -O -fomit-frame-pointer2017111720171020
3058054mpfqgcc -funroll-loops -m64 -march=nocona -O -fomit-frame-pointer2017111720171020
3061829mpfqgcc -fno-schedule-insns -O -fomit-frame-pointer2017111720171020
3062397mpfqgcc -O -fomit-frame-pointer2017111720171020
3063182mpfqgcc -m64 -O -fomit-frame-pointer2017111720171020
3079868mpfqgcc -m64 -march=nocona -O -fomit-frame-pointer2017111720171020
3083194mpfqgcc -march=nocona -O -fomit-frame-pointer2017111720171020
3088296mpfqgcc -m64 -march=core2 -O -fomit-frame-pointer2017111720171020
3153022mpfqgcc -funroll-loops -march=nocona -O3 -fomit-frame-pointer2017111720171020
3158444mpfqgcc -funroll-loops -m64 -march=nocona -O3 -fomit-frame-pointer2017111720171020
3173166mpfqgcc -funroll-loops -march=k8 -Os -fomit-frame-pointer2017111720171020
3174285mpfqgcc -funroll-loops -march=nocona -O2 -fomit-frame-pointer2017111720171020
3174704mpfqgcc -funroll-loops -m64 -march=nocona -O2 -fomit-frame-pointer2017111720171020
3174987mpfqgcc -funroll-loops -m64 -march=k8 -Os -fomit-frame-pointer2017111720171020
3187703mpfqgcc -funroll-loops -Os -fomit-frame-pointer2017111720171020
3188200mpfqgcc -m64 -march=nocona -O3 -fomit-frame-pointer2017111720171020
3190950mpfqgcc -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer2017111720171020
3192938mpfqgcc -funroll-loops -m64 -Os -fomit-frame-pointer2017111720171020
3194446mpfqgcc -march=nocona -O3 -fomit-frame-pointer2017111720171020
3204487mpfqgcc -march=nocona -O2 -fomit-frame-pointer2017111720171020
3206109mpfqgcc -funroll-loops -march=barcelona -Os -fomit-frame-pointer2017111720171020
3208298mpfqgcc -funroll-loops -m64 -march=barcelona -Os -fomit-frame-pointer2017111720171020
3209290mpfqgcc -m64 -march=nocona -O2 -fomit-frame-pointer2017111720171020
3388374mpfqgcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv2017111720171020
3449540mpfqgcc -march=barcelona -Os -fomit-frame-pointer2017111720171020
3450196mpfqgcc -m64 -march=barcelona -Os -fomit-frame-pointer2017111720171020
3452399mpfqgcc -m64 -march=native -mtune=native -Os -fomit-frame-pointer2017111720171020
3469569mpfqgcc -march=k8 -Os -fomit-frame-pointer2017111720171020
3475639mpfqgcc -m64 -march=k8 -Os -fomit-frame-pointer2017111720171020
3477524mpfqgcc -m64 -march=core2 -Os -fomit-frame-pointer2017111720171020
3479636mpfqgcc -Os -fomit-frame-pointer2017111720171020
3479772mpfqgcc -m64 -Os -fomit-frame-pointer2017111720171020
3483535mpfqgcc -fno-schedule-insns -Os -fomit-frame-pointer2017111720171020
3511723mpfqgcc -funroll-loops -m64 -march=nocona -Os -fomit-frame-pointer2017111720171020
3516195mpfqgcc -funroll-loops -march=nocona -Os -fomit-frame-pointer2017111720171020
3557634mpfqgcc -m64 -march=nocona -Os -fomit-frame-pointer2017111720171020
3561151mpfqgcc -march=nocona -Os -fomit-frame-pointer2017111720171020
15286227mpfqgcc -funroll-loops2017111720171020
15426434mpfqcc2017111720171020
15426594mpfqgcc2017111720171020

Test failure

Implementation: crypto_dh/surf2113/mpfq
Compiler: gcc -m64 -march=core2 -O3 -fomit-frame-pointer
error 111

Number of similar (compiler,implementation) pairs: 10, namely:
CompilerImplementations
gcc -m64 -march=core2 -O3 -fomit-frame-pointer mpfq
gcc -m64 -march=core2 -msse4.1 -O2 -fomit-frame-pointer mpfq
gcc -m64 -march=core2 -msse4.1 -O -fomit-frame-pointer mpfq
gcc -m64 -march=core2 -msse4.1 -Os -fomit-frame-pointer mpfq
gcc -m64 -march=core2 -msse4 -O2 -fomit-frame-pointer mpfq
gcc -m64 -march=core2 -msse4 -O -fomit-frame-pointer mpfq
gcc -m64 -march=core2 -msse4 -Os -fomit-frame-pointer mpfq
gcc -m64 -march=corei7 -O2 -fomit-frame-pointer mpfq
gcc -m64 -march=corei7 -O -fomit-frame-pointer mpfq
gcc -m64 -march=corei7 -Os -fomit-frame-pointer mpfq

Compiler output

Implementation: crypto_dh/surf2113/mpfq
Compiler: clang -O3 -fomit-frame-pointer -Qunused-arguments
Surf2_113.c: In file included from Surf2_113.c:11:
Surf2_113.c: In file included from ./field.h:1:
Surf2_113.c: In file included from ./mpfq_2_113.h:5:
Surf2_113.c: ./x86_64/mpfq_2_113.h:714:14: error: use of unknown builtin '__builtin_ia32_pslldqi128' [-Wimplicit-function-declaration]
Surf2_113.c: r.s = t0 ^ SHLD(t1, 64);
Surf2_113.c: ^
Surf2_113.c: ./x86_64/mpfq_2_113.h:571:25: note: expanded from macro 'SHLD'
Surf2_113.c: #define SHLD(x,r) (v2di)__builtin_ia32_pslldqi128 ((gcc43bugfix) (x),(r))
Surf2_113.c: ^
Surf2_113.c: ./x86_64/mpfq_2_113.h:714:14: error: invalid conversion between vector type 'v2di' (vector of 2 'uint64_t' values) and integer type 'int' of different size
Surf2_113.c: r.s = t0 ^ SHLD(t1, 64);
Surf2_113.c: ^~~~~~~~~~~~
Surf2_113.c: ./x86_64/mpfq_2_113.h:571:19: note: expanded from macro 'SHLD'
Surf2_113.c: #define SHLD(x,r) (v2di)__builtin_ia32_pslldqi128 ((gcc43bugfix) (x),(r))
Surf2_113.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Surf2_113.c: ./x86_64/mpfq_2_113.h:721:14: error: use of unknown builtin '__builtin_ia32_psrldqi128' [-Wimplicit-function-declaration]
Surf2_113.c: r.s = t2 ^ SHRD(t1, 64);
Surf2_113.c: ^
Surf2_113.c: ./x86_64/mpfq_2_113.h:572:25: note: expanded from macro 'SHRD'
Surf2_113.c: #define SHRD(x,r) (v2di)__builtin_ia32_psrldqi128 ((gcc43bugfix) (x),(r))
Surf2_113.c: ^
Surf2_113.c: ./x86_64/mpfq_2_113.h:721:14: error: invalid conversion between vector type 'v2di' (vector of 2 'uint64_t' values) and integer type 'int' of different size
Surf2_113.c: r.s = t2 ^ SHRD(t1, 64);
Surf2_113.c: ^~~~~~~~~~~~
Surf2_113.c: ./x86_64/mpfq_2_113.h:572:19: note: expanded from macro 'SHRD'
Surf2_113.c: ...

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
clang -O3 -fomit-frame-pointer -Qunused-arguments mpfq
clang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments mpfq
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments mpfq
clang -mcpu=cortex-a8 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments mpfq
clang -mcpu=cortex-a9 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments mpfq
clang -mcpu=native -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments mpfq