Implementation notes: amd64, rumba7, crypto_dh/surf2113

Computer: rumba7
Architecture: amd64
CPU ID: AuthenticAMD-00800f11-178bfbff
SUPERCOP version: 20171218
Operation: crypto_dh
Primitive: surf2113
TimeImplementationCompilerBenchmark dateSUPERCOP version
2153550mpfqgcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv2018053020171218
2154330mpfqgcc -m64 -march=native -mtune=native -O -fomit-frame-pointer2018053020171218
2184480mpfqgcc -m64 -march=native -mtune=native -O3 -fomit-frame-pointer2018053020171218
2193420mpfqgcc -m64 -march=native -mtune=native -O2 -fomit-frame-pointer2018053020171218
2212080mpfqgcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv2018053020171218
2215590mpfqgcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv2018053020171218
2326680mpfqgcc -m64 -march=core2 -msse4 -O -fomit-frame-pointer2018053020171218
2326890mpfqgcc -m64 -march=core2 -msse4.1 -O -fomit-frame-pointer2018053020171218
2335590mpfqgcc -m64 -march=barcelona -O -fomit-frame-pointer2018053020171218
2345550mpfqgcc -march=k8 -O -fomit-frame-pointer2018053020171218
2346270mpfqgcc -march=barcelona -O -fomit-frame-pointer2018053020171218
2347290mpfqgcc -O -fomit-frame-pointer2018053020171218
2347800mpfqgcc -m64 -O -fomit-frame-pointer2018053020171218
2348400mpfqgcc -fno-schedule-insns -O -fomit-frame-pointer2018053020171218
2352510mpfqgcc -m64 -march=core2 -O -fomit-frame-pointer2018053020171218
2360490mpfqgcc -m64 -march=core2 -msse4 -O3 -fomit-frame-pointer2018053020171218
2361120mpfqgcc -m64 -march=k8 -O -fomit-frame-pointer2018053020171218
2361720mpfqgcc -march=barcelona -O2 -fomit-frame-pointer2018053020171218
2362020mpfqgcc -m64 -march=core2 -msse4.1 -O2 -fomit-frame-pointer2018053020171218
2362410mpfqgcc -m64 -march=barcelona -O2 -fomit-frame-pointer2018053020171218
2362830mpfqgcc -m64 -march=core2 -msse4 -O2 -fomit-frame-pointer2018053020171218
2363820mpfqgcc -m64 -march=core2 -msse4.1 -O3 -fomit-frame-pointer2018053020171218
2369340mpfqgcc -funroll-loops -m64 -O2 -fomit-frame-pointer2018053020171218
2374980mpfqgcc -m64 -march=k8 -O2 -fomit-frame-pointer2018053020171218
2377830mpfqgcc -m64 -march=core2 -O2 -fomit-frame-pointer2018053020171218
2378070mpfqgcc -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer2018053020171218
2378400mpfqgcc -funroll-loops -m64 -O3 -fomit-frame-pointer2018053020171218
2378460mpfqgcc -funroll-loops -O3 -fomit-frame-pointer2018053020171218
2380170mpfqgcc -funroll-loops -O2 -fomit-frame-pointer2018053020171218
2380410mpfqgcc -m64 -march=barcelona -O3 -fomit-frame-pointer2018053020171218
2380800mpfqgcc -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer2018053020171218
2381010mpfqgcc -march=barcelona -O3 -fomit-frame-pointer2018053020171218
2383710mpfqgcc -O2 -fomit-frame-pointer2018053020171218
2384220mpfqgcc -fno-schedule-insns -O2 -fomit-frame-pointer2018053020171218
2384280mpfqgcc -m64 -O2 -fomit-frame-pointer2018053020171218
2386500mpfqgcc -funroll-loops -m64 -march=barcelona -O2 -fomit-frame-pointer2018053020171218
2386560mpfqgcc -funroll-loops -march=barcelona -O2 -fomit-frame-pointer2018053020171218
2387880mpfqgcc -funroll-loops -march=barcelona -O -fomit-frame-pointer2018053020171218
2388720mpfqgcc -m64 -march=core2 -O3 -fomit-frame-pointer2018053020171218
2390490mpfqgcc -funroll-loops -m64 -O -fomit-frame-pointer2018053020171218
2390880mpfqgcc -march=k8 -O2 -fomit-frame-pointer2018053020171218
2391270mpfqgcc -funroll-loops -O -fomit-frame-pointer2018053020171218
2392650mpfqgcc -funroll-loops -march=barcelona -O3 -fomit-frame-pointer2018053020171218
2392650mpfqgcc -funroll-loops -march=k8 -O3 -fomit-frame-pointer2018053020171218
2392680mpfqgcc -funroll-loops -m64 -march=barcelona -O3 -fomit-frame-pointer2018053020171218
2392800mpfqgcc -funroll-loops -m64 -march=k8 -O2 -fomit-frame-pointer2018053020171218
2393310mpfqgcc -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer2018053020171218
2393460mpfqgcc -funroll-loops -march=k8 -O2 -fomit-frame-pointer2018053020171218
2393670mpfqgcc -funroll-loops -m64 -march=k8 -O3 -fomit-frame-pointer2018053020171218
2397090mpfqgcc -funroll-loops -march=k8 -O -fomit-frame-pointer2018053020171218
2397900mpfqgcc -funroll-loops -m64 -march=barcelona -O -fomit-frame-pointer2018053020171218
2401170mpfqgcc -march=nocona -O -fomit-frame-pointer2018053020171218
2404020mpfqgcc -O3 -fomit-frame-pointer2018053020171218
2404110mpfqgcc -m64 -O3 -fomit-frame-pointer2018053020171218
2404200mpfqgcc -fno-schedule-insns -O3 -fomit-frame-pointer2018053020171218
2407560mpfqgcc -m64 -march=nocona -O -fomit-frame-pointer2018053020171218
2411640mpfqgcc -funroll-loops -m64 -march=k8 -O -fomit-frame-pointer2018053020171218
2412120mpfqgcc -m64 -march=k8 -O3 -fomit-frame-pointer2018053020171218
2412300mpfqgcc -march=k8 -O3 -fomit-frame-pointer2018053020171218
2430450mpfqgcc -funroll-loops -march=nocona -O -fomit-frame-pointer2018053020171218
2430510mpfqgcc -funroll-loops -m64 -march=nocona -O -fomit-frame-pointer2018053020171218
2442510mpfqgcc -march=nocona -O2 -fomit-frame-pointer2018053020171218
2443050mpfqgcc -m64 -march=nocona -O2 -fomit-frame-pointer2018053020171218
2448150mpfqgcc -m64 -march=native -mtune=native -Os -fomit-frame-pointer2018053020171218
2463900mpfqgcc -m64 -march=nocona -O3 -fomit-frame-pointer2018053020171218
2463930mpfqgcc -march=nocona -O3 -fomit-frame-pointer2018053020171218
2465940mpfqgcc -funroll-loops -m64 -march=nocona -O3 -fomit-frame-pointer2018053020171218
2469930mpfqgcc -funroll-loops -march=nocona -O3 -fomit-frame-pointer2018053020171218
2470530mpfqgcc -funroll-loops -march=nocona -O2 -fomit-frame-pointer2018053020171218
2485710mpfqgcc -funroll-loops -m64 -march=nocona -O2 -fomit-frame-pointer2018053020171218
2487690mpfqgcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv2018053020171218
2515530mpfqgcc -funroll-loops -march=barcelona -Os -fomit-frame-pointer2018053020171218
2517180mpfqgcc -funroll-loops -m64 -march=barcelona -Os -fomit-frame-pointer2018053020171218
2523990mpfqgcc -funroll-loops -m64 -march=k8 -Os -fomit-frame-pointer2018053020171218
2524710mpfqgcc -funroll-loops -march=k8 -Os -fomit-frame-pointer2018053020171218
2526480mpfqgcc -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer2018053020171218
2528070mpfqgcc -funroll-loops -m64 -Os -fomit-frame-pointer2018053020171218
2528910mpfqgcc -funroll-loops -Os -fomit-frame-pointer2018053020171218
2536950mpfqgcc -m64 -march=barcelona -Os -fomit-frame-pointer2018053020171218
2549070mpfqgcc -m64 -march=core2 -msse4 -Os -fomit-frame-pointer2018053020171218
2550630mpfqgcc -m64 -march=core2 -msse4.1 -Os -fomit-frame-pointer2018053020171218
2555730mpfqgcc -march=k8 -Os -fomit-frame-pointer2018053020171218
2556000mpfqgcc -march=barcelona -Os -fomit-frame-pointer2018053020171218
2558640mpfqgcc -m64 -march=k8 -Os -fomit-frame-pointer2018053020171218
2562150mpfqgcc -m64 -march=core2 -Os -fomit-frame-pointer2018053020171218
2573310mpfqgcc -m64 -Os -fomit-frame-pointer2018053020171218
2575080mpfqgcc -fno-schedule-insns -Os -fomit-frame-pointer2018053020171218
2575530mpfqgcc -Os -fomit-frame-pointer2018053020171218
2601150mpfqgcc -funroll-loops -m64 -march=nocona -Os -fomit-frame-pointer2018053020171218
2604990mpfqgcc -funroll-loops -march=nocona -Os -fomit-frame-pointer2018053020171218
2664030mpfqgcc -m64 -march=nocona -Os -fomit-frame-pointer2018053020171218
2666910mpfqgcc -march=nocona -Os -fomit-frame-pointer2018053020171218
2931810mpfqgcc -m64 -march=core-avx2 -O3 -fomit-frame-pointer2018053020171218
2935860mpfqgcc -m64 -march=core-avx-i -O2 -fomit-frame-pointer2018053020171218
2943750mpfqgcc -m64 -march=corei7-avx -O2 -fomit-frame-pointer2018053020171218
2953740mpfqgcc -m64 -march=corei7-avx -O3 -fomit-frame-pointer2018053020171218
2954340mpfqgcc -m64 -march=core-avx2 -O2 -fomit-frame-pointer2018053020171218
2961270mpfqgcc -m64 -march=core-avx-i -O3 -fomit-frame-pointer2018053020171218
3039150mpfqgcc -m64 -march=core-avx2 -O -fomit-frame-pointer2018053020171218
3053370mpfqgcc -m64 -march=core-avx-i -O -fomit-frame-pointer2018053020171218
3067320mpfqgcc -m64 -march=corei7-avx -O -fomit-frame-pointer2018053020171218
3117030mpfqgcc -m64 -march=corei7 -O2 -fomit-frame-pointer2018053020171218
3149760mpfqgcc -m64 -march=corei7 -O3 -fomit-frame-pointer2018053020171218
3237210mpfqgcc -m64 -march=corei7 -Os -fomit-frame-pointer2018053020171218
3243900mpfqgcc -m64 -march=corei7 -O -fomit-frame-pointer2018053020171218
3339960mpfqgcc -m64 -march=core-avx-i -Os -fomit-frame-pointer2018053020171218
3344160mpfqgcc -m64 -march=corei7-avx -Os -fomit-frame-pointer2018053020171218
3370830mpfqgcc -m64 -march=core-avx2 -Os -fomit-frame-pointer2018053020171218
9435810mpfqgcc2018053020171218
9449790mpfqcc2018053020171218
9468930mpfqgcc -funroll-loops2018053020171218

Compiler output

Implementation: crypto_dh/surf2113/mpfq
Compiler: clang -O3 -fomit-frame-pointer -Qunused-arguments
Surf2_113.c: In file included from Surf2_113.c:11:
Surf2_113.c: In file included from ./field.h:1:
Surf2_113.c: In file included from ./mpfq_2_113.h:5:
Surf2_113.c: ./x86_64/mpfq_2_113.h:714:14: error: use of unknown builtin '__builtin_ia32_pslldqi128' [-Wimplicit-function-declaration]
Surf2_113.c: r.s = t0 ^ SHLD(t1, 64);
Surf2_113.c: ^
Surf2_113.c: ./x86_64/mpfq_2_113.h:571:25: note: expanded from macro 'SHLD'
Surf2_113.c: #define SHLD(x,r) (v2di)__builtin_ia32_pslldqi128 ((gcc43bugfix) (x),(r))
Surf2_113.c: ^
Surf2_113.c: ./x86_64/mpfq_2_113.h:714:14: error: invalid conversion between vector type 'v2di' (vector of 2 'uint64_t' values) and integer type 'int' of different size
Surf2_113.c: r.s = t0 ^ SHLD(t1, 64);
Surf2_113.c: ^~~~~~~~~~~~
Surf2_113.c: ./x86_64/mpfq_2_113.h:571:19: note: expanded from macro 'SHLD'
Surf2_113.c: #define SHLD(x,r) (v2di)__builtin_ia32_pslldqi128 ((gcc43bugfix) (x),(r))
Surf2_113.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Surf2_113.c: ./x86_64/mpfq_2_113.h:721:14: error: use of unknown builtin '__builtin_ia32_psrldqi128' [-Wimplicit-function-declaration]
Surf2_113.c: r.s = t2 ^ SHRD(t1, 64);
Surf2_113.c: ^
Surf2_113.c: ./x86_64/mpfq_2_113.h:572:25: note: expanded from macro 'SHRD'
Surf2_113.c: #define SHRD(x,r) (v2di)__builtin_ia32_psrldqi128 ((gcc43bugfix) (x),(r))
Surf2_113.c: ^
Surf2_113.c: ./x86_64/mpfq_2_113.h:721:14: error: invalid conversion between vector type 'v2di' (vector of 2 'uint64_t' values) and integer type 'int' of different size
Surf2_113.c: r.s = t2 ^ SHRD(t1, 64);
Surf2_113.c: ^~~~~~~~~~~~
Surf2_113.c: ./x86_64/mpfq_2_113.h:572:19: note: expanded from macro 'SHRD'
Surf2_113.c: ...

Number of similar (compiler,implementation) pairs: 10, namely:
CompilerImplementations
clang -O3 -fomit-frame-pointer -Qunused-arguments mpfq
clang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments mpfq
clang -O3 -fwrapv -march=x86-64 -mcpu=core-avx2 -mavx2 -maes -mpclmul -fomit-frame-pointer -Qunused-arguments mpfq
clang -O3 -fwrapv -mavx2 -fomit-frame-pointer -Qunused-arguments mpfq
clang -O3 -fwrapv -mavx -fomit-frame-pointer -Qunused-arguments mpfq
clang -O3 -fwrapv -mavx -maes -mpclmul -fomit-frame-pointer -Qunused-arguments mpfq
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments mpfq
clang -mcpu=cortex-a8 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments mpfq
clang -mcpu=cortex-a9 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments mpfq
clang -mcpu=native -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments mpfq