Implementation notes: amd64, skylake, crypto_dh/nist521gs

Computer: skylake
Architecture: amd64
CPU ID: GenuineIntel-000506e3-bfebfbff
SUPERCOP version: 20161026
Operation: crypto_dh
Primitive: nist521gs
TimeImplementationCompilerBenchmark dateSUPERCOP version
3737620refgcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv2016121720161026
3813444refgcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv2016121720161026
3905886refgcc -funroll-loops -m64 -march=barcelona -O3 -fomit-frame-pointer2016121720161026
3910802refgcc -funroll-loops -m64 -march=k8 -O3 -fomit-frame-pointer2016121720161026
3911106refgcc -march=k8 -O3 -fomit-frame-pointer2016121720161026
3913164refgcc -funroll-loops -march=k8 -O3 -fomit-frame-pointer2016121720161026
3918806refgcc -funroll-loops -m64 -O3 -fomit-frame-pointer2016121720161026
3919968refgcc -m64 -O3 -fomit-frame-pointer2016121720161026
3923478refgcc -march=barcelona -O3 -fomit-frame-pointer2016121720161026
3923812refgcc -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer2016121720161026
3924074refgcc -fno-schedule-insns -O3 -fomit-frame-pointer2016121720161026
3924430refgcc -funroll-loops -march=barcelona -O3 -fomit-frame-pointer2016121720161026
3925356refgcc -m64 -march=barcelona -O3 -fomit-frame-pointer2016121720161026
3926388refgcc -m64 -march=core2 -msse4 -O3 -fomit-frame-pointer2016121720161026
3929990refgcc -m64 -march=core2 -O3 -fomit-frame-pointer2016121720161026
3931856refgcc -m64 -march=corei7 -O3 -fomit-frame-pointer2016121720161026
3932116refgcc -m64 -march=core2 -msse4.1 -O3 -fomit-frame-pointer2016121720161026
3932816refgcc -m64 -march=core-avx2 -O3 -fomit-frame-pointer2016121720161026
3932834refgcc -O3 -fomit-frame-pointer2016121720161026
3935162refgcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv2016121720161026
3937206refgcc -funroll-loops -march=nocona -O3 -fomit-frame-pointer2016121720161026
3942152refgcc -m64 -march=core-avx-i -O3 -fomit-frame-pointer2016121720161026
3946956refgcc -m64 -march=k8 -O3 -fomit-frame-pointer2016121720161026
3965334refgcc -march=nocona -O3 -fomit-frame-pointer2016121720161026
3973832refgcc -m64 -march=core-avx2 -Os -fomit-frame-pointer2016121720161026
3978256refgcc -funroll-loops -O3 -fomit-frame-pointer2016121720161026
3986780refgcc -m64 -march=native -mtune=native -Os -fomit-frame-pointer2016121720161026
3986822refgcc -m64 -march=corei7-avx -O3 -fomit-frame-pointer2016121720161026
3994754refgcc -funroll-loops -m64 -march=nocona -O3 -fomit-frame-pointer2016121720161026
3996908refgcc -m64 -march=native -mtune=native -O3 -fomit-frame-pointer2016121720161026
4008472refgcc -m64 -march=k8 -Os -fomit-frame-pointer2016121720161026
4009972refgcc -m64 -march=nocona -Os -fomit-frame-pointer2016121720161026
4011286refgcc -m64 -march=corei7-avx -Os -fomit-frame-pointer2016121720161026
4013006refgcc -m64 -march=core-avx-i -Os -fomit-frame-pointer2016121720161026
4013246refgcc -march=k8 -Os -fomit-frame-pointer2016121720161026
4013882refgcc -m64 -march=barcelona -Os -fomit-frame-pointer2016121720161026
4014358refgcc -march=barcelona -Os -fomit-frame-pointer2016121720161026
4014378refgcc -fno-schedule-insns -Os -fomit-frame-pointer2016121720161026
4014384refgcc -march=nocona -Os -fomit-frame-pointer2016121720161026
4014578refgcc -Os -fomit-frame-pointer2016121720161026
4014660refgcc -m64 -march=nocona -O3 -fomit-frame-pointer2016121720161026
4015598refgcc -m64 -march=core2 -msse4 -Os -fomit-frame-pointer2016121720161026
4016234refgcc -m64 -march=corei7 -Os -fomit-frame-pointer2016121720161026
4016494refgcc -m64 -march=core2 -Os -fomit-frame-pointer2016121720161026
4016614refgcc -m64 -march=core2 -msse4.1 -Os -fomit-frame-pointer2016121720161026
4019100refgcc -funroll-loops -m64 -O2 -fomit-frame-pointer2016121720161026
4020570refgcc -funroll-loops -O2 -fomit-frame-pointer2016121720161026
4021502refgcc -funroll-loops -m64 -march=nocona -Os -fomit-frame-pointer2016121720161026
4023806refgcc -m64 -Os -fomit-frame-pointer2016121720161026
4024900refgcc -funroll-loops -march=k8 -Os -fomit-frame-pointer2016121720161026
4028028refgcc -funroll-loops -march=barcelona -Os -fomit-frame-pointer2016121720161026
4029168refgcc -funroll-loops -m64 -march=barcelona -Os -fomit-frame-pointer2016121720161026
4029532refgcc -funroll-loops -m64 -Os -fomit-frame-pointer2016121720161026
4029546refgcc -funroll-loops -Os -fomit-frame-pointer2016121720161026
4029862refgcc -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer2016121720161026
4030382refgcc -funroll-loops -m64 -march=k8 -Os -fomit-frame-pointer2016121720161026
4033206refgcc -funroll-loops -march=nocona -Os -fomit-frame-pointer2016121720161026
4034638refgcc -funroll-loops -march=barcelona -O2 -fomit-frame-pointer2016121720161026
4036706refgcc -funroll-loops -march=k8 -O2 -fomit-frame-pointer2016121720161026
4044040refgcc -funroll-loops -m64 -march=barcelona -O2 -fomit-frame-pointer2016121720161026
4045216refgcc -O2 -fomit-frame-pointer2016121720161026
4048218refgcc -funroll-loops -march=nocona -O2 -fomit-frame-pointer2016121720161026
4049200refgcc -m64 -march=core2 -msse4 -O2 -fomit-frame-pointer2016121720161026
4049578refgcc -m64 -march=corei7-avx -O2 -fomit-frame-pointer2016121720161026
4050144refgcc -m64 -march=core2 -msse4.1 -O2 -fomit-frame-pointer2016121720161026
4053200refgcc -funroll-loops -m64 -march=nocona -O2 -fomit-frame-pointer2016121720161026
4053560refgcc -m64 -march=core-avx2 -O2 -fomit-frame-pointer2016121720161026
4053834refgcc -m64 -march=core2 -O2 -fomit-frame-pointer2016121720161026
4056214refgcc -m64 -march=barcelona -O2 -fomit-frame-pointer2016121720161026
4056826refgcc -march=k8 -O2 -fomit-frame-pointer2016121720161026
4057378refgcc -m64 -march=core-avx-i -O2 -fomit-frame-pointer2016121720161026
4057662refgcc -march=nocona -O2 -fomit-frame-pointer2016121720161026
4060944refgcc -fno-schedule-insns -O2 -fomit-frame-pointer2016121720161026
4065034refgcc -m64 -march=nocona -O2 -fomit-frame-pointer2016121720161026
4065948refgcc -m64 -march=k8 -O2 -fomit-frame-pointer2016121720161026
4067912refgcc -funroll-loops -m64 -march=k8 -O2 -fomit-frame-pointer2016121720161026
4069630refgcc -m64 -march=corei7 -O2 -fomit-frame-pointer2016121720161026
4069948refgcc -m64 -O2 -fomit-frame-pointer2016121720161026
4070236refclang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments2016121720161026
4072496refgcc -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer2016121720161026
4073454refclang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments2016121720161026
4076830refgcc -march=barcelona -O2 -fomit-frame-pointer2016121720161026
4087114refgcc -m64 -march=native -mtune=native -O2 -fomit-frame-pointer2016121720161026
4181552refclang -O3 -fomit-frame-pointer -Qunused-arguments2016121720161026
4182448refclang -mcpu=cortex-a9 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments2016121720161026
4189896refclang -mcpu=native -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments2016121720161026
4199888refclang -O3 -fwrapv -mavx2 -fomit-frame-pointer -Qunused-arguments2016121720161026
4208830refclang -O3 -fwrapv -march=x86-64 -mcpu=core-avx2 -mavx2 -maes -mpclmul -fomit-frame-pointer -Qunused-arguments2016121720161026
4218296refclang -mcpu=cortex-a8 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments2016121720161026
4227766refclang -O3 -fwrapv -mavx -maes -mpclmul -fomit-frame-pointer -Qunused-arguments2016121720161026
4234448refclang -O3 -fwrapv -mavx -fomit-frame-pointer -Qunused-arguments2016121720161026
6607356refgcc -funroll-loops -march=barcelona -O -fomit-frame-pointer2016121720161026
6616232refgcc -funroll-loops -march=k8 -O -fomit-frame-pointer2016121720161026
6633278refgcc -march=barcelona -O -fomit-frame-pointer2016121720161026
6638366refgcc -funroll-loops -m64 -march=k8 -O -fomit-frame-pointer2016121720161026
6639498refgcc -march=k8 -O -fomit-frame-pointer2016121720161026
6666106refgcc -m64 -march=k8 -O -fomit-frame-pointer2016121720161026
6677970refgcc -funroll-loops -march=nocona -O -fomit-frame-pointer2016121720161026
6681086refgcc -funroll-loops -O -fomit-frame-pointer2016121720161026
6682104refgcc -m64 -march=barcelona -O -fomit-frame-pointer2016121720161026
6686656refgcc -funroll-loops -m64 -march=barcelona -O -fomit-frame-pointer2016121720161026
6696494refgcc -funroll-loops -m64 -march=nocona -O -fomit-frame-pointer2016121720161026
6697002refgcc -m64 -O -fomit-frame-pointer2016121720161026
6707616refgcc -march=nocona -O -fomit-frame-pointer2016121720161026
6710734refgcc -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer2016121720161026
6723524refgcc -m64 -march=core2 -msse4 -O -fomit-frame-pointer2016121720161026
6727448refgcc -m64 -march=core-avx-i -O -fomit-frame-pointer2016121720161026
6732742refgcc -m64 -march=corei7 -O -fomit-frame-pointer2016121720161026
6732856refgcc -m64 -march=corei7-avx -O -fomit-frame-pointer2016121720161026
6738226refgcc -m64 -march=core2 -msse4.1 -O -fomit-frame-pointer2016121720161026
6739252refgcc -m64 -march=core2 -O -fomit-frame-pointer2016121720161026
6747952refgcc -O -fomit-frame-pointer2016121720161026
6756956refgcc -fno-schedule-insns -O -fomit-frame-pointer2016121720161026
6758790refgcc -m64 -march=nocona -O -fomit-frame-pointer2016121720161026
6768660refgcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv2016121720161026
6770066refgcc -funroll-loops -m64 -O -fomit-frame-pointer2016121720161026
6813810refgcc -m64 -march=native -mtune=native -O -fomit-frame-pointer2016121720161026
6855824refgcc -m64 -march=core-avx2 -O -fomit-frame-pointer2016121720161026
15460006refgcc2016121720161026
15498340refgcc -funroll-loops2016121720161026
15585418refcc2016121720161026

Compiler output

Implementation: crypto_dh/nist521gs/ref
Compiler: cc
dh.c: dh.c: In function 'crypto_dh_nist521gs_ref_keypair':
dh.c: dh.c:831:2: warning: implicit declaration of function 'randombytes' [-Wimplicit-function-declaration]
dh.c: randombytes(sk,PANES-1);
dh.c: ^~~~~~~~~~~

Number of similar (compiler,implementation) pairs: 107, namely:
CompilerImplementations
cc ref
gcc ref
gcc -O2 -fomit-frame-pointer ref
gcc -O3 -fomit-frame-pointer ref
gcc -O -fomit-frame-pointer ref
gcc -Os -fomit-frame-pointer ref
gcc -fno-schedule-insns -O2 -fomit-frame-pointer ref
gcc -fno-schedule-insns -O3 -fomit-frame-pointer ref
gcc -fno-schedule-insns -O -fomit-frame-pointer ref
gcc -fno-schedule-insns -Os -fomit-frame-pointer ref
gcc -funroll-loops ref
gcc -funroll-loops -O2 -fomit-frame-pointer ref
gcc -funroll-loops -O3 -fomit-frame-pointer ref
gcc -funroll-loops -O -fomit-frame-pointer ref
gcc -funroll-loops -Os -fomit-frame-pointer ref
gcc -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer ref
gcc -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer ref
gcc -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer ref
gcc -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer ref
gcc -funroll-loops -m64 -O2 -fomit-frame-pointer ref
gcc -funroll-loops -m64 -O3 -fomit-frame-pointer ref
gcc -funroll-loops -m64 -O -fomit-frame-pointer ref
gcc -funroll-loops -m64 -Os -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=barcelona -O2 -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=barcelona -O3 -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=barcelona -O -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=barcelona -Os -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=k8 -O2 -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=k8 -O3 -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=k8 -O -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=k8 -Os -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=nocona -O2 -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=nocona -O3 -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=nocona -O -fomit-frame-pointer ref
gcc -funroll-loops -m64 -march=nocona -Os -fomit-frame-pointer ref
gcc -funroll-loops -march=barcelona -O2 -fomit-frame-pointer ref
gcc -funroll-loops -march=barcelona -O3 -fomit-frame-pointer ref
gcc -funroll-loops -march=barcelona -O -fomit-frame-pointer ref
gcc -funroll-loops -march=barcelona -Os -fomit-frame-pointer ref
gcc -funroll-loops -march=k8 -O2 -fomit-frame-pointer ref
gcc -funroll-loops -march=k8 -O3 -fomit-frame-pointer ref
gcc -funroll-loops -march=k8 -O -fomit-frame-pointer ref
gcc -funroll-loops -march=k8 -Os -fomit-frame-pointer ref
gcc -funroll-loops -march=nocona -O2 -fomit-frame-pointer ref
gcc -funroll-loops -march=nocona -O3 -fomit-frame-pointer ref
gcc -funroll-loops -march=nocona -O -fomit-frame-pointer ref
gcc -funroll-loops -march=nocona -Os -fomit-frame-pointer ref
gcc -m64 -O2 -fomit-frame-pointer ref
gcc -m64 -O3 -fomit-frame-pointer ref
gcc -m64 -O -fomit-frame-pointer ref
gcc -m64 -Os -fomit-frame-pointer ref
gcc -m64 -march=core-avx-i -O2 -fomit-frame-pointer ref
gcc -m64 -march=core-avx-i -O3 -fomit-frame-pointer ref
gcc -m64 -march=core-avx-i -O -fomit-frame-pointer ref
gcc -m64 -march=core-avx-i -Os -fomit-frame-pointer ref
gcc -m64 -march=core-avx2 -O2 -fomit-frame-pointer ref
gcc -m64 -march=core-avx2 -O3 -fomit-frame-pointer ref
gcc -m64 -march=core-avx2 -O -fomit-frame-pointer ref
gcc -m64 -march=core-avx2 -Os -fomit-frame-pointer ref
gcc -m64 -march=core2 -O2 -fomit-frame-pointer ref
gcc -m64 -march=core2 -O3 -fomit-frame-pointer ref
gcc -m64 -march=core2 -O -fomit-frame-pointer ref
gcc -m64 -march=core2 -Os -fomit-frame-pointer ref
gcc -m64 -march=core2 -msse4.1 -O2 -fomit-frame-pointer ref
gcc -m64 -march=core2 -msse4.1 -O3 -fomit-frame-pointer ref
gcc -m64 -march=core2 -msse4.1 -O -fomit-frame-pointer ref
gcc -m64 -march=core2 -msse4.1 -Os -fomit-frame-pointer ref
gcc -m64 -march=core2 -msse4 -O2 -fomit-frame-pointer ref
gcc -m64 -march=core2 -msse4 -O3 -fomit-frame-pointer ref
gcc -m64 -march=core2 -msse4 -O -fomit-frame-pointer ref
gcc -m64 -march=core2 -msse4 -Os -fomit-frame-pointer ref
gcc -m64 -march=corei7-avx -O2 -fomit-frame-pointer ref
gcc -m64 -march=corei7-avx -O3 -fomit-frame-pointer ref
gcc -m64 -march=corei7-avx -O -fomit-frame-pointer ref
gcc -m64 -march=corei7-avx -Os -fomit-frame-pointer ref
gcc -m64 -march=corei7 -O2 -fomit-frame-pointer ref
gcc -m64 -march=corei7 -O3 -fomit-frame-pointer ref
gcc -m64 -march=corei7 -O -fomit-frame-pointer ref
gcc -m64 -march=corei7 -Os -fomit-frame-pointer ref
gcc -m64 -march=k8 -O2 -fomit-frame-pointer ref
gcc -m64 -march=k8 -O3 -fomit-frame-pointer ref
gcc -m64 -march=k8 -O -fomit-frame-pointer ref
gcc -m64 -march=k8 -Os -fomit-frame-pointer ref
gcc -m64 -march=native -mtune=native -O2 -fomit-frame-pointer ref
gcc -m64 -march=native -mtune=native -O3 -fomit-frame-pointer ref
gcc -m64 -march=native -mtune=native -O -fomit-frame-pointer ref
gcc -m64 -march=native -mtune=native -Os -fomit-frame-pointer ref
gcc -m64 -march=nocona -O2 -fomit-frame-pointer ref
gcc -m64 -march=nocona -O3 -fomit-frame-pointer ref
gcc -m64 -march=nocona -O -fomit-frame-pointer ref
gcc -m64 -march=nocona -Os -fomit-frame-pointer ref
gcc -march=barcelona -O2 -fomit-frame-pointer ref
gcc -march=barcelona -O3 -fomit-frame-pointer ref
gcc -march=barcelona -O -fomit-frame-pointer ref
gcc -march=barcelona -Os -fomit-frame-pointer ref
gcc -march=k8 -O2 -fomit-frame-pointer ref
gcc -march=k8 -O3 -fomit-frame-pointer ref
gcc -march=k8 -O -fomit-frame-pointer ref
gcc -march=k8 -Os -fomit-frame-pointer ref
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv ref
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv ref
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv ref
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv ref
gcc -march=nocona -O2 -fomit-frame-pointer ref
gcc -march=nocona -O3 -fomit-frame-pointer ref
gcc -march=nocona -O -fomit-frame-pointer ref
gcc -march=nocona -Os -fomit-frame-pointer ref

Compiler output

Implementation: crypto_dh/nist521gs/ref
Compiler: clang -O3 -fomit-frame-pointer -Qunused-arguments
dh.c: dh.c:831:2: warning: implicit declaration of function 'randombytes' is invalid in C99 [-Wimplicit-function-declaration]
dh.c: randombytes(sk,PANES-1);
dh.c: ^
dh.c: 1 warning generated.

Number of similar (compiler,implementation) pairs: 10, namely:
CompilerImplementations
clang -O3 -fomit-frame-pointer -Qunused-arguments ref
clang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments ref
clang -O3 -fwrapv -march=x86-64 -mcpu=core-avx2 -mavx2 -maes -mpclmul -fomit-frame-pointer -Qunused-arguments ref
clang -O3 -fwrapv -mavx2 -fomit-frame-pointer -Qunused-arguments ref
clang -O3 -fwrapv -mavx -fomit-frame-pointer -Qunused-arguments ref
clang -O3 -fwrapv -mavx -maes -mpclmul -fomit-frame-pointer -Qunused-arguments ref
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments ref
clang -mcpu=cortex-a8 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments ref
clang -mcpu=cortex-a9 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments ref
clang -mcpu=native -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments ref

Compiler output

Implementation: crypto_dh/nist521gs/ref
Compiler: gcc -m64 -march=barcelona -O2 -fomit-frame-pointer
dh.c: dh.c: In function 'crypto_dh_nist521gs_ref_keypair':
dh.c: dh.c:831:2: warning: implicit declaration of function 'randombytes' [-Wimplicit-function-declaration]
dh.c: randombytes(sk,PANES-1);
dh.c: ^~~~~~~~~~~
dh.c: dh.c: In function 'crypto_dh_nist521gs_ref_keypair':
dh.c: dh.c:831:2: warning: implicit declaration of function 'randombytes' [-Wimplicit-function-declaration]
dh.c: randombytes(sk,PANES-1);
dh.c: ^~~~~~~~~~~

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -m64 -march=barcelona -O2 -fomit-frame-pointer ref
gcc -m64 -march=barcelona -O3 -fomit-frame-pointer ref
gcc -m64 -march=barcelona -O -fomit-frame-pointer ref
gcc -m64 -march=barcelona -Os -fomit-frame-pointer ref