Implementation notes: amd64, mangetsu, crypto_core/salsa2012

Computer: mangetsu
Architecture: amd64
CPU ID: GenuineIntel-000206a7-bfebfbff
SUPERCOP version: 20141014
Operation: crypto_core
Primitive: salsa2012
TimeImplementationCompilerBenchmark dateSUPERCOP version
468refgcc-4.7 -funroll-loops -O2 -fomit-frame-pointer2014091120140907
468refgcc-4.7 -funroll-loops -O3 -fomit-frame-pointer2014091120140907
468refgcc-4.7 -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer2014091120140907
468refgcc-4.7 -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer2014091120140907
468refgcc-4.7 -funroll-loops -m64 -O3 -fomit-frame-pointer2014091120140907
468refgcc-4.7 -funroll-loops -m64 -march=barcelona -O2 -fomit-frame-pointer2014091120140907
468refgcc-4.7 -funroll-loops -m64 -march=k8 -O2 -fomit-frame-pointer2014091120140907
468refgcc-4.7 -funroll-loops -march=k8 -O2 -fomit-frame-pointer2014091120140907
472refgcc-4.7 -O3 -fomit-frame-pointer2014091120140907
472refgcc-4.7 -fno-schedule-insns -O3 -fomit-frame-pointer2014091120140907
472refgcc-4.7 -funroll-loops -m64 -O2 -fomit-frame-pointer2014091120140907
472refgcc-4.7 -funroll-loops -march=barcelona -O2 -fomit-frame-pointer2014091120140907
472refgcc-4.7 -m64 -march=barcelona -O3 -fomit-frame-pointer2014091120140907
472refgcc-4.7 -m64 -march=core-avx-i -O3 -fomit-frame-pointer2014091120140907
472refgcc-4.7 -m64 -march=core2 -O3 -fomit-frame-pointer2014091120140907
472refgcc-4.7 -m64 -march=core2 -msse4.1 -O3 -fomit-frame-pointer2014091120140907
472refgcc-4.7 -m64 -march=core2 -msse4 -O3 -fomit-frame-pointer2014091120140907
472refgcc-4.7 -m64 -march=corei7-avx -O3 -fomit-frame-pointer2014091120140907
472refgcc-4.7 -m64 -march=corei7 -O3 -fomit-frame-pointer2014091120140907
472refgcc-4.7 -m64 -march=k8 -O3 -fomit-frame-pointer2014091120140907
472refgcc-4.7 -m64 -march=native -mtune=native -O3 -fomit-frame-pointer2014091120140907
472refgcc-4.7 -march=barcelona -O3 -fomit-frame-pointer2014091120140907
472refgcc-4.7 -march=k8 -O3 -fomit-frame-pointer2014091120140907
476refclang -O3 -fomit-frame-pointer2014091120140907
476refclang -O3 -fwrapv -march=native -fomit-frame-pointer2014091120140907
476refclang -O3 -fwrapv -march=x86-64 -mcpu=core-avx2 -mavx2 -maes -mpclmul -fomit-frame-pointer2014101820141014
476refclang -O3 -fwrapv -mavx2 -fomit-frame-pointer2014091120140907
476refclang -O3 -fwrapv -mavx -fomit-frame-pointer2014091120140907
476refclang -O3 -fwrapv -mavx -maes -mpclmul -fomit-frame-pointer2014101820141014
476refgcc-4.7 -funroll-loops -m64 -march=nocona -O2 -fomit-frame-pointer2014091120140907
476refgcc-4.7 -funroll-loops -m64 -march=nocona -O3 -fomit-frame-pointer2014091120140907
476refgcc-4.7 -funroll-loops -march=nocona -O3 -fomit-frame-pointer2014091120140907
476refgcc-4.7 -m64 -O3 -fomit-frame-pointer2014091120140907
476refgcc-4.7 -m64 -march=nocona -O3 -fomit-frame-pointer2014091120140907
476refgcc-4.7 -march=nocona -O3 -fomit-frame-pointer2014091120140907
480refgcc-4.7 -funroll-loops -march=nocona -O2 -fomit-frame-pointer2014091120140907
488refgcc-4.7 -funroll-loops -m64 -march=k8 -O3 -fomit-frame-pointer2014091120140907
492refgcc-4.7 -funroll-loops -m64 -march=barcelona -O3 -fomit-frame-pointer2014091120140907
492refgcc-4.7 -funroll-loops -march=barcelona -O3 -fomit-frame-pointer2014091120140907
492refgcc-4.7 -funroll-loops -march=k8 -O3 -fomit-frame-pointer2014091120140907
504refclang -O3 -fwrapv -march=native -fpolly -funroll-loops -fvectorize -fslp-vectorize -fslp-vectorize-aggressive -fomit-frame-pointer2014091120140907
508refclang -O3 -fwrapv -mavx -fpolly -funroll-loops -fvectorize -fslp-vectorize -fslp-vectorize-aggressive -fomit-frame-pointer2014091120140907
508refgcc-4.7 -m64 -march=core2 -O2 -fomit-frame-pointer2014091120140907
512refclang -O3 -fwrapv -mavx -maes -mpclmul -fpolly -funroll-loops -fvectorize -fslp-vectorize -fslp-vectorize-aggressive -fomit-frame-pointer2014101820141014
512refgcc-4.7 -O2 -fomit-frame-pointer2014091120140907
512refgcc-4.7 -fno-schedule-insns -O2 -fomit-frame-pointer2014091120140907
512refgcc-4.7 -m64 -O2 -fomit-frame-pointer2014091120140907
512refgcc-4.7 -m64 -march=core-avx-i -O2 -fomit-frame-pointer2014091120140907
512refgcc-4.7 -m64 -march=core2 -msse4.1 -O2 -fomit-frame-pointer2014091120140907
512refgcc-4.7 -m64 -march=corei7 -O2 -fomit-frame-pointer2014091120140907
512refgcc-4.7 -m64 -march=k8 -O2 -fomit-frame-pointer2014091120140907
512refgcc-4.7 -m64 -march=nocona -O2 -fomit-frame-pointer2014091120140907
512refgcc-4.7 -march=barcelona -O2 -fomit-frame-pointer2014091120140907
516refgcc-4.7 -m64 -march=barcelona -O2 -fomit-frame-pointer2014091120140907
516refgcc-4.7 -m64 -march=native -mtune=native -O2 -fomit-frame-pointer2014091120140907
516refgcc-4.7 -march=k8 -O2 -fomit-frame-pointer2014091120140907
520refgcc-4.7 -m64 -march=corei7-avx -O2 -fomit-frame-pointer2014091120140907
520refgcc-4.7 -march=nocona -O2 -fomit-frame-pointer2014091120140907
528refgcc-4.8 -fno-schedule-insns -O -fomit-frame-pointer2014091120140907
528refgcc-4.8 -m64 -march=core-avx-i -O -fomit-frame-pointer2014091120140907
528refgcc-4.8 -m64 -march=corei7 -O -fomit-frame-pointer2014091120140907
528refgcc-4.8 -m64 -march=native -mtune=native -O -fomit-frame-pointer2014091120140907
528refgcc-4.8 -m64 -march=nocona -O -fomit-frame-pointer2014091120140907
528refgcc-4.8 -march=nocona -O -fomit-frame-pointer2014091120140907
532refgcc-4.7 -m64 -march=core2 -msse4 -O2 -fomit-frame-pointer2014091120140907
532refgcc-4.8 -funroll-loops -O -fomit-frame-pointer2014091120140907
532refgcc-4.8 -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer2014091120140907
532refgcc-4.8 -funroll-loops -m64 -O -fomit-frame-pointer2014091120140907
532refgcc-4.8 -m64 -O -fomit-frame-pointer2014091120140907
532refgcc-4.8 -m64 -march=core2 -msse4 -O -fomit-frame-pointer2014091120140907
532refgcc-4.8 -march=barcelona -O -fomit-frame-pointer2014091120140907
536refgcc-4.8 -O -fomit-frame-pointer2014091120140907
536refgcc-4.8 -funroll-loops -m64 -march=barcelona -O -fomit-frame-pointer2014091120140907
536refgcc-4.8 -funroll-loops -m64 -march=k8 -O -fomit-frame-pointer2014091120140907
536refgcc-4.8 -funroll-loops -march=barcelona -O -fomit-frame-pointer2014091120140907
536refgcc-4.8 -m64 -march=barcelona -O -fomit-frame-pointer2014091120140907
536refgcc-4.8 -m64 -march=core2 -O -fomit-frame-pointer2014091120140907
536refgcc-4.8 -m64 -march=corei7-avx -O -fomit-frame-pointer2014091120140907
536refgcc-4.8 -m64 -march=k8 -O -fomit-frame-pointer2014091120140907
540refgcc-4.8 -funroll-loops -m64 -march=nocona -O -fomit-frame-pointer2014091120140907
540refgcc-4.8 -funroll-loops -march=k8 -O -fomit-frame-pointer2014091120140907
540refgcc-4.8 -funroll-loops -march=nocona -O -fomit-frame-pointer2014091120140907
540refgcc-4.8 -m64 -march=core2 -msse4.1 -O -fomit-frame-pointer2014091120140907
540refgcc-4.8 -march=k8 -O -fomit-frame-pointer2014091120140907
580refgcc-4.7 -fno-schedule-insns -Os -fomit-frame-pointer2014091120140907
580refgcc-4.7 -funroll-loops -m64 -march=nocona -Os -fomit-frame-pointer2014091120140907
580refgcc-4.7 -funroll-loops -march=barcelona -Os -fomit-frame-pointer2014091120140907
580refgcc-4.7 -funroll-loops -march=nocona -Os -fomit-frame-pointer2014091120140907
580refgcc-4.7 -m64 -march=barcelona -Os -fomit-frame-pointer2014091120140907
580refgcc-4.7 -march=barcelona -Os -fomit-frame-pointer2014091120140907
584refgcc-4.7 -Os -fomit-frame-pointer2014091120140907
584refgcc-4.7 -funroll-loops -Os -fomit-frame-pointer2014091120140907
584refgcc-4.7 -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer2014091120140907
584refgcc-4.7 -funroll-loops -m64 -march=barcelona -Os -fomit-frame-pointer2014091120140907
584refgcc-4.7 -funroll-loops -m64 -march=k8 -Os -fomit-frame-pointer2014091120140907
584refgcc-4.7 -funroll-loops -march=k8 -Os -fomit-frame-pointer2014091120140907
584refgcc-4.7 -m64 -march=core-avx-i -Os -fomit-frame-pointer2014091120140907
584refgcc-4.7 -m64 -march=core2 -Os -fomit-frame-pointer2014091120140907
584refgcc-4.7 -m64 -march=core2 -msse4.1 -Os -fomit-frame-pointer2014091120140907
584refgcc-4.7 -m64 -march=corei7-avx -Os -fomit-frame-pointer2014091120140907
584refgcc-4.7 -m64 -march=corei7 -Os -fomit-frame-pointer2014091120140907
584refgcc-4.7 -m64 -march=native -mtune=native -Os -fomit-frame-pointer2014091120140907
584refgcc-4.7 -m64 -march=nocona -Os -fomit-frame-pointer2014091120140907
584refgcc-4.7 -march=nocona -Os -fomit-frame-pointer2014091120140907
588refgcc-4.7 -funroll-loops -m64 -Os -fomit-frame-pointer2014091120140907
588refgcc-4.7 -m64 -Os -fomit-frame-pointer2014091120140907
588refgcc-4.7 -m64 -march=core2 -msse4 -Os -fomit-frame-pointer2014091120140907
588refgcc-4.7 -m64 -march=k8 -Os -fomit-frame-pointer2014091120140907
588refgcc-4.7 -march=k8 -Os -fomit-frame-pointer2014091120140907
784refgcc-4.8 -funroll-loops -O2 -fomit-frame-pointer2014091120140907
788refgcc-4.8 -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer2014091120140907
788refgcc-4.8 -funroll-loops -m64 -O2 -fomit-frame-pointer2014091120140907
788refgcc-4.8 -funroll-loops -march=k8 -O2 -fomit-frame-pointer2014091120140907
792refgcc-4.8 -funroll-loops -O3 -fomit-frame-pointer2014091120140907
792refgcc-4.8 -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer2014091120140907
792refgcc-4.8 -funroll-loops -m64 -O3 -fomit-frame-pointer2014091120140907
792refgcc-4.8 -funroll-loops -m64 -march=barcelona -O2 -fomit-frame-pointer2014091120140907
792refgcc-4.8 -funroll-loops -m64 -march=k8 -O2 -fomit-frame-pointer2014091120140907
792refgcc-4.8 -funroll-loops -march=barcelona -O2 -fomit-frame-pointer2014091120140907
800refgcc-4.8 -funroll-loops -m64 -march=k8 -O3 -fomit-frame-pointer2014091120140907
800refgcc-4.8 -funroll-loops -march=k8 -O3 -fomit-frame-pointer2014091120140907
800refgcc-4.8 -m64 -march=core2 -O3 -fomit-frame-pointer2014091120140907
800refgcc-4.8 -m64 -march=core2 -msse4 -O3 -fomit-frame-pointer2014091120140907
800refgcc-4.8 -m64 -march=corei7 -O3 -fomit-frame-pointer2014091120140907
804refgcc-4.8 -funroll-loops -m64 -march=barcelona -O3 -fomit-frame-pointer2014091120140907
804refgcc-4.8 -funroll-loops -march=barcelona -O3 -fomit-frame-pointer2014091120140907
804refgcc-4.8 -m64 -march=core-avx-i -O3 -fomit-frame-pointer2014091120140907
804refgcc-4.8 -m64 -march=core2 -msse4.1 -O2 -fomit-frame-pointer2014091120140907
804refgcc-4.8 -m64 -march=core2 -msse4.1 -O3 -fomit-frame-pointer2014091120140907
804refgcc-4.8 -m64 -march=corei7-avx -O2 -fomit-frame-pointer2014091120140907
804refgcc-4.8 -m64 -march=corei7-avx -O3 -fomit-frame-pointer2014091120140907
804refgcc-4.8 -m64 -march=corei7 -O2 -fomit-frame-pointer2014091120140907
808refgcc-4.8 -fno-schedule-insns -O2 -fomit-frame-pointer2014091120140907
808refgcc-4.8 -m64 -O2 -fomit-frame-pointer2014091120140907
808refgcc-4.8 -m64 -march=core-avx-i -O2 -fomit-frame-pointer2014091120140907
808refgcc-4.8 -m64 -march=core2 -O2 -fomit-frame-pointer2014091120140907
808refgcc-4.8 -m64 -march=native -mtune=native -O2 -fomit-frame-pointer2014091120140907
808refgcc-4.8 -m64 -march=native -mtune=native -O3 -fomit-frame-pointer2014091120140907
812refgcc-4.8 -O3 -fomit-frame-pointer2014091120140907
812refgcc-4.8 -fno-schedule-insns -O3 -fomit-frame-pointer2014091120140907
812refgcc-4.8 -m64 -O3 -fomit-frame-pointer2014091120140907
812refgcc-4.8 -m64 -march=core2 -msse4 -O2 -fomit-frame-pointer2014091120140907
812refgcc-4.8 -m64 -march=k8 -O2 -fomit-frame-pointer2014091120140907
816refgcc-4.8 -O2 -fomit-frame-pointer2014091120140907
816refgcc-4.8 -m64 -march=k8 -O3 -fomit-frame-pointer2014091120140907
816refgcc-4.8 -march=barcelona -O2 -fomit-frame-pointer2014091120140907
816refgcc-4.8 -march=k8 -O2 -fomit-frame-pointer2014091120140907
816refgcc-4.8 -march=k8 -O3 -fomit-frame-pointer2014091120140907
820refgcc-4.8 -m64 -march=barcelona -O2 -fomit-frame-pointer2014091120140907
820refgcc-4.8 -m64 -march=barcelona -O3 -fomit-frame-pointer2014091120140907
820refgcc-4.8 -march=barcelona -O3 -fomit-frame-pointer2014091120140907
844refgcc-4.8 -march=nocona -O2 -fomit-frame-pointer2014091120140907
852refgcc-4.8 -funroll-loops -m64 -march=nocona -O3 -fomit-frame-pointer2014091120140907
852refgcc-4.8 -funroll-loops -march=nocona -O2 -fomit-frame-pointer2014091120140907
856refgcc-4.8 -funroll-loops -m64 -march=nocona -O2 -fomit-frame-pointer2014091120140907
856refgcc-4.8 -funroll-loops -march=nocona -O3 -fomit-frame-pointer2014091120140907
856refgcc-4.8 -m64 -march=nocona -O2 -fomit-frame-pointer2014091120140907
860refgcc-4.8 -m64 -march=nocona -O3 -fomit-frame-pointer2014091120140907
860refgcc-4.8 -march=nocona -O3 -fomit-frame-pointer2014091120140907
948refgcc-4.8 -funroll-loops -Os -fomit-frame-pointer2014091120140907
948refgcc-4.8 -funroll-loops -m64 -march=barcelona -Os -fomit-frame-pointer2014091120140907
952refgcc-4.8 -Os -fomit-frame-pointer2014091120140907
952refgcc-4.8 -funroll-loops -m64 -march=k8 -Os -fomit-frame-pointer2014091120140907
952refgcc-4.8 -funroll-loops -march=barcelona -Os -fomit-frame-pointer2014091120140907
952refgcc-4.8 -funroll-loops -march=k8 -Os -fomit-frame-pointer2014091120140907
952refgcc-4.8 -m64 -march=k8 -Os -fomit-frame-pointer2014091120140907
952refgcc-4.8 -march=k8 -Os -fomit-frame-pointer2014091120140907
956refgcc-4.8 -fno-schedule-insns -Os -fomit-frame-pointer2014091120140907
956refgcc-4.8 -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer2014091120140907
956refgcc-4.8 -funroll-loops -m64 -Os -fomit-frame-pointer2014091120140907
956refgcc-4.8 -m64 -Os -fomit-frame-pointer2014091120140907
956refgcc-4.8 -m64 -march=barcelona -Os -fomit-frame-pointer2014091120140907
956refgcc-4.8 -m64 -march=core-avx-i -Os -fomit-frame-pointer2014091120140907
956refgcc-4.8 -m64 -march=core2 -Os -fomit-frame-pointer2014091120140907
956refgcc-4.8 -m64 -march=core2 -msse4.1 -Os -fomit-frame-pointer2014091120140907
956refgcc-4.8 -m64 -march=corei7-avx -Os -fomit-frame-pointer2014091120140907
956refgcc-4.8 -m64 -march=corei7 -Os -fomit-frame-pointer2014091120140907
956refgcc-4.8 -march=barcelona -Os -fomit-frame-pointer2014091120140907
956refgcc-4.8 -march=nocona -Os -fomit-frame-pointer2014091120140907
960refgcc-4.8 -funroll-loops -m64 -march=nocona -Os -fomit-frame-pointer2014091120140907
960refgcc-4.8 -funroll-loops -march=nocona -Os -fomit-frame-pointer2014091120140907
960refgcc-4.8 -m64 -march=core2 -msse4 -Os -fomit-frame-pointer2014091120140907
960refgcc-4.8 -m64 -march=native -mtune=native -Os -fomit-frame-pointer2014091120140907
960refgcc-4.8 -m64 -march=nocona -Os -fomit-frame-pointer2014091120140907
1160refgcc-4.7 -m64 -march=k8 -O -fomit-frame-pointer2014091120140907
1160refgcc-4.7 -march=k8 -O -fomit-frame-pointer2014091120140907
1164refgcc-4.7 -funroll-loops -m64 -march=nocona -O -fomit-frame-pointer2014091120140907
1164refgcc-4.7 -funroll-loops -march=nocona -O -fomit-frame-pointer2014091120140907
1164refgcc-4.7 -march=barcelona -O -fomit-frame-pointer2014091120140907
1164refgcc-4.7 -march=nocona -O -fomit-frame-pointer2014091120140907
1168refgcc-4.7 -O -fomit-frame-pointer2014091120140907
1168refgcc-4.7 -fno-schedule-insns -O -fomit-frame-pointer2014091120140907
1168refgcc-4.7 -m64 -O -fomit-frame-pointer2014091120140907
1168refgcc-4.7 -m64 -march=core-avx-i -O -fomit-frame-pointer2014091120140907
1168refgcc-4.7 -m64 -march=core-avx2 -O -fomit-frame-pointer2014091120140907
1168refgcc-4.7 -m64 -march=core2 -O -fomit-frame-pointer2014091120140907
1168refgcc-4.7 -m64 -march=core2 -msse4.1 -O -fomit-frame-pointer2014091120140907
1168refgcc-4.7 -m64 -march=core2 -msse4 -O -fomit-frame-pointer2014091120140907
1168refgcc-4.7 -m64 -march=corei7-avx -O -fomit-frame-pointer2014091120140907
1168refgcc-4.7 -m64 -march=corei7 -O -fomit-frame-pointer2014091120140907
1168refgcc-4.7 -m64 -march=native -mtune=native -O -fomit-frame-pointer2014091120140907
1168refgcc-4.7 -m64 -march=nocona -O -fomit-frame-pointer2014091120140907
1172refgcc-4.7 -funroll-loops -O -fomit-frame-pointer2014091120140907
1172refgcc-4.7 -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer2014091120140907
1172refgcc-4.7 -funroll-loops -m64 -O -fomit-frame-pointer2014091120140907
1172refgcc-4.7 -funroll-loops -m64 -march=barcelona -O -fomit-frame-pointer2014091120140907
1172refgcc-4.7 -funroll-loops -m64 -march=k8 -O -fomit-frame-pointer2014091120140907
1172refgcc-4.7 -funroll-loops -march=barcelona -O -fomit-frame-pointer2014091120140907
1172refgcc-4.7 -funroll-loops -march=k8 -O -fomit-frame-pointer2014091120140907
1172refgcc-4.7 -m64 -march=barcelona -O -fomit-frame-pointer2014091120140907
2836refgcc-4.82014091120140907
2896refgcc-4.8 -funroll-loops2014091120140907
3044refgcc-4.72014091120140907
3104refgcc-4.7 -funroll-loops2014091120140907
3116refcc2014091120140907

Test failure

Implementation: crypto_core/salsa2012/ref
Compiler: gcc-4.7 -m64 -march=core-avx2 -O2 -fomit-frame-pointer
error 111

Number of similar (compiler,implementation) pairs: 5, namely:
CompilerImplementations
gcc-4.7 -m64 -march=core-avx2 -O2 -fomit-frame-pointer ref
gcc-4.7 -m64 -march=core-avx2 -Os -fomit-frame-pointer ref
gcc-4.8 -m64 -march=core-avx2 -O2 -fomit-frame-pointer ref
gcc-4.8 -m64 -march=core-avx2 -O -fomit-frame-pointer ref
gcc-4.8 -m64 -march=core-avx2 -Os -fomit-frame-pointer ref

Compiler output

Implementation: crypto_core/salsa2012/ref
Compiler: clang -O3 -fwrapv -march=native -fpolly -funroll-loops -fvectorize -fslp-vectorize -fslp-vectorize-aggressive -fomit-frame-pointer
core.c: clang-3.3: warning: argument unused during compilation: '-fpolly'

Number of similar (compiler,implementation) pairs: 3, namely:
CompilerImplementations
clang -O3 -fwrapv -march=native -fpolly -funroll-loops -fvectorize -fslp-vectorize -fslp-vectorize-aggressive -fomit-frame-pointer ref
clang -O3 -fwrapv -mavx -fpolly -funroll-loops -fvectorize -fslp-vectorize -fslp-vectorize-aggressive -fomit-frame-pointer ref
clang -O3 -fwrapv -mavx -maes -mpclmul -fpolly -funroll-loops -fvectorize -fslp-vectorize -fslp-vectorize-aggressive -fomit-frame-pointer ref

Compiler output

Implementation: crypto_core/salsa2012/ref
Compiler: clang -O3 -fwrapv -march=x86-64 -mcpu=core-avx2 -mavx2 -maes -mpclmul -fomit-frame-pointer
core.c: clang-3.3: warning: argument unused during compilation: '-mcpu=core-avx2'

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -O3 -fwrapv -march=x86-64 -mcpu=core-avx2 -mavx2 -maes -mpclmul -fomit-frame-pointer ref