Implementation notes: amd64, mangetsu, crypto_core/salsa208

Computer: mangetsu
Architecture: amd64
CPU ID: GenuineIntel-000206a7-bfebfbff
SUPERCOP version: 20141014
Operation: crypto_core
Primitive: salsa208
Time | Implementation | Compiler | Benchmark date | SUPERCOP version
352 | ref | gcc-4.7 -funroll-loops -m64 -march=barcelona -O2 -fomit-frame-pointer | 20140911 | 20140907
356 | ref | gcc-4.7 -funroll-loops -O2 -fomit-frame-pointer | 20140911 | 20140907
356 | ref | gcc-4.7 -funroll-loops -O3 -fomit-frame-pointer | 20140911 | 20140907
356 | ref | gcc-4.7 -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer | 20140911 | 20140907
356 | ref | gcc-4.7 -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer | 20140911 | 20140907
356 | ref | gcc-4.7 -funroll-loops -m64 -O2 -fomit-frame-pointer | 20140911 | 20140907
356 | ref | gcc-4.7 -funroll-loops -m64 -O3 -fomit-frame-pointer | 20140911 | 20140907
356 | ref | gcc-4.7 -funroll-loops -march=barcelona -O2 -fomit-frame-pointer | 20140911 | 20140907
356 | ref | gcc-4.7 -m64 -march=core2 -O3 -fomit-frame-pointer | 20140911 | 20140907
356 | ref | gcc-4.7 -m64 -march=corei7-avx -O3 -fomit-frame-pointer | 20140911 | 20140907
356 | ref | gcc-4.7 -m64 -march=corei7 -O3 -fomit-frame-pointer | 20140911 | 20140907
356 | ref | gcc-4.7 -m64 -march=k8 -O3 -fomit-frame-pointer | 20140911 | 20140907
360 | ref | gcc-4.7 -funroll-loops -march=k8 -O2 -fomit-frame-pointer | 20140911 | 20140907
360 | ref | gcc-4.7 -m64 -march=barcelona -O3 -fomit-frame-pointer | 20140911 | 20140907
360 | ref | gcc-4.7 -m64 -march=core-avx-i -O3 -fomit-frame-pointer | 20140911 | 20140907
360 | ref | gcc-4.7 -m64 -march=core2 -msse4.1 -O3 -fomit-frame-pointer | 20140911 | 20140907
360 | ref | gcc-4.7 -m64 -march=core2 -msse4 -O3 -fomit-frame-pointer | 20140911 | 20140907
360 | ref | gcc-4.7 -m64 -march=native -mtune=native -O3 -fomit-frame-pointer | 20140911 | 20140907
360 | ref | gcc-4.7 -march=k8 -O3 -fomit-frame-pointer | 20140911 | 20140907
364 | ref | clang -O3 -fwrapv -mavx -fomit-frame-pointer | 20140911 | 20140907
364 | ref | clang -O3 -fwrapv -mavx -maes -mpclmul -fomit-frame-pointer | 20141018 | 20141014
364 | ref | gcc-4.7 -O3 -fomit-frame-pointer | 20140911 | 20140907
364 | ref | gcc-4.7 -m64 -O3 -fomit-frame-pointer | 20140911 | 20140907
368 | ref | clang -O3 -fomit-frame-pointer | 20140911 | 20140907
368 | ref | clang -O3 -fwrapv -march=native -fomit-frame-pointer | 20140911 | 20140907
368 | ref | clang -O3 -fwrapv -march=x86-64 -mcpu=core-avx2 -mavx2 -maes -mpclmul -fomit-frame-pointer | 20141018 | 20141014
368 | ref | gcc-4.7 -fno-schedule-insns -O3 -fomit-frame-pointer | 20140911 | 20140907
368 | ref | gcc-4.7 -funroll-loops -m64 -march=k8 -O2 -fomit-frame-pointer | 20140911 | 20140907
368 | ref | gcc-4.7 -funroll-loops -m64 -march=nocona -O2 -fomit-frame-pointer | 20140911 | 20140907
368 | ref | gcc-4.7 -funroll-loops -m64 -march=nocona -O3 -fomit-frame-pointer | 20140911 | 20140907
368 | ref | gcc-4.7 -funroll-loops -march=nocona -O2 -fomit-frame-pointer | 20140911 | 20140907
368 | ref | gcc-4.7 -funroll-loops -march=nocona -O3 -fomit-frame-pointer | 20140911 | 20140907
368 | ref | gcc-4.7 -m64 -march=nocona -O3 -fomit-frame-pointer | 20140911 | 20140907
368 | ref | gcc-4.7 -march=barcelona -O3 -fomit-frame-pointer | 20140911 | 20140907
368 | ref | gcc-4.7 -march=nocona -O3 -fomit-frame-pointer | 20140911 | 20140907
376 | ref | gcc-4.7 -funroll-loops -m64 -march=barcelona -O3 -fomit-frame-pointer | 20140911 | 20140907
376 | ref | gcc-4.7 -funroll-loops -march=barcelona -O3 -fomit-frame-pointer | 20140911 | 20140907
380 | ref | clang -O3 -fwrapv -mavx2 -fomit-frame-pointer | 20140911 | 20140907
380 | ref | gcc-4.7 -funroll-loops -m64 -march=k8 -O3 -fomit-frame-pointer | 20140911 | 20140907
380 | ref | gcc-4.7 -funroll-loops -march=k8 -O3 -fomit-frame-pointer | 20140911 | 20140907
396 | ref | clang -O3 -fwrapv -march=native -fpolly -funroll-loops -fvectorize -fslp-vectorize -fslp-vectorize-aggressive -fomit-frame-pointer | 20140911 | 20140907
396 | ref | gcc-4.7 -O2 -fomit-frame-pointer | 20140911 | 20140907
396 | ref | gcc-4.7 -m64 -march=core-avx-i -O2 -fomit-frame-pointer | 20140911 | 20140907
396 | ref | gcc-4.7 -m64 -march=core2 -msse4.1 -O2 -fomit-frame-pointer | 20140911 | 20140907
396 | ref | gcc-4.7 -m64 -march=core2 -msse4 -O2 -fomit-frame-pointer | 20140911 | 20140907
396 | ref | gcc-4.7 -m64 -march=corei7-avx -O2 -fomit-frame-pointer | 20140911 | 20140907
396 | ref | gcc-4.7 -m64 -march=corei7 -O2 -fomit-frame-pointer | 20140911 | 20140907
396 | ref | gcc-4.7 -m64 -march=native -mtune=native -O2 -fomit-frame-pointer | 20140911 | 20140907
396 | ref | gcc-4.7 -march=barcelona -O2 -fomit-frame-pointer | 20140911 | 20140907
400 | ref | clang -O3 -fwrapv -mavx -fpolly -funroll-loops -fvectorize -fslp-vectorize -fslp-vectorize-aggressive -fomit-frame-pointer | 20140911 | 20140907
400 | ref | clang -O3 -fwrapv -mavx -maes -mpclmul -fpolly -funroll-loops -fvectorize -fslp-vectorize -fslp-vectorize-aggressive -fomit-frame-pointer | 20141018 | 20141014
400 | ref | gcc-4.7 -m64 -O2 -fomit-frame-pointer | 20140911 | 20140907
400 | ref | gcc-4.7 -m64 -march=k8 -O2 -fomit-frame-pointer | 20140911 | 20140907
400 | ref | gcc-4.7 -march=k8 -O2 -fomit-frame-pointer | 20140911 | 20140907
400 | ref | gcc-4.7 -march=nocona -O2 -fomit-frame-pointer | 20140911 | 20140907
404 | ref | gcc-4.7 -fno-schedule-insns -O2 -fomit-frame-pointer | 20140911 | 20140907
408 | ref | gcc-4.7 -m64 -march=core2 -O2 -fomit-frame-pointer | 20140911 | 20140907
408 | ref | gcc-4.7 -m64 -march=nocona -O2 -fomit-frame-pointer | 20140911 | 20140907
416 | ref | gcc-4.7 -m64 -march=barcelona -O2 -fomit-frame-pointer | 20140911 | 20140907
416 | ref | gcc-4.8 -m64 -march=native -mtune=native -O -fomit-frame-pointer | 20140911 | 20140907
420 | ref | gcc-4.8 -m64 -march=core2 -msse4.1 -O -fomit-frame-pointer | 20140911 | 20140907
420 | ref | gcc-4.8 -m64 -march=corei7 -O -fomit-frame-pointer | 20140911 | 20140907
424 | ref | gcc-4.8 -O -fomit-frame-pointer | 20140911 | 20140907
424 | ref | gcc-4.8 -fno-schedule-insns -O -fomit-frame-pointer | 20140911 | 20140907
424 | ref | gcc-4.8 -funroll-loops -O -fomit-frame-pointer | 20140911 | 20140907
424 | ref | gcc-4.8 -funroll-loops -m64 -O -fomit-frame-pointer | 20140911 | 20140907
424 | ref | gcc-4.8 -m64 -O -fomit-frame-pointer | 20140911 | 20140907
424 | ref | gcc-4.8 -m64 -march=barcelona -O -fomit-frame-pointer | 20140911 | 20140907
424 | ref | gcc-4.8 -m64 -march=core2 -msse4 -O -fomit-frame-pointer | 20140911 | 20140907
424 | ref | gcc-4.8 -m64 -march=nocona -O -fomit-frame-pointer | 20140911 | 20140907
424 | ref | gcc-4.8 -march=barcelona -O -fomit-frame-pointer | 20140911 | 20140907
424 | ref | gcc-4.8 -march=k8 -O -fomit-frame-pointer | 20140911 | 20140907
424 | ref | gcc-4.8 -march=nocona -O -fomit-frame-pointer | 20140911 | 20140907
428 | ref | gcc-4.8 -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer | 20140911 | 20140907
428 | ref | gcc-4.8 -funroll-loops -m64 -march=barcelona -O -fomit-frame-pointer | 20140911 | 20140907
428 | ref | gcc-4.8 -funroll-loops -m64 -march=k8 -O -fomit-frame-pointer | 20140911 | 20140907
428 | ref | gcc-4.8 -funroll-loops -m64 -march=nocona -O -fomit-frame-pointer | 20140911 | 20140907
428 | ref | gcc-4.8 -funroll-loops -march=barcelona -O -fomit-frame-pointer | 20140911 | 20140907
428 | ref | gcc-4.8 -funroll-loops -march=k8 -O -fomit-frame-pointer | 20140911 | 20140907
428 | ref | gcc-4.8 -m64 -march=core-avx-i -O -fomit-frame-pointer | 20140911 | 20140907
428 | ref | gcc-4.8 -m64 -march=core2 -O -fomit-frame-pointer | 20140911 | 20140907
428 | ref | gcc-4.8 -m64 -march=corei7-avx -O -fomit-frame-pointer | 20140911 | 20140907
428 | ref | gcc-4.8 -m64 -march=k8 -O -fomit-frame-pointer | 20140911 | 20140907
432 | ref | gcc-4.8 -funroll-loops -march=nocona -O -fomit-frame-pointer | 20140911 | 20140907
456 | ref | gcc-4.7 -fno-schedule-insns -Os -fomit-frame-pointer | 20140911 | 20140907
456 | ref | gcc-4.7 -m64 -Os -fomit-frame-pointer | 20140911 | 20140907
456 | ref | gcc-4.7 -march=barcelona -Os -fomit-frame-pointer | 20140911 | 20140907
460 | ref | gcc-4.7 -Os -fomit-frame-pointer | 20140911 | 20140907
460 | ref | gcc-4.7 -funroll-loops -Os -fomit-frame-pointer | 20140911 | 20140907
460 | ref | gcc-4.7 -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer | 20140911 | 20140907
460 | ref | gcc-4.7 -funroll-loops -m64 -Os -fomit-frame-pointer | 20140911 | 20140907
460 | ref | gcc-4.7 -funroll-loops -m64 -march=barcelona -Os -fomit-frame-pointer | 20140911 | 20140907
460 | ref | gcc-4.7 -funroll-loops -m64 -march=nocona -Os -fomit-frame-pointer | 20140911 | 20140907
460 | ref | gcc-4.7 -funroll-loops -march=barcelona -Os -fomit-frame-pointer | 20140911 | 20140907
460 | ref | gcc-4.7 -funroll-loops -march=k8 -Os -fomit-frame-pointer | 20140911 | 20140907
460 | ref | gcc-4.7 -funroll-loops -march=nocona -Os -fomit-frame-pointer | 20140911 | 20140907
460 | ref | gcc-4.7 -m64 -march=barcelona -Os -fomit-frame-pointer | 20140911 | 20140907
460 | ref | gcc-4.7 -m64 -march=core-avx-i -Os -fomit-frame-pointer | 20140911 | 20140907
460 | ref | gcc-4.7 -m64 -march=core2 -msse4 -Os -fomit-frame-pointer | 20140911 | 20140907
460 | ref | gcc-4.7 -m64 -march=k8 -Os -fomit-frame-pointer | 20140911 | 20140907
460 | ref | gcc-4.7 -m64 -march=nocona -Os -fomit-frame-pointer | 20140911 | 20140907
460 | ref | gcc-4.7 -march=k8 -Os -fomit-frame-pointer | 20140911 | 20140907
460 | ref | gcc-4.7 -march=nocona -Os -fomit-frame-pointer | 20140911 | 20140907
464 | ref | gcc-4.7 -funroll-loops -m64 -march=k8 -Os -fomit-frame-pointer | 20140911 | 20140907
464 | ref | gcc-4.7 -m64 -march=core2 -Os -fomit-frame-pointer | 20140911 | 20140907
464 | ref | gcc-4.7 -m64 -march=core2 -msse4.1 -Os -fomit-frame-pointer | 20140911 | 20140907
464 | ref | gcc-4.7 -m64 -march=corei7-avx -Os -fomit-frame-pointer | 20140911 | 20140907
464 | ref | gcc-4.7 -m64 -march=corei7 -Os -fomit-frame-pointer | 20140911 | 20140907
464 | ref | gcc-4.7 -m64 -march=native -mtune=native -Os -fomit-frame-pointer | 20140911 | 20140907
600 | ref | gcc-4.8 -funroll-loops -m64 -O2 -fomit-frame-pointer | 20140911 | 20140907
604 | ref | gcc-4.8 -funroll-loops -O2 -fomit-frame-pointer | 20140911 | 20140907
604 | ref | gcc-4.8 -funroll-loops -O3 -fomit-frame-pointer | 20140911 | 20140907
604 | ref | gcc-4.8 -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer | 20140911 | 20140907
604 | ref | gcc-4.8 -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer | 20140911 | 20140907
604 | ref | gcc-4.8 -funroll-loops -m64 -march=barcelona -O2 -fomit-frame-pointer | 20140911 | 20140907
604 | ref | gcc-4.8 -funroll-loops -march=barcelona -O2 -fomit-frame-pointer | 20140911 | 20140907
608 | ref | gcc-4.8 -funroll-loops -m64 -O3 -fomit-frame-pointer | 20140911 | 20140907
608 | ref | gcc-4.8 -funroll-loops -m64 -march=k8 -O2 -fomit-frame-pointer | 20140911 | 20140907
608 | ref | gcc-4.8 -funroll-loops -march=k8 -O2 -fomit-frame-pointer | 20140911 | 20140907
616 | ref | gcc-4.8 -O2 -fomit-frame-pointer | 20140911 | 20140907
616 | ref | gcc-4.8 -O3 -fomit-frame-pointer | 20140911 | 20140907
616 | ref | gcc-4.8 -m64 -O3 -fomit-frame-pointer | 20140911 | 20140907
616 | ref | gcc-4.8 -m64 -march=core2 -msse4.1 -O3 -fomit-frame-pointer | 20140911 | 20140907
616 | ref | gcc-4.8 -m64 -march=core2 -msse4 -O2 -fomit-frame-pointer | 20140911 | 20140907
616 | ref | gcc-4.8 -m64 -march=corei7 -O3 -fomit-frame-pointer | 20140911 | 20140907
616 | ref | gcc-4.8 -m64 -march=native -mtune=native -O2 -fomit-frame-pointer | 20140911 | 20140907
620 | ref | gcc-4.8 -fno-schedule-insns -O3 -fomit-frame-pointer | 20140911 | 20140907
620 | ref | gcc-4.8 -funroll-loops -m64 -march=barcelona -O3 -fomit-frame-pointer | 20140911 | 20140907
620 | ref | gcc-4.8 -funroll-loops -march=barcelona -O3 -fomit-frame-pointer | 20140911 | 20140907
620 | ref | gcc-4.8 -m64 -march=core-avx-i -O3 -fomit-frame-pointer | 20140911 | 20140907
620 | ref | gcc-4.8 -m64 -march=core2 -O2 -fomit-frame-pointer | 20140911 | 20140907
620 | ref | gcc-4.8 -m64 -march=corei7-avx -O3 -fomit-frame-pointer | 20140911 | 20140907
620 | ref | gcc-4.8 -m64 -march=corei7 -O2 -fomit-frame-pointer | 20140911 | 20140907
624 | ref | gcc-4.8 -funroll-loops -m64 -march=k8 -O3 -fomit-frame-pointer | 20140911 | 20140907
624 | ref | gcc-4.8 -m64 -march=core-avx-i -O2 -fomit-frame-pointer | 20140911 | 20140907
624 | ref | gcc-4.8 -m64 -march=core2 -msse4.1 -O2 -fomit-frame-pointer | 20140911 | 20140907
624 | ref | gcc-4.8 -m64 -march=corei7-avx -O2 -fomit-frame-pointer | 20140911 | 20140907
624 | ref | gcc-4.8 -m64 -march=k8 -O2 -fomit-frame-pointer | 20140911 | 20140907
624 | ref | gcc-4.8 -m64 -march=native -mtune=native -O3 -fomit-frame-pointer | 20140911 | 20140907
624 | ref | gcc-4.8 -march=barcelona -O2 -fomit-frame-pointer | 20140911 | 20140907
624 | ref | gcc-4.8 -march=k8 -O3 -fomit-frame-pointer | 20140911 | 20140907
628 | ref | gcc-4.8 -m64 -march=core2 -O3 -fomit-frame-pointer | 20140911 | 20140907
628 | ref | gcc-4.8 -march=barcelona -O3 -fomit-frame-pointer | 20140911 | 20140907
628 | ref | gcc-4.8 -march=k8 -O2 -fomit-frame-pointer | 20140911 | 20140907
632 | ref | gcc-4.8 -fno-schedule-insns -O2 -fomit-frame-pointer | 20140911 | 20140907
632 | ref | gcc-4.8 -funroll-loops -march=k8 -O3 -fomit-frame-pointer | 20140911 | 20140907
632 | ref | gcc-4.8 -m64 -O2 -fomit-frame-pointer | 20140911 | 20140907
632 | ref | gcc-4.8 -m64 -march=barcelona -O3 -fomit-frame-pointer | 20140911 | 20140907
632 | ref | gcc-4.8 -m64 -march=core2 -msse4 -O3 -fomit-frame-pointer | 20140911 | 20140907
632 | ref | gcc-4.8 -m64 -march=k8 -O3 -fomit-frame-pointer | 20140911 | 20140907
636 | ref | gcc-4.8 -m64 -march=barcelona -O2 -fomit-frame-pointer | 20140911 | 20140907
652 | ref | gcc-4.8 -march=nocona -O2 -fomit-frame-pointer | 20140911 | 20140907
656 | ref | gcc-4.8 -m64 -march=nocona -O2 -fomit-frame-pointer | 20140911 | 20140907
660 | ref | gcc-4.8 -funroll-loops -m64 -march=nocona -O3 -fomit-frame-pointer | 20140911 | 20140907
660 | ref | gcc-4.8 -march=nocona -O3 -fomit-frame-pointer | 20140911 | 20140907
664 | ref | gcc-4.8 -funroll-loops -m64 -march=nocona -O2 -fomit-frame-pointer | 20140911 | 20140907
664 | ref | gcc-4.8 -funroll-loops -march=nocona -O2 -fomit-frame-pointer | 20140911 | 20140907
664 | ref | gcc-4.8 -funroll-loops -march=nocona -O3 -fomit-frame-pointer | 20140911 | 20140907
664 | ref | gcc-4.8 -m64 -march=nocona -O3 -fomit-frame-pointer | 20140911 | 20140907
696 | ref | gcc-4.8 -funroll-loops -march=k8 -Os -fomit-frame-pointer | 20140911 | 20140907
700 | ref | gcc-4.8 -funroll-loops -m64 -march=k8 -Os -fomit-frame-pointer | 20140911 | 20140907
708 | ref | gcc-4.8 -funroll-loops -Os -fomit-frame-pointer | 20140911 | 20140907
708 | ref | gcc-4.8 -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer | 20140911 | 20140907
708 | ref | gcc-4.8 -funroll-loops -m64 -Os -fomit-frame-pointer | 20140911 | 20140907
708 | ref | gcc-4.8 -funroll-loops -m64 -march=barcelona -Os -fomit-frame-pointer | 20140911 | 20140907
708 | ref | gcc-4.8 -funroll-loops -march=barcelona -Os -fomit-frame-pointer | 20140911 | 20140907
708 | ref | gcc-4.8 -m64 -march=core-avx-i -Os -fomit-frame-pointer | 20140911 | 20140907
708 | ref | gcc-4.8 -m64 -march=core2 -Os -fomit-frame-pointer | 20140911 | 20140907
708 | ref | gcc-4.8 -m64 -march=corei7-avx -Os -fomit-frame-pointer | 20140911 | 20140907
708 | ref | gcc-4.8 -march=k8 -Os -fomit-frame-pointer | 20140911 | 20140907
712 | ref | gcc-4.8 -Os -fomit-frame-pointer | 20140911 | 20140907
712 | ref | gcc-4.8 -fno-schedule-insns -Os -fomit-frame-pointer | 20140911 | 20140907
712 | ref | gcc-4.8 -funroll-loops -march=nocona -Os -fomit-frame-pointer | 20140911 | 20140907
712 | ref | gcc-4.8 -m64 -Os -fomit-frame-pointer | 20140911 | 20140907
712 | ref | gcc-4.8 -m64 -march=barcelona -Os -fomit-frame-pointer | 20140911 | 20140907
712 | ref | gcc-4.8 -m64 -march=core2 -msse4.1 -Os -fomit-frame-pointer | 20140911 | 20140907
712 | ref | gcc-4.8 -m64 -march=core2 -msse4 -Os -fomit-frame-pointer | 20140911 | 20140907
712 | ref | gcc-4.8 -m64 -march=corei7 -Os -fomit-frame-pointer | 20140911 | 20140907
712 | ref | gcc-4.8 -m64 -march=k8 -Os -fomit-frame-pointer | 20140911 | 20140907
712 | ref | gcc-4.8 -m64 -march=native -mtune=native -Os -fomit-frame-pointer | 20140911 | 20140907
712 | ref | gcc-4.8 -m64 -march=nocona -Os -fomit-frame-pointer | 20140911 | 20140907
712 | ref | gcc-4.8 -march=barcelona -Os -fomit-frame-pointer | 20140911 | 20140907
712 | ref | gcc-4.8 -march=nocona -Os -fomit-frame-pointer | 20140911 | 20140907
716 | ref | gcc-4.8 -funroll-loops -m64 -march=nocona -Os -fomit-frame-pointer | 20140911 | 20140907
856 | ref | gcc-4.7 -funroll-loops -m64 -march=nocona -O -fomit-frame-pointer | 20140911 | 20140907
856 | ref | gcc-4.7 -funroll-loops -march=nocona -O -fomit-frame-pointer | 20140911 | 20140907
856 | ref | gcc-4.7 -m64 -march=barcelona -O -fomit-frame-pointer | 20140911 | 20140907
860 | ref | gcc-4.7 -O -fomit-frame-pointer | 20140911 | 20140907
860 | ref | gcc-4.7 -fno-schedule-insns -O -fomit-frame-pointer | 20140911 | 20140907
860 | ref | gcc-4.7 -m64 -march=core-avx-i -O -fomit-frame-pointer | 20140911 | 20140907
860 | ref | gcc-4.7 -m64 -march=core-avx2 -O -fomit-frame-pointer | 20140911 | 20140907
860 | ref | gcc-4.7 -m64 -march=core2 -O -fomit-frame-pointer | 20140911 | 20140907
860 | ref | gcc-4.7 -m64 -march=core2 -msse4.1 -O -fomit-frame-pointer | 20140911 | 20140907
860 | ref | gcc-4.7 -m64 -march=core2 -msse4 -O -fomit-frame-pointer | 20140911 | 20140907
860 | ref | gcc-4.7 -m64 -march=corei7-avx -O -fomit-frame-pointer | 20140911 | 20140907
860 | ref | gcc-4.7 -m64 -march=corei7 -O -fomit-frame-pointer | 20140911 | 20140907
860 | ref | gcc-4.7 -m64 -march=k8 -O -fomit-frame-pointer | 20140911 | 20140907
860 | ref | gcc-4.7 -m64 -march=native -mtune=native -O -fomit-frame-pointer | 20140911 | 20140907
860 | ref | gcc-4.7 -m64 -march=nocona -O -fomit-frame-pointer | 20140911 | 20140907
860 | ref | gcc-4.7 -march=barcelona -O -fomit-frame-pointer | 20140911 | 20140907
860 | ref | gcc-4.7 -march=k8 -O -fomit-frame-pointer | 20140911 | 20140907
860 | ref | gcc-4.7 -march=nocona -O -fomit-frame-pointer | 20140911 | 20140907
864 | ref | gcc-4.7 -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer | 20140911 | 20140907
864 | ref | gcc-4.7 -funroll-loops -m64 -O -fomit-frame-pointer | 20140911 | 20140907
864 | ref | gcc-4.7 -funroll-loops -m64 -march=barcelona -O -fomit-frame-pointer | 20140911 | 20140907
864 | ref | gcc-4.7 -funroll-loops -m64 -march=k8 -O -fomit-frame-pointer | 20140911 | 20140907
864 | ref | gcc-4.7 -funroll-loops -march=barcelona -O -fomit-frame-pointer | 20140911 | 20140907
864 | ref | gcc-4.7 -funroll-loops -march=k8 -O -fomit-frame-pointer | 20140911 | 20140907
868 | ref | gcc-4.7 -funroll-loops -O -fomit-frame-pointer | 20140911 | 20140907
916 | ref | gcc-4.7 -m64 -O -fomit-frame-pointer | 20140911 | 20140907
2116 | ref | gcc-4.8 -funroll-loops | 20140911 | 20140907
2132 | ref | cc | 20140911 | 20140907
2136 | ref | gcc-4.8 | 20140911 | 20140907
2240 | ref | gcc-4.7 -funroll-loops | 20140911 | 20140907
2252 | ref | gcc-4.7 | 20140911 | 20140907
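
To compare rows programmatically, a minimal sketch follows. It assumes three rows copied by hand from the table above, with only the time and compiler fields kept, and reports each time as a multiple of the fastest one. The names `rows` and `relative_to_fastest` are illustrative and not part of SUPERCOP.

```python
# Three rows copied by hand from the table above: (time, compiler options).
rows = [
    (352, "gcc-4.7 -funroll-loops -m64 -march=barcelona -O2 -fomit-frame-pointer"),
    (616, "gcc-4.8 -O3 -fomit-frame-pointer"),
    (2252, "gcc-4.7"),
]


def relative_to_fastest(rows):
    """Return (compiler, time, ratio) tuples sorted by time.

    ratio is the row's time divided by the smallest time in the input.
    """
    fastest = min(time for time, _ in rows)
    return [(compiler, time, time / fastest) for time, compiler in sorted(rows)]


if __name__ == "__main__":
    for compiler, time, ratio in relative_to_fastest(rows):
        print(f"{time:5d}  {ratio:5.2f}x  {compiler}")
```

The same function works on any subset of the table, provided each row is supplied as a (time, compiler) pair.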

Test failure

Implementation: crypto_core/salsa208/ref
Compiler: gcc-4.7 -m64 -march=core-avx2 -O2 -fomit-frame-pointer
error 111

Number of similar (compiler,implementation) pairs: 5, namely:
Compiler | Implementations
gcc-4.7 -m64 -march=core-avx2 -O2 -fomit-frame-pointer | ref
gcc-4.7 -m64 -march=core-avx2 -Os -fomit-frame-pointer | ref
gcc-4.8 -m64 -march=core-avx2 -O2 -fomit-frame-pointer | ref
gcc-4.8 -m64 -march=core-avx2 -O -fomit-frame-pointer | ref
gcc-4.8 -m64 -march=core-avx2 -Os -fomit-frame-pointer | ref

Compiler output

Implementation: crypto_core/salsa208/ref
Compiler: clang -O3 -fwrapv -march=native -fpolly -funroll-loops -fvectorize -fslp-vectorize -fslp-vectorize-aggressive -fomit-frame-pointer
core.c: clang-3.3: warning: argument unused during compilation: '-fpolly'

Number of similar (compiler,implementation) pairs: 3, namely:
Compiler | Implementations
clang -O3 -fwrapv -march=native -fpolly -funroll-loops -fvectorize -fslp-vectorize -fslp-vectorize-aggressive -fomit-frame-pointer | ref
clang -O3 -fwrapv -mavx -fpolly -funroll-loops -fvectorize -fslp-vectorize -fslp-vectorize-aggressive -fomit-frame-pointer | ref
clang -O3 -fwrapv -mavx -maes -mpclmul -fpolly -funroll-loops -fvectorize -fslp-vectorize -fslp-vectorize-aggressive -fomit-frame-pointer | ref

Compiler output

Implementation: crypto_core/salsa208/ref
Compiler: clang -O3 -fwrapv -march=x86-64 -mcpu=core-avx2 -mavx2 -maes -mpclmul -fomit-frame-pointer
core.c: clang-3.3: warning: argument unused during compilation: '-mcpu=core-avx2'

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler | Implementations
clang -O3 -fwrapv -march=x86-64 -mcpu=core-avx2 -mavx2 -maes -mpclmul -fomit-frame-pointer | ref