Implementation notes: amd64, par, crypto_stream/salsa2012

Computer: par
Architecture: amd64
CPU ID: GenuineIntel-000406c3-bfebfbff
SUPERCOP version: 20161026
Operation: crypto_stream
Primitive: salsa2012
TimeImplementationCompilerBenchmark dateSUPERCOP version
6640dolbeau/amd64-xmm6intgcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
6920dolbeau/amd64-xmm6intgcc -funroll-loops -march=native -mcpu=native -O22016121420161026
6920dolbeau/amd64-xmm6intgcc -funroll-loops -march=native -mcpu=native -O32016121420161026
6960e/amd64-xmm2gcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
6960e/amd64-xmm2gcc -march=native -mcpu=native -O22016121420161026
6960e/amd64-xmm2gcc -march=native -mcpu=native -O32016121420161026
6960e/amd64-xmm2gcc -march=native -mcpu=native -Os2016121420161026
6980e/amd64-xmm2gcc -funroll-loops -march=native -mcpu=native -O22016121420161026
6980e/amd64-xmm2gcc -funroll-loops -march=native -mcpu=native -O32016121420161026
7060dolbeau/amd64-xmm6intgcc -march=native -mcpu=native -Os2016121420161026
7360e/amd64-xmmgcc -funroll-loops -march=native -mcpu=native -O22016121420161026
7360e/amd64-xmmgcc -funroll-loops -march=native -mcpu=native -O32016121420161026
7360e/amd64-xmmgcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
7360e/amd64-xmmgcc -march=native -mcpu=native -O22016121420161026
7360e/amd64-xmmgcc -march=native -mcpu=native -O32016121420161026
7380e/amd64-xmmgcc -march=native -mcpu=native -Os2016121420161026
7460dolbeau/amd64-xmm6intgcc -march=native -mcpu=native -O32016121420161026
7780amd64_xmm6gcc -funroll-loops -march=native -mcpu=native -O22016121420161026
7780amd64_xmm6gcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
7780amd64_xmm6gcc -march=native -mcpu=native -O22016121420161026
7780amd64_xmm6gcc -march=native -mcpu=native -O32016121420161026
7780amd64_xmm6gcc -march=native -mcpu=native -Os2016121420161026
7800amd64_xmm6gcc -funroll-loops -march=native -mcpu=native -O32016121420161026
7800dolbeau/amd64-xmm6intgcc -march=native -mcpu=native -O22016121420161026
8020e/amd64-xmm5gcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
8020e/amd64-xmm5gcc -march=native -mcpu=native -O22016121420161026
8020e/amd64-xmm5gcc -march=native -mcpu=native -O32016121420161026
8040e/amd64-xmm5gcc -funroll-loops -march=native -mcpu=native -O22016121420161026
8040e/amd64-xmm5gcc -funroll-loops -march=native -mcpu=native -O32016121420161026
8040e/amd64-xmm5gcc -march=native -mcpu=native -Os2016121420161026
8060e/amd64-xmm6gcc -funroll-loops -march=native -mcpu=native -O32016121420161026
8060e/amd64-xmm6gcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
8060e/amd64-xmm6gcc -march=native -mcpu=native -O22016121420161026
8060e/amd64-xmm6gcc -march=native -mcpu=native -O32016121420161026
8080e/amd64-xmm6gcc -funroll-loops -march=native -mcpu=native -O22016121420161026
8080e/amd64-xmm6gcc -march=native -mcpu=native -Os2016121420161026
13620e/amd64-3gcc -funroll-loops -march=native -mcpu=native -O32016121420161026
13660e/amd64-3gcc -funroll-loops -march=native -mcpu=native -O22016121420161026
13660e/amd64-3gcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
13660e/amd64-3gcc -march=native -mcpu=native -O22016121420161026
13660e/amd64-3gcc -march=native -mcpu=native -Os2016121420161026
13720e/amd64-3gcc -march=native -mcpu=native -O32016121420161026
14080e/refgcc -funroll-loops -march=native -mcpu=native -O32016121420161026
14500e/regsgcc -funroll-loops -march=native -mcpu=native -O32016121420161026
14740e/amd64-2gcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
14760e/amd64-2gcc -funroll-loops -march=native -mcpu=native -O32016121420161026
14760e/mergedgcc -funroll-loops -march=native -mcpu=native -O32016121420161026
14760e/amd64-2gcc -march=native -mcpu=native -O22016121420161026
14760e/amd64-2gcc -march=native -mcpu=native -O32016121420161026
14760e/amd64-2gcc -march=native -mcpu=native -Os2016121420161026
14800e/amd64-2gcc -funroll-loops -march=native -mcpu=native -O22016121420161026
14820e/mergedgcc -funroll-loops -march=native -mcpu=native -O22016121420161026
14820e/refgcc -march=native -mcpu=native -O32016121420161026
15000e/regsgcc -march=native -mcpu=native -O32016121420161026
15020e/mergedgcc -march=native -mcpu=native -O32016121420161026
15080e/mergedgcc -march=native -mcpu=native -O22016121420161026
15800e/mergedgcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
15980e/mergedgcc -march=native -mcpu=native -Os2016121420161026
16580refgcc -funroll-loops -march=native -mcpu=native -O32016121420161026
16660refgcc -march=native -mcpu=native -O32016121420161026
17460e/amd64-1gcc -funroll-loops -march=native -mcpu=native -O32016121420161026
17500e/amd64-1gcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
17520e/amd64-1gcc -march=native -mcpu=native -O22016121420161026
17520e/amd64-1gcc -march=native -mcpu=native -O32016121420161026
17520e/amd64-1gcc -march=native -mcpu=native -Os2016121420161026
17620e/amd64-1gcc -funroll-loops -march=native -mcpu=native -O22016121420161026
18660e/refgcc -funroll-loops -march=native -mcpu=native -O22016121420161026
18820e/regsgcc -funroll-loops -march=native -mcpu=native -O22016121420161026
19860e/regsgcc -march=native -mcpu=native -O22016121420161026
20980refgcc -funroll-loops -march=native -mcpu=native -O22016121420161026
21980refgcc -march=native -mcpu=native -O22016121420161026
23440refgcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
23440refgcc -march=native -mcpu=native -Os2016121420161026
23780e/regsgcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
24180e/refgcc -march=native -mcpu=native -O22016121420161026
27960e/refgcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
29000e/regsgcc -march=native -mcpu=native -Os2016121420161026
33380e/refgcc -march=native -mcpu=native -Os2016121420161026

Compiler output

Implementation: crypto_stream/salsa2012/dolbeau/amd64-xmm6int
Compiler: gcc -funroll-loops -march=native -mcpu=native -O2
api.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
salsa20-xmm6int.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
measure.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
gcc -funroll-loops -march=native -mcpu=native -O2 dolbeau/amd64-xmm6int
gcc -funroll-loops -march=native -mcpu=native -O3 dolbeau/amd64-xmm6int
gcc -funroll-loops -march=native -mcpu=native -Os dolbeau/amd64-xmm6int
gcc -march=native -mcpu=native -O2 dolbeau/amd64-xmm6int
gcc -march=native -mcpu=native -O3 dolbeau/amd64-xmm6int
gcc -march=native -mcpu=native -Os dolbeau/amd64-xmm6int

Compiler output

Implementation: crypto_stream/salsa2012/e/merged
Compiler: gcc -funroll-loops -march=native -mcpu=native -O2
api.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
salsa20.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
measure.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead

Number of similar (compiler,implementation) pairs: 18, namely:
CompilerImplementations
gcc -funroll-loops -march=native -mcpu=native -O2 e/merged e/ref e/regs
gcc -funroll-loops -march=native -mcpu=native -O3 e/merged e/ref e/regs
gcc -funroll-loops -march=native -mcpu=native -Os e/merged e/ref e/regs
gcc -march=native -mcpu=native -O2 e/merged e/ref e/regs
gcc -march=native -mcpu=native -O3 e/merged e/ref e/regs
gcc -march=native -mcpu=native -Os e/merged e/ref e/regs

Compiler output

Implementation: crypto_stream/salsa2012/e/amd64-1
Compiler: gcc -funroll-loops -march=native -mcpu=native -O2
api.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
salsa20.s: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
measure.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead

Number of similar (compiler,implementation) pairs: 42, namely:
CompilerImplementations
gcc -funroll-loops -march=native -mcpu=native -O2 e/amd64-1 e/amd64-2 e/amd64-3 e/amd64-xmm e/amd64-xmm2 e/amd64-xmm5 e/amd64-xmm6
gcc -funroll-loops -march=native -mcpu=native -O3 e/amd64-1 e/amd64-2 e/amd64-3 e/amd64-xmm e/amd64-xmm2 e/amd64-xmm5 e/amd64-xmm6
gcc -funroll-loops -march=native -mcpu=native -Os e/amd64-1 e/amd64-2 e/amd64-3 e/amd64-xmm e/amd64-xmm2 e/amd64-xmm5 e/amd64-xmm6
gcc -march=native -mcpu=native -O2 e/amd64-1 e/amd64-2 e/amd64-3 e/amd64-xmm e/amd64-xmm2 e/amd64-xmm5 e/amd64-xmm6
gcc -march=native -mcpu=native -O3 e/amd64-1 e/amd64-2 e/amd64-3 e/amd64-xmm e/amd64-xmm2 e/amd64-xmm5 e/amd64-xmm6
gcc -march=native -mcpu=native -Os e/amd64-1 e/amd64-2 e/amd64-3 e/amd64-xmm e/amd64-xmm2 e/amd64-xmm5 e/amd64-xmm6

Compiler output

Implementation: crypto_stream/salsa2012/ref
Compiler: gcc -funroll-loops -march=native -mcpu=native -O2
stream.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
xor.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
measure.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
gcc -funroll-loops -march=native -mcpu=native -O2 ref
gcc -funroll-loops -march=native -mcpu=native -O3 ref
gcc -funroll-loops -march=native -mcpu=native -Os ref
gcc -march=native -mcpu=native -O2 ref
gcc -march=native -mcpu=native -O3 ref
gcc -march=native -mcpu=native -Os ref

Compiler output

Implementation: crypto_stream/salsa2012/amd64_xmm6
Compiler: gcc -funroll-loops -march=native -mcpu=native -O2
stream.s: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
measure.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
gcc -funroll-loops -march=native -mcpu=native -O2 amd64_xmm6
gcc -funroll-loops -march=native -mcpu=native -O3 amd64_xmm6
gcc -funroll-loops -march=native -mcpu=native -Os amd64_xmm6
gcc -march=native -mcpu=native -O2 amd64_xmm6
gcc -march=native -mcpu=native -O3 amd64_xmm6
gcc -march=native -mcpu=native -Os amd64_xmm6