Implementation notes: amd64, par, crypto_stream/salsa20

Computer: par
Architecture: amd64
CPU ID: GenuineIntel-000406c3-bfebfbff
SUPERCOP version: 20161026
Operation: crypto_stream
Primitive: salsa20
Time | Implementation | Compiler | Benchmark date | SUPERCOP version
10520 | dolbeau/amd64-xmm6int | gcc -funroll-loops -march=native -mcpu=native -Os | 20161214 | 20161026
10600 | e/amd64-xmm2 | gcc -funroll-loops -march=native -mcpu=native -O2 | 20161214 | 20161026
10600 | e/amd64-xmm2 | gcc -funroll-loops -march=native -mcpu=native -O3 | 20161214 | 20161026
10600 | e/amd64-xmm2 | gcc -march=native -mcpu=native -O3 | 20161214 | 20161026
10620 | e/amd64-xmm2 | gcc -funroll-loops -march=native -mcpu=native -Os | 20161214 | 20161026
10620 | e/amd64-xmm2 | gcc -march=native -mcpu=native -O2 | 20161214 | 20161026
10620 | e/amd64-xmm2 | gcc -march=native -mcpu=native -Os | 20161214 | 20161026
11240 | e/amd64-xmm | gcc -funroll-loops -march=native -mcpu=native -O2 | 20161214 | 20161026
11240 | e/amd64-xmm | gcc -funroll-loops -march=native -mcpu=native -O3 | 20161214 | 20161026
11240 | e/amd64-xmm | gcc -funroll-loops -march=native -mcpu=native -Os | 20161214 | 20161026
11240 | e/amd64-xmm | gcc -march=native -mcpu=native -O3 | 20161214 | 20161026
11240 | e/amd64-xmm | gcc -march=native -mcpu=native -Os | 20161214 | 20161026
11260 | e/amd64-xmm | gcc -march=native -mcpu=native -O2 | 20161214 | 20161026
11280 | dolbeau/amd64-xmm6int | gcc -march=native -mcpu=native -Os | 20161214 | 20161026
11680 | dolbeau/amd64-xmm6int | gcc -funroll-loops -march=native -mcpu=native -O3 | 20161214 | 20161026
11700 | dolbeau/amd64-xmm6int | gcc -funroll-loops -march=native -mcpu=native -O2 | 20161214 | 20161026
12060 | amd64_xmm6 | gcc -funroll-loops -march=native -mcpu=native -O3 | 20161214 | 20161026
12060 | amd64_xmm6 | gcc -funroll-loops -march=native -mcpu=native -Os | 20161214 | 20161026
12060 | amd64_xmm6 | gcc -march=native -mcpu=native -O2 | 20161214 | 20161026
12060 | amd64_xmm6 | gcc -march=native -mcpu=native -O3 | 20161214 | 20161026
12080 | amd64_xmm6 | gcc -funroll-loops -march=native -mcpu=native -O2 | 20161214 | 20161026
12080 | amd64_xmm6 | gcc -march=native -mcpu=native -Os | 20161214 | 20161026
12260 | e/amd64-xmm5 | gcc -funroll-loops -march=native -mcpu=native -Os | 20161214 | 20161026
12260 | e/amd64-xmm5 | gcc -march=native -mcpu=native -O3 | 20161214 | 20161026
12280 | e/amd64-xmm5 | gcc -funroll-loops -march=native -mcpu=native -O2 | 20161214 | 20161026
12280 | e/amd64-xmm5 | gcc -funroll-loops -march=native -mcpu=native -O3 | 20161214 | 20161026
12280 | e/amd64-xmm5 | gcc -march=native -mcpu=native -O2 | 20161214 | 20161026
12280 | e/amd64-xmm5 | gcc -march=native -mcpu=native -Os | 20161214 | 20161026
12420 | dolbeau/amd64-xmm6int | gcc -march=native -mcpu=native -O3 | 20161214 | 20161026
12460 | e/amd64-xmm6 | gcc -march=native -mcpu=native -O2 | 20161214 | 20161026
12460 | e/amd64-xmm6 | gcc -march=native -mcpu=native -O3 | 20161214 | 20161026
12480 | e/amd64-xmm6 | gcc -funroll-loops -march=native -mcpu=native -O2 | 20161214 | 20161026
12480 | e/amd64-xmm6 | gcc -funroll-loops -march=native -mcpu=native -O3 | 20161214 | 20161026
12480 | e/amd64-xmm6 | gcc -funroll-loops -march=native -mcpu=native -Os | 20161214 | 20161026
12480 | e/amd64-xmm6 | gcc -march=native -mcpu=native -Os | 20161214 | 20161026
12560 | dolbeau/amd64-xmm6int | gcc -march=native -mcpu=native -O2 | 20161214 | 20161026
21140 | e/merged | gcc -funroll-loops -march=native -mcpu=native -O3 | 20161214 | 20161026
21620 | e/amd64-3 | gcc -funroll-loops -march=native -mcpu=native -O3 | 20161214 | 20161026
21620 | e/amd64-3 | gcc -funroll-loops -march=native -mcpu=native -Os | 20161214 | 20161026
21620 | e/amd64-3 | gcc -march=native -mcpu=native -O3 | 20161214 | 20161026
21660 | e/ref | gcc -funroll-loops -march=native -mcpu=native -O3 | 20161214 | 20161026
21680 | e/amd64-3 | gcc -funroll-loops -march=native -mcpu=native -O2 | 20161214 | 20161026
21680 | e/amd64-3 | gcc -march=native -mcpu=native -O2 | 20161214 | 20161026
21680 | e/amd64-3 | gcc -march=native -mcpu=native -Os | 20161214 | 20161026
21920 | e/regs | gcc -funroll-loops -march=native -mcpu=native -O3 | 20161214 | 20161026
22060 | e/merged | gcc -funroll-loops -march=native -mcpu=native -O2 | 20161214 | 20161026
22400 | e/merged | gcc -march=native -mcpu=native -O3 | 20161214 | 20161026
22800 | e/regs | gcc -march=native -mcpu=native -O3 | 20161214 | 20161026
22940 | e/ref | gcc -march=native -mcpu=native -O3 | 20161214 | 20161026
23240 | e/merged | gcc -march=native -mcpu=native -O2 | 20161214 | 20161026
23240 | ref | gcc -march=native -mcpu=native -O3 | 20161214 | 20161026
23300 | ref | gcc -funroll-loops -march=native -mcpu=native -O3 | 20161214 | 20161026
23460 | e/amd64-2 | gcc -funroll-loops -march=native -mcpu=native -O2 | 20161214 | 20161026
23460 | e/amd64-2 | gcc -funroll-loops -march=native -mcpu=native -O3 | 20161214 | 20161026
23460 | e/amd64-2 | gcc -funroll-loops -march=native -mcpu=native -Os | 20161214 | 20161026
23460 | e/amd64-2 | gcc -march=native -mcpu=native -Os | 20161214 | 20161026
23480 | e/amd64-2 | gcc -march=native -mcpu=native -O2 | 20161214 | 20161026
23520 | e/amd64-2 | gcc -march=native -mcpu=native -O3 | 20161214 | 20161026
24120 | e/merged | gcc -funroll-loops -march=native -mcpu=native -Os | 20161214 | 20161026
25060 | e/ref | gcc -funroll-loops -march=native -mcpu=native -O2 | 20161214 | 20161026
25300 | e/merged | gcc -march=native -mcpu=native -Os | 20161214 | 20161026
27000 | e/regs | gcc -funroll-loops -march=native -mcpu=native -O2 | 20161214 | 20161026
27600 | ref | gcc -funroll-loops -march=native -mcpu=native -O2 | 20161214 | 20161026
27740 | e/amd64-1 | gcc -funroll-loops -march=native -mcpu=native -Os | 20161214 | 20161026
27740 | e/amd64-1 | gcc -march=native -mcpu=native -O3 | 20161214 | 20161026
27780 | e/amd64-1 | gcc -funroll-loops -march=native -mcpu=native -O2 | 20161214 | 20161026
27780 | e/amd64-1 | gcc -march=native -mcpu=native -O2 | 20161214 | 20161026
27800 | e/amd64-1 | gcc -march=native -mcpu=native -Os | 20161214 | 20161026
27820 | e/amd64-1 | gcc -funroll-loops -march=native -mcpu=native -O3 | 20161214 | 20161026
27900 | e/regs | gcc -march=native -mcpu=native -O2 | 20161214 | 20161026
28500 | ref | gcc -march=native -mcpu=native -O2 | 20161214 | 20161026
30020 | ref | gcc -funroll-loops -march=native -mcpu=native -Os | 20161214 | 20161026
30100 | ref | gcc -march=native -mcpu=native -Os | 20161214 | 20161026
30940 | e/ref | gcc -march=native -mcpu=native -O2 | 20161214 | 20161026
31920 | e/regs | gcc -funroll-loops -march=native -mcpu=native -Os | 20161214 | 20161026
36820 | e/ref | gcc -funroll-loops -march=native -mcpu=native -Os | 20161214 | 20161026
37560 | e/regs | gcc -march=native -mcpu=native -Os | 20161214 | 20161026
42640 | e/ref | gcc -march=native -mcpu=native -Os | 20161214 | 20161026
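
The table above lists one row per (implementation, compiler) pair, sorted by ascending Time. As a rough illustration of how to reduce it to the best result per implementation, the following Python sketch keeps the lowest Time seen for each implementation. It assumes that a lower Time is better, uses only a handful of rows copied from the table, and the helper name `best_per_implementation` is hypothetical rather than part of SUPERCOP.

```python
# Sample rows copied from the table above: (time, implementation, compiler).
# Only a subset is included here; this is an illustrative sketch.
ROWS = [
    (10520, "dolbeau/amd64-xmm6int", "gcc -funroll-loops -march=native -mcpu=native -Os"),
    (11280, "dolbeau/amd64-xmm6int", "gcc -march=native -mcpu=native -Os"),
    (10600, "e/amd64-xmm2", "gcc -funroll-loops -march=native -mcpu=native -O2"),
    (12060, "amd64_xmm6", "gcc -funroll-loops -march=native -mcpu=native -O3"),
    (23240, "ref", "gcc -march=native -mcpu=native -O3"),
    (30100, "ref", "gcc -march=native -mcpu=native -Os"),
]


def best_per_implementation(rows):
    """Return {implementation: (time, compiler)}, keeping the lowest time."""
    best = {}
    for time, impl, compiler in rows:
        if impl not in best or time < best[impl][0]:
            best[impl] = (time, compiler)
    return best


best = best_per_implementation(ROWS)

# Print implementations from fastest to slowest best result.
for impl, (time, compiler) in sorted(best.items(), key=lambda kv: kv[1][0]):
    print(f"{time:>6}  {impl:<24}{compiler}")
```

With the sample rows, `ref` is best with `-O3` (23240) and `dolbeau/amd64-xmm6int` is best with `-funroll-loops ... -Os` (10520), matching the corresponding rows in the table.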

Compiler output

Implementation: crypto_stream/salsa20/dolbeau/amd64-xmm6int
Compiler: gcc -funroll-loops -march=native -mcpu=native -O2
api.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
salsa20-xmm6int.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
measure.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead

Number of similar (compiler,implementation) pairs: 6, namely:
Compiler | Implementations
gcc -funroll-loops -march=native -mcpu=native -O2 | dolbeau/amd64-xmm6int
gcc -funroll-loops -march=native -mcpu=native -O3 | dolbeau/amd64-xmm6int
gcc -funroll-loops -march=native -mcpu=native -Os | dolbeau/amd64-xmm6int
gcc -march=native -mcpu=native -O2 | dolbeau/amd64-xmm6int
gcc -march=native -mcpu=native -O3 | dolbeau/amd64-xmm6int
gcc -march=native -mcpu=native -Os | dolbeau/amd64-xmm6int
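
The warning in the compiler output above appears once per compiled file because every compiler line passes `-mcpu=native`, which gcc reports as deprecated on this target. The sketch below shows one way such a compiler string could be cleaned up by dropping the `-mcpu=` argument. It relies on the documented gcc behaviour that `-march=` already implies the matching `-mtune=`, and the helper name `drop_deprecated_mcpu` is hypothetical. Whether the change affects the measured times is not something this page shows.

```python
import shlex


def drop_deprecated_mcpu(command: str) -> str:
    """Remove any -mcpu=... argument from a compiler command line."""
    args = shlex.split(command)
    kept = [arg for arg in args if not arg.startswith("-mcpu=")]
    return " ".join(kept)


cleaned = drop_deprecated_mcpu("gcc -funroll-loops -march=native -mcpu=native -O2")
print(cleaned)
```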

Compiler output

Implementation: crypto_stream/salsa20/e/merged
Compiler: gcc -funroll-loops -march=native -mcpu=native -O2
api.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
salsa20.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
measure.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead

Number of similar (compiler,implementation) pairs: 18, namely:
Compiler | Implementations
gcc -funroll-loops -march=native -mcpu=native -O2 | e/merged e/ref e/regs
gcc -funroll-loops -march=native -mcpu=native -O3 | e/merged e/ref e/regs
gcc -funroll-loops -march=native -mcpu=native -Os | e/merged e/ref e/regs
gcc -march=native -mcpu=native -O2 | e/merged e/ref e/regs
gcc -march=native -mcpu=native -O3 | e/merged e/ref e/regs
gcc -march=native -mcpu=native -Os | e/merged e/ref e/regs

Compiler output

Implementation: crypto_stream/salsa20/e/amd64-1
Compiler: gcc -funroll-loops -march=native -mcpu=native -O2
api.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
salsa20.s: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
measure.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead

Number of similar (compiler,implementation) pairs: 42, namely:
Compiler | Implementations
gcc -funroll-loops -march=native -mcpu=native -O2 | e/amd64-1 e/amd64-2 e/amd64-3 e/amd64-xmm e/amd64-xmm2 e/amd64-xmm5 e/amd64-xmm6
gcc -funroll-loops -march=native -mcpu=native -O3 | e/amd64-1 e/amd64-2 e/amd64-3 e/amd64-xmm e/amd64-xmm2 e/amd64-xmm5 e/amd64-xmm6
gcc -funroll-loops -march=native -mcpu=native -Os | e/amd64-1 e/amd64-2 e/amd64-3 e/amd64-xmm e/amd64-xmm2 e/amd64-xmm5 e/amd64-xmm6
gcc -march=native -mcpu=native -O2 | e/amd64-1 e/amd64-2 e/amd64-3 e/amd64-xmm e/amd64-xmm2 e/amd64-xmm5 e/amd64-xmm6
gcc -march=native -mcpu=native -O3 | e/amd64-1 e/amd64-2 e/amd64-3 e/amd64-xmm e/amd64-xmm2 e/amd64-xmm5 e/amd64-xmm6
gcc -march=native -mcpu=native -Os | e/amd64-1 e/amd64-2 e/amd64-3 e/amd64-xmm e/amd64-xmm2 e/amd64-xmm5 e/amd64-xmm6
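
The stated count of similar pairs can be checked against the listing above: six compiler lines, each paired with the same seven implementations. A minimal Python sketch of that arithmetic, with the two lists copied from the listing:

```python
from itertools import product

# Compiler lines and implementations copied from the listing above.
COMPILERS = [
    "gcc -funroll-loops -march=native -mcpu=native -O2",
    "gcc -funroll-loops -march=native -mcpu=native -O3",
    "gcc -funroll-loops -march=native -mcpu=native -Os",
    "gcc -march=native -mcpu=native -O2",
    "gcc -march=native -mcpu=native -O3",
    "gcc -march=native -mcpu=native -Os",
]
IMPLEMENTATIONS = [
    "e/amd64-1", "e/amd64-2", "e/amd64-3", "e/amd64-xmm",
    "e/amd64-xmm2", "e/amd64-xmm5", "e/amd64-xmm6",
]

# Every compiler is paired with every implementation.
pairs = list(product(COMPILERS, IMPLEMENTATIONS))
print(len(pairs))  # prints 42
```

The same reasoning gives 6 x 3 = 18 for the e/merged, e/ref, e/regs group and 6 x 1 = 6 for the single-implementation groups.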

Compiler output

Implementation: crypto_stream/salsa20/ref
Compiler: gcc -funroll-loops -march=native -mcpu=native -O2
stream.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
xor.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
measure.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead

Number of similar (compiler,implementation) pairs: 6, namely:
Compiler | Implementations
gcc -funroll-loops -march=native -mcpu=native -O2 | ref
gcc -funroll-loops -march=native -mcpu=native -O3 | ref
gcc -funroll-loops -march=native -mcpu=native -Os | ref
gcc -march=native -mcpu=native -O2 | ref
gcc -march=native -mcpu=native -O3 | ref
gcc -march=native -mcpu=native -Os | ref

Compiler output

Implementation: crypto_stream/salsa20/amd64_xmm6
Compiler: gcc -funroll-loops -march=native -mcpu=native -O2
stream.s: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
measure.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead

Number of similar (compiler,implementation) pairs: 6, namely:
Compiler | Implementations
gcc -funroll-loops -march=native -mcpu=native -O2 | amd64_xmm6
gcc -funroll-loops -march=native -mcpu=native -O3 | amd64_xmm6
gcc -funroll-loops -march=native -mcpu=native -Os | amd64_xmm6
gcc -march=native -mcpu=native -O2 | amd64_xmm6
gcc -march=native -mcpu=native -O3 | amd64_xmm6
gcc -march=native -mcpu=native -Os | amd64_xmm6