Implementation notes: amd64, par, crypto_stream/salsa208

Computer: par
Architecture: amd64
CPU ID: GenuineIntel-000406c3-bfebfbff
SUPERCOP version: 20161026
Operation: crypto_stream
Primitive: salsa208
TimeImplementationCompilerBenchmark dateSUPERCOP version
4700dolbeau/amd64-xmm6intgcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
4960dolbeau/amd64-xmm6intgcc -funroll-loops -march=native -mcpu=native -O22016121420161026
4960dolbeau/amd64-xmm6intgcc -march=native -mcpu=native -Os2016121420161026
4980dolbeau/amd64-xmm6intgcc -funroll-loops -march=native -mcpu=native -O32016121420161026
5120e/amd64-xmm2gcc -funroll-loops -march=native -mcpu=native -O32016121420161026
5140e/amd64-xmm2gcc -funroll-loops -march=native -mcpu=native -O22016121420161026
5140e/amd64-xmm2gcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
5140e/amd64-xmm2gcc -march=native -mcpu=native -O22016121420161026
5140e/amd64-xmm2gcc -march=native -mcpu=native -O32016121420161026
5160e/amd64-xmm2gcc -march=native -mcpu=native -Os2016121420161026
5340dolbeau/amd64-xmm6intgcc -march=native -mcpu=native -O32016121420161026
5360dolbeau/amd64-xmm6intgcc -march=native -mcpu=native -O22016121420161026
5420e/amd64-xmmgcc -funroll-loops -march=native -mcpu=native -O22016121420161026
5420e/amd64-xmmgcc -funroll-loops -march=native -mcpu=native -O32016121420161026
5420e/amd64-xmmgcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
5420e/amd64-xmmgcc -march=native -mcpu=native -O22016121420161026
5420e/amd64-xmmgcc -march=native -mcpu=native -O32016121420161026
5420e/amd64-xmmgcc -march=native -mcpu=native -Os2016121420161026
5640amd64_xmm6gcc -march=native -mcpu=native -O22016121420161026
5660amd64_xmm6gcc -funroll-loops -march=native -mcpu=native -O22016121420161026
5660amd64_xmm6gcc -funroll-loops -march=native -mcpu=native -O32016121420161026
5660amd64_xmm6gcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
5660amd64_xmm6gcc -march=native -mcpu=native -O32016121420161026
5660amd64_xmm6gcc -march=native -mcpu=native -Os2016121420161026
5860e/amd64-xmm6gcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
5860e/amd64-xmm6gcc -march=native -mcpu=native -Os2016121420161026
5880e/amd64-xmm6gcc -funroll-loops -march=native -mcpu=native -O22016121420161026
5880e/amd64-xmm6gcc -funroll-loops -march=native -mcpu=native -O32016121420161026
5880e/amd64-xmm6gcc -march=native -mcpu=native -O22016121420161026
5880e/amd64-xmm6gcc -march=native -mcpu=native -O32016121420161026
5900e/amd64-xmm5gcc -funroll-loops -march=native -mcpu=native -O32016121420161026
5900e/amd64-xmm5gcc -march=native -mcpu=native -O32016121420161026
5920e/amd64-xmm5gcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
5920e/amd64-xmm5gcc -march=native -mcpu=native -Os2016121420161026
5940e/amd64-xmm5gcc -funroll-loops -march=native -mcpu=native -O22016121420161026
5940e/amd64-xmm5gcc -march=native -mcpu=native -O22016121420161026
9620e/amd64-3gcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
9660e/amd64-3gcc -funroll-loops -march=native -mcpu=native -O22016121420161026
9700e/amd64-3gcc -march=native -mcpu=native -Os2016121420161026
9720e/amd64-3gcc -funroll-loops -march=native -mcpu=native -O32016121420161026
9720e/amd64-3gcc -march=native -mcpu=native -O22016121420161026
9740e/amd64-3gcc -march=native -mcpu=native -O32016121420161026
10360e/amd64-2gcc -funroll-loops -march=native -mcpu=native -O22016121420161026
10360e/amd64-2gcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
10360e/amd64-2gcc -march=native -mcpu=native -O22016121420161026
10360e/amd64-2gcc -march=native -mcpu=native -O32016121420161026
10360e/amd64-2gcc -march=native -mcpu=native -Os2016121420161026
10400e/amd64-2gcc -funroll-loops -march=native -mcpu=native -O32016121420161026
10420e/refgcc -funroll-loops -march=native -mcpu=native -O32016121420161026
10600e/refgcc -march=native -mcpu=native -O32016121420161026
10620e/regsgcc -funroll-loops -march=native -mcpu=native -O32016121420161026
10660e/mergedgcc -funroll-loops -march=native -mcpu=native -O22016121420161026
10700e/mergedgcc -funroll-loops -march=native -mcpu=native -O32016121420161026
10860e/mergedgcc -march=native -mcpu=native -O32016121420161026
10960e/mergedgcc -march=native -mcpu=native -O22016121420161026
11080e/regsgcc -march=native -mcpu=native -O32016121420161026
11320e/mergedgcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
11520e/mergedgcc -march=native -mcpu=native -Os2016121420161026
12320e/amd64-1gcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
12320e/amd64-1gcc -march=native -mcpu=native -O32016121420161026
12320e/amd64-1gcc -march=native -mcpu=native -Os2016121420161026
12340e/amd64-1gcc -funroll-loops -march=native -mcpu=native -O22016121420161026
12340e/amd64-1gcc -funroll-loops -march=native -mcpu=native -O32016121420161026
12460e/amd64-1gcc -march=native -mcpu=native -O22016121420161026
12860refgcc -funroll-loops -march=native -mcpu=native -O32016121420161026
12980refgcc -march=native -mcpu=native -O32016121420161026
14760e/regsgcc -funroll-loops -march=native -mcpu=native -O22016121420161026
14960e/refgcc -funroll-loops -march=native -mcpu=native -O22016121420161026
15640e/regsgcc -march=native -mcpu=native -O22016121420161026
17220refgcc -funroll-loops -march=native -mcpu=native -O22016121420161026
18140refgcc -march=native -mcpu=native -O22016121420161026
18900e/refgcc -march=native -mcpu=native -O22016121420161026
19700e/regsgcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
19740refgcc -march=native -mcpu=native -Os2016121420161026
19760refgcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
23960e/refgcc -funroll-loops -march=native -mcpu=native -Os2016121420161026
24780e/regsgcc -march=native -mcpu=native -Os2016121420161026
28820e/refgcc -march=native -mcpu=native -Os2016121420161026

Compiler output

Implementation: crypto_stream/salsa208/dolbeau/amd64-xmm6int
Compiler: gcc -funroll-loops -march=native -mcpu=native -O2
api.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
salsa20-xmm6int.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
measure.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
gcc -funroll-loops -march=native -mcpu=native -O2 dolbeau/amd64-xmm6int
gcc -funroll-loops -march=native -mcpu=native -O3 dolbeau/amd64-xmm6int
gcc -funroll-loops -march=native -mcpu=native -Os dolbeau/amd64-xmm6int
gcc -march=native -mcpu=native -O2 dolbeau/amd64-xmm6int
gcc -march=native -mcpu=native -O3 dolbeau/amd64-xmm6int
gcc -march=native -mcpu=native -Os dolbeau/amd64-xmm6int

Compiler output

Implementation: crypto_stream/salsa208/e/merged
Compiler: gcc -funroll-loops -march=native -mcpu=native -O2
api.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
salsa20.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
measure.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead

Number of similar (compiler,implementation) pairs: 18, namely:
CompilerImplementations
gcc -funroll-loops -march=native -mcpu=native -O2 e/merged e/ref e/regs
gcc -funroll-loops -march=native -mcpu=native -O3 e/merged e/ref e/regs
gcc -funroll-loops -march=native -mcpu=native -Os e/merged e/ref e/regs
gcc -march=native -mcpu=native -O2 e/merged e/ref e/regs
gcc -march=native -mcpu=native -O3 e/merged e/ref e/regs
gcc -march=native -mcpu=native -Os e/merged e/ref e/regs

Compiler output

Implementation: crypto_stream/salsa208/e/amd64-1
Compiler: gcc -funroll-loops -march=native -mcpu=native -O2
api.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
salsa20.s: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
measure.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead

Number of similar (compiler,implementation) pairs: 42, namely:
CompilerImplementations
gcc -funroll-loops -march=native -mcpu=native -O2 e/amd64-1 e/amd64-2 e/amd64-3 e/amd64-xmm e/amd64-xmm2 e/amd64-xmm5 e/amd64-xmm6
gcc -funroll-loops -march=native -mcpu=native -O3 e/amd64-1 e/amd64-2 e/amd64-3 e/amd64-xmm e/amd64-xmm2 e/amd64-xmm5 e/amd64-xmm6
gcc -funroll-loops -march=native -mcpu=native -Os e/amd64-1 e/amd64-2 e/amd64-3 e/amd64-xmm e/amd64-xmm2 e/amd64-xmm5 e/amd64-xmm6
gcc -march=native -mcpu=native -O2 e/amd64-1 e/amd64-2 e/amd64-3 e/amd64-xmm e/amd64-xmm2 e/amd64-xmm5 e/amd64-xmm6
gcc -march=native -mcpu=native -O3 e/amd64-1 e/amd64-2 e/amd64-3 e/amd64-xmm e/amd64-xmm2 e/amd64-xmm5 e/amd64-xmm6
gcc -march=native -mcpu=native -Os e/amd64-1 e/amd64-2 e/amd64-3 e/amd64-xmm e/amd64-xmm2 e/amd64-xmm5 e/amd64-xmm6

Compiler output

Implementation: crypto_stream/salsa208/ref
Compiler: gcc -funroll-loops -march=native -mcpu=native -O2
stream.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
xor.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
measure.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
gcc -funroll-loops -march=native -mcpu=native -O2 ref
gcc -funroll-loops -march=native -mcpu=native -O3 ref
gcc -funroll-loops -march=native -mcpu=native -Os ref
gcc -march=native -mcpu=native -O2 ref
gcc -march=native -mcpu=native -O3 ref
gcc -march=native -mcpu=native -Os ref

Compiler output

Implementation: crypto_stream/salsa208/amd64_xmm6
Compiler: gcc -funroll-loops -march=native -mcpu=native -O2
stream.s: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
measure.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
gcc -funroll-loops -march=native -mcpu=native -O2 amd64_xmm6
gcc -funroll-loops -march=native -mcpu=native -O3 amd64_xmm6
gcc -funroll-loops -march=native -mcpu=native -Os amd64_xmm6
gcc -march=native -mcpu=native -O2 amd64_xmm6
gcc -march=native -mcpu=native -O3 amd64_xmm6
gcc -march=native -mcpu=native -Os amd64_xmm6