Implementation notes: amd64, mangetsu, crypto_stream/rijn256ctr

Computer: mangetsu
Architecture: amd64
CPU ID: GenuineIntel-000206a7-bfebfbff
SUPERCOP version: 20141014
Operation: crypto_stream
Primitive: rijn256ctr

Compiler output

Implementation: crypto_stream/rijn256ctr/gil
Compiler: cc
rijn_k32b32.gnu.S: rijn_k32b32.gnu.S: Assembler messages:
rijn_k32b32.gnu.S: rijn_k32b32.gnu.S:1: Error: alignment not a power of 2
rijn_k32b32.gnu.S: rijn_k32b32.gnu.S:15: Error: alignment not a power of 2

Number of similar (compiler,implementation) pairs: 203, namely:
Compiler Implementations
cc gil
gcc-4.7 gil
gcc-4.7 -O2 -fomit-frame-pointer gil
gcc-4.7 -O3 -fomit-frame-pointer gil
gcc-4.7 -O -fomit-frame-pointer gil
gcc-4.7 -Os -fomit-frame-pointer gil
gcc-4.7 -fno-schedule-insns -O2 -fomit-frame-pointer gil
gcc-4.7 -fno-schedule-insns -O3 -fomit-frame-pointer gil
gcc-4.7 -fno-schedule-insns -O -fomit-frame-pointer gil
gcc-4.7 -fno-schedule-insns -Os -fomit-frame-pointer gil
gcc-4.7 -funroll-loops gil
gcc-4.7 -funroll-loops -O2 -fomit-frame-pointer gil
gcc-4.7 -funroll-loops -O3 -fomit-frame-pointer gil
gcc-4.7 -funroll-loops -O -fomit-frame-pointer gil
gcc-4.7 -funroll-loops -Os -fomit-frame-pointer gil
gcc-4.7 -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer gil
gcc-4.7 -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer gil
gcc-4.7 -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer gil
gcc-4.7 -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer gil
gcc-4.7 -funroll-loops -m64 -O2 -fomit-frame-pointer gil
gcc-4.7 -funroll-loops -m64 -O3 -fomit-frame-pointer gil
gcc-4.7 -funroll-loops -m64 -O -fomit-frame-pointer gil
gcc-4.7 -funroll-loops -m64 -Os -fomit-frame-pointer gil
gcc-4.7 -funroll-loops -m64 -march=barcelona -O2 -fomit-frame-pointer gil
gcc-4.7 -funroll-loops -m64 -march=barcelona -O3 -fomit-frame-pointer gil
gcc-4.7 -funroll-loops -m64 -march=barcelona -O -fomit-frame-pointer gil
gcc-4.7 -funroll-loops -m64 -march=barcelona -Os -fomit-frame-pointer gil
gcc-4.7 -funroll-loops -m64 -march=k8 -O2 -fomit-frame-pointer gil
gcc-4.7 -funroll-loops -m64 -march=k8 -O3 -fomit-frame-pointer gil
gcc-4.7 -funroll-loops -m64 -march=k8 -O -fomit-frame-pointer gil
gcc-4.7 -funroll-loops -m64 -march=k8 -Os -fomit-frame-pointer gil
gcc-4.7 -funroll-loops -m64 -march=nocona -O2 -fomit-frame-pointer gil
gcc-4.7 -funroll-loops -m64 -march=nocona -O3 -fomit-frame-pointer gil
gcc-4.7 -funroll-loops -m64 -march=nocona -O -fomit-frame-pointer gil
gcc-4.7 -funroll-loops -m64 -march=nocona -Os -fomit-frame-pointer gil
gcc-4.7 -funroll-loops -march=barcelona -O2 -fomit-frame-pointer gil
gcc-4.7 -funroll-loops -march=barcelona -O3 -fomit-frame-pointer gil
gcc-4.7 -funroll-loops -march=barcelona -O -fomit-frame-pointer gil
gcc-4.7 -funroll-loops -march=barcelona -Os -fomit-frame-pointer gil
gcc-4.7 -funroll-loops -march=k8 -O2 -fomit-frame-pointer gil
gcc-4.7 -funroll-loops -march=k8 -O3 -fomit-frame-pointer gil
gcc-4.7 -funroll-loops -march=k8 -O -fomit-frame-pointer gil
gcc-4.7 -funroll-loops -march=k8 -Os -fomit-frame-pointer gil
gcc-4.7 -funroll-loops -march=nocona -O2 -fomit-frame-pointer gil
gcc-4.7 -funroll-loops -march=nocona -O3 -fomit-frame-pointer gil
gcc-4.7 -funroll-loops -march=nocona -O -fomit-frame-pointer gil
gcc-4.7 -funroll-loops -march=nocona -Os -fomit-frame-pointer gil
gcc-4.7 -m64 -O2 -fomit-frame-pointer gil
gcc-4.7 -m64 -O3 -fomit-frame-pointer gil
gcc-4.7 -m64 -O -fomit-frame-pointer gil
gcc-4.7 -m64 -Os -fomit-frame-pointer gil
gcc-4.7 -m64 -march=core-avx-i -O2 -fomit-frame-pointer gil
gcc-4.7 -m64 -march=core-avx-i -O3 -fomit-frame-pointer gil
gcc-4.7 -m64 -march=core-avx-i -O -fomit-frame-pointer gil
gcc-4.7 -m64 -march=core-avx-i -Os -fomit-frame-pointer gil
gcc-4.7 -m64 -march=core-avx2 -O2 -fomit-frame-pointer gil
gcc-4.7 -m64 -march=core-avx2 -O -fomit-frame-pointer gil
gcc-4.7 -m64 -march=core-avx2 -Os -fomit-frame-pointer gil
gcc-4.7 -m64 -march=core2 -O2 -fomit-frame-pointer gil
gcc-4.7 -m64 -march=core2 -O3 -fomit-frame-pointer gil
gcc-4.7 -m64 -march=core2 -O -fomit-frame-pointer gil
gcc-4.7 -m64 -march=core2 -Os -fomit-frame-pointer gil
gcc-4.7 -m64 -march=core2 -msse4.1 -O2 -fomit-frame-pointer gil
gcc-4.7 -m64 -march=core2 -msse4.1 -O3 -fomit-frame-pointer gil
gcc-4.7 -m64 -march=core2 -msse4.1 -O -fomit-frame-pointer gil
gcc-4.7 -m64 -march=core2 -msse4.1 -Os -fomit-frame-pointer gil
gcc-4.7 -m64 -march=core2 -msse4 -O2 -fomit-frame-pointer gil
gcc-4.7 -m64 -march=core2 -msse4 -O3 -fomit-frame-pointer gil
gcc-4.7 -m64 -march=core2 -msse4 -O -fomit-frame-pointer gil
gcc-4.7 -m64 -march=core2 -msse4 -Os -fomit-frame-pointer gil
gcc-4.7 -m64 -march=corei7-avx -O2 -fomit-frame-pointer gil
gcc-4.7 -m64 -march=corei7-avx -O3 -fomit-frame-pointer gil
gcc-4.7 -m64 -march=corei7-avx -O -fomit-frame-pointer gil
gcc-4.7 -m64 -march=corei7-avx -Os -fomit-frame-pointer gil
gcc-4.7 -m64 -march=corei7 -O2 -fomit-frame-pointer gil
gcc-4.7 -m64 -march=corei7 -O3 -fomit-frame-pointer gil
gcc-4.7 -m64 -march=corei7 -O -fomit-frame-pointer gil
gcc-4.7 -m64 -march=corei7 -Os -fomit-frame-pointer gil
gcc-4.7 -m64 -march=k8 -O2 -fomit-frame-pointer gil
gcc-4.7 -m64 -march=k8 -O3 -fomit-frame-pointer gil
gcc-4.7 -m64 -march=k8 -O -fomit-frame-pointer gil
gcc-4.7 -m64 -march=k8 -Os -fomit-frame-pointer gil
gcc-4.7 -m64 -march=native -mtune=native -O2 -fomit-frame-pointer gil
gcc-4.7 -m64 -march=native -mtune=native -O3 -fomit-frame-pointer gil
gcc-4.7 -m64 -march=native -mtune=native -O -fomit-frame-pointer gil
gcc-4.7 -m64 -march=native -mtune=native -Os -fomit-frame-pointer gil
gcc-4.7 -m64 -march=nocona -O2 -fomit-frame-pointer gil
gcc-4.7 -m64 -march=nocona -O3 -fomit-frame-pointer gil
gcc-4.7 -m64 -march=nocona -O -fomit-frame-pointer gil
gcc-4.7 -m64 -march=nocona -Os -fomit-frame-pointer gil
gcc-4.7 -march=barcelona -O2 -fomit-frame-pointer gil
gcc-4.7 -march=barcelona -O3 -fomit-frame-pointer gil
gcc-4.7 -march=barcelona -O -fomit-frame-pointer gil
gcc-4.7 -march=barcelona -Os -fomit-frame-pointer gil
gcc-4.7 -march=k8 -O2 -fomit-frame-pointer gil
gcc-4.7 -march=k8 -O3 -fomit-frame-pointer gil
gcc-4.7 -march=k8 -O -fomit-frame-pointer gil
gcc-4.7 -march=k8 -Os -fomit-frame-pointer gil
gcc-4.7 -march=nocona -O2 -fomit-frame-pointer gil
gcc-4.7 -march=nocona -O3 -fomit-frame-pointer gil
gcc-4.7 -march=nocona -O -fomit-frame-pointer gil
gcc-4.7 -march=nocona -Os -fomit-frame-pointer gil
gcc-4.8 gil
gcc-4.8 -O2 -fomit-frame-pointer gil
gcc-4.8 -O3 -fomit-frame-pointer gil
gcc-4.8 -O -fomit-frame-pointer gil
gcc-4.8 -Os -fomit-frame-pointer gil
gcc-4.8 -fno-schedule-insns -O2 -fomit-frame-pointer gil
gcc-4.8 -fno-schedule-insns -O3 -fomit-frame-pointer gil
gcc-4.8 -fno-schedule-insns -O -fomit-frame-pointer gil
gcc-4.8 -fno-schedule-insns -Os -fomit-frame-pointer gil
gcc-4.8 -funroll-loops gil
gcc-4.8 -funroll-loops -O2 -fomit-frame-pointer gil
gcc-4.8 -funroll-loops -O3 -fomit-frame-pointer gil
gcc-4.8 -funroll-loops -O -fomit-frame-pointer gil
gcc-4.8 -funroll-loops -Os -fomit-frame-pointer gil
gcc-4.8 -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer gil
gcc-4.8 -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer gil
gcc-4.8 -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer gil
gcc-4.8 -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer gil
gcc-4.8 -funroll-loops -m64 -O2 -fomit-frame-pointer gil
gcc-4.8 -funroll-loops -m64 -O3 -fomit-frame-pointer gil
gcc-4.8 -funroll-loops -m64 -O -fomit-frame-pointer gil
gcc-4.8 -funroll-loops -m64 -Os -fomit-frame-pointer gil
gcc-4.8 -funroll-loops -m64 -march=barcelona -O2 -fomit-frame-pointer gil
gcc-4.8 -funroll-loops -m64 -march=barcelona -O3 -fomit-frame-pointer gil
gcc-4.8 -funroll-loops -m64 -march=barcelona -O -fomit-frame-pointer gil
gcc-4.8 -funroll-loops -m64 -march=barcelona -Os -fomit-frame-pointer gil
gcc-4.8 -funroll-loops -m64 -march=k8 -O2 -fomit-frame-pointer gil
gcc-4.8 -funroll-loops -m64 -march=k8 -O3 -fomit-frame-pointer gil
gcc-4.8 -funroll-loops -m64 -march=k8 -O -fomit-frame-pointer gil
gcc-4.8 -funroll-loops -m64 -march=k8 -Os -fomit-frame-pointer gil
gcc-4.8 -funroll-loops -m64 -march=nocona -O2 -fomit-frame-pointer gil
gcc-4.8 -funroll-loops -m64 -march=nocona -O3 -fomit-frame-pointer gil
gcc-4.8 -funroll-loops -m64 -march=nocona -O -fomit-frame-pointer gil
gcc-4.8 -funroll-loops -m64 -march=nocona -Os -fomit-frame-pointer gil
gcc-4.8 -funroll-loops -march=barcelona -O2 -fomit-frame-pointer gil
gcc-4.8 -funroll-loops -march=barcelona -O3 -fomit-frame-pointer gil
gcc-4.8 -funroll-loops -march=barcelona -O -fomit-frame-pointer gil
gcc-4.8 -funroll-loops -march=barcelona -Os -fomit-frame-pointer gil
gcc-4.8 -funroll-loops -march=k8 -O2 -fomit-frame-pointer gil
gcc-4.8 -funroll-loops -march=k8 -O3 -fomit-frame-pointer gil
gcc-4.8 -funroll-loops -march=k8 -O -fomit-frame-pointer gil
gcc-4.8 -funroll-loops -march=k8 -Os -fomit-frame-pointer gil
gcc-4.8 -funroll-loops -march=nocona -O2 -fomit-frame-pointer gil
gcc-4.8 -funroll-loops -march=nocona -O3 -fomit-frame-pointer gil
gcc-4.8 -funroll-loops -march=nocona -O -fomit-frame-pointer gil
gcc-4.8 -funroll-loops -march=nocona -Os -fomit-frame-pointer gil
gcc-4.8 -m64 -O2 -fomit-frame-pointer gil
gcc-4.8 -m64 -O3 -fomit-frame-pointer gil
gcc-4.8 -m64 -O -fomit-frame-pointer gil
gcc-4.8 -m64 -Os -fomit-frame-pointer gil
gcc-4.8 -m64 -march=core-avx-i -O2 -fomit-frame-pointer gil
gcc-4.8 -m64 -march=core-avx-i -O3 -fomit-frame-pointer gil
gcc-4.8 -m64 -march=core-avx-i -O -fomit-frame-pointer gil
gcc-4.8 -m64 -march=core-avx-i -Os -fomit-frame-pointer gil
gcc-4.8 -m64 -march=core-avx2 -O2 -fomit-frame-pointer gil
gcc-4.8 -m64 -march=core-avx2 -O -fomit-frame-pointer gil
gcc-4.8 -m64 -march=core-avx2 -Os -fomit-frame-pointer gil
gcc-4.8 -m64 -march=core2 -O2 -fomit-frame-pointer gil
gcc-4.8 -m64 -march=core2 -O3 -fomit-frame-pointer gil
gcc-4.8 -m64 -march=core2 -O -fomit-frame-pointer gil
gcc-4.8 -m64 -march=core2 -Os -fomit-frame-pointer gil
gcc-4.8 -m64 -march=core2 -msse4.1 -O2 -fomit-frame-pointer gil
gcc-4.8 -m64 -march=core2 -msse4.1 -O3 -fomit-frame-pointer gil
gcc-4.8 -m64 -march=core2 -msse4.1 -O -fomit-frame-pointer gil
gcc-4.8 -m64 -march=core2 -msse4.1 -Os -fomit-frame-pointer gil
gcc-4.8 -m64 -march=core2 -msse4 -O2 -fomit-frame-pointer gil
gcc-4.8 -m64 -march=core2 -msse4 -O3 -fomit-frame-pointer gil
gcc-4.8 -m64 -march=core2 -msse4 -O -fomit-frame-pointer gil
gcc-4.8 -m64 -march=core2 -msse4 -Os -fomit-frame-pointer gil
gcc-4.8 -m64 -march=corei7-avx -O2 -fomit-frame-pointer gil
gcc-4.8 -m64 -march=corei7-avx -O3 -fomit-frame-pointer gil
gcc-4.8 -m64 -march=corei7-avx -O -fomit-frame-pointer gil
gcc-4.8 -m64 -march=corei7-avx -Os -fomit-frame-pointer gil
gcc-4.8 -m64 -march=corei7 -O2 -fomit-frame-pointer gil
gcc-4.8 -m64 -march=corei7 -O3 -fomit-frame-pointer gil
gcc-4.8 -m64 -march=corei7 -O -fomit-frame-pointer gil
gcc-4.8 -m64 -march=corei7 -Os -fomit-frame-pointer gil
gcc-4.8 -m64 -march=k8 -O2 -fomit-frame-pointer gil
gcc-4.8 -m64 -march=k8 -O3 -fomit-frame-pointer gil
gcc-4.8 -m64 -march=k8 -O -fomit-frame-pointer gil
gcc-4.8 -m64 -march=k8 -Os -fomit-frame-pointer gil
gcc-4.8 -m64 -march=native -mtune=native -O2 -fomit-frame-pointer gil
gcc-4.8 -m64 -march=native -mtune=native -O3 -fomit-frame-pointer gil
gcc-4.8 -m64 -march=native -mtune=native -O -fomit-frame-pointer gil
gcc-4.8 -m64 -march=native -mtune=native -Os -fomit-frame-pointer gil
gcc-4.8 -m64 -march=nocona -O2 -fomit-frame-pointer gil
gcc-4.8 -m64 -march=nocona -O3 -fomit-frame-pointer gil
gcc-4.8 -m64 -march=nocona -O -fomit-frame-pointer gil
gcc-4.8 -m64 -march=nocona -Os -fomit-frame-pointer gil
gcc-4.8 -march=barcelona -O2 -fomit-frame-pointer gil
gcc-4.8 -march=barcelona -O3 -fomit-frame-pointer gil
gcc-4.8 -march=barcelona -O -fomit-frame-pointer gil
gcc-4.8 -march=barcelona -Os -fomit-frame-pointer gil
gcc-4.8 -march=k8 -O2 -fomit-frame-pointer gil
gcc-4.8 -march=k8 -O3 -fomit-frame-pointer gil
gcc-4.8 -march=k8 -O -fomit-frame-pointer gil
gcc-4.8 -march=k8 -Os -fomit-frame-pointer gil
gcc-4.8 -march=nocona -O2 -fomit-frame-pointer gil
gcc-4.8 -march=nocona -O3 -fomit-frame-pointer gil
gcc-4.8 -march=nocona -O -fomit-frame-pointer gil
gcc-4.8 -march=nocona -Os -fomit-frame-pointer gil

Compiler output

Implementation: crypto_stream/rijn256ctr/gil
Compiler: clang -O3 -fomit-frame-pointer
rijn_k32b32.gnu.S: rijn_k32b32.gnu.S:1:8: error: alignment must be a power of 2
rijn_k32b32.gnu.S: .align 5
rijn_k32b32.gnu.S: ^
rijn_k32b32.gnu.S: rijn_k32b32.gnu.S:15:8: error: alignment must be a power of 2
rijn_k32b32.gnu.S: .align 5
rijn_k32b32.gnu.S: ^
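
Note on the error: on x86 ELF targets, both GNU as and clang's integrated assembler read the operand of .align as a byte count, so ".align 5" is rejected because 5 is not a power of 2. On targets where .align takes a power-of-two exponent, the same line would request 2^5 = 32-byte alignment; ".p2align 5" requests that alignment on either kind of target. The sketch below is not part of SUPERCOP or of the gil implementation. It assumes the author intended the exponent form, and the helper name fix_align_line is hypothetical. It rewrites only ".align N" lines where N is not a power of 2, since a value such as 4 is valid as a byte count and its intent cannot be inferred.

```python
import re

# Matches a line holding only an ".align N" directive (no trailing comment).
ALIGN_RE = re.compile(r"^(\s*)\.align(\s+)(\d+)\s*$")


def is_power_of_two(n):
    """Return True if n is a positive power of 2."""
    return n > 0 and (n & (n - 1)) == 0


def fix_align_line(line):
    """Rewrite '.align N' as '.p2align N' when N is not a power of 2.

    Assumption: a non-power-of-2 operand was meant as an exponent.
    The line is expected without its trailing newline; all other
    lines are returned unchanged.
    """
    m = ALIGN_RE.match(line)
    if m is None:
        return line
    indent, gap, value = m.group(1), m.group(2), int(m.group(3))
    if is_power_of_two(value):
        return line  # valid byte count; leave it alone
    return f"{indent}.p2align{gap}{value}"


def fix_align_source(text):
    """Apply fix_align_line to every line of an assembly source string."""
    return "\n".join(fix_align_line(l) for l in text.splitlines())
```

Applied to rijn_k32b32.gnu.S, this would change the directives reported at lines 1 and 15 and leave every other line untouched; whether 32-byte alignment is what the code needs still has to be confirmed against the implementation.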

Number of similar (compiler,implementation) pairs: 5, namely:
Compiler Implementations
clang -O3 -fomit-frame-pointer gil
clang -O3 -fwrapv -march=native -fomit-frame-pointer gil
clang -O3 -fwrapv -mavx2 -fomit-frame-pointer gil
clang -O3 -fwrapv -mavx -fomit-frame-pointer gil
clang -O3 -fwrapv -mavx -maes -mpclmul -fomit-frame-pointer gil

Compiler output

Implementation: crypto_stream/rijn256ctr/gil
Compiler: clang -O3 -fwrapv -march=native -fpolly -funroll-loops -fvectorize -fslp-vectorize -fslp-vectorize-aggressive -fomit-frame-pointer
stream.c: clang-3.3: warning: argument unused during compilation: '-fpolly'
rijn_k32b32.gnu.S: clang-3.3: warning: argument unused during compilation: '-fpolly'
rijn_k32b32.gnu.S: rijn_k32b32.gnu.S:1:8: error: alignment must be a power of 2
rijn_k32b32.gnu.S: .align 5
rijn_k32b32.gnu.S: ^
rijn_k32b32.gnu.S: rijn_k32b32.gnu.S:15:8: error: alignment must be a power of 2
rijn_k32b32.gnu.S: .align 5
rijn_k32b32.gnu.S: ^

Number of similar (compiler,implementation) pairs: 3, namely:
Compiler Implementations
clang -O3 -fwrapv -march=native -fpolly -funroll-loops -fvectorize -fslp-vectorize -fslp-vectorize-aggressive -fomit-frame-pointer gil
clang -O3 -fwrapv -mavx -fpolly -funroll-loops -fvectorize -fslp-vectorize -fslp-vectorize-aggressive -fomit-frame-pointer gil
clang -O3 -fwrapv -mavx -maes -mpclmul -fpolly -funroll-loops -fvectorize -fslp-vectorize -fslp-vectorize-aggressive -fomit-frame-pointer gil

Compiler output

Implementation: crypto_stream/rijn256ctr/gil
Compiler: clang -O3 -fwrapv -march=x86-64 -mcpu=core-avx2 -mavx2 -maes -mpclmul -fomit-frame-pointer
stream.c: clang-3.3: warning: argument unused during compilation: '-mcpu=core-avx2'
rijn_k32b32.gnu.S: clang-3.3: warning: argument unused during compilation: '-mcpu=core-avx2'
rijn_k32b32.gnu.S: rijn_k32b32.gnu.S:1:8: error: alignment must be a power of 2
rijn_k32b32.gnu.S: .align 5
rijn_k32b32.gnu.S: ^
rijn_k32b32.gnu.S: rijn_k32b32.gnu.S:15:8: error: alignment must be a power of 2
rijn_k32b32.gnu.S: .align 5
rijn_k32b32.gnu.S: ^

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler Implementations
clang -O3 -fwrapv -march=x86-64 -mcpu=core-avx2 -mavx2 -maes -mpclmul -fomit-frame-pointer gil

Compiler output

Implementation: crypto_stream/rijn256ctr/gil
Compiler: gcc-4.7 -m64 -march=barcelona -O2 -fomit-frame-pointer
rijn_k32b32.gnu.S: rijn_k32b32.gnu.S: Assembler messages:
rijn_k32b32.gnu.S: rijn_k32b32.gnu.S:1: Error: alignment not a power of 2
rijn_k32b32.gnu.S: rijn_k32b32.gnu.S:15: Error: alignment not a power of 2
rijn_k32b32.gnu.S: rijn_k32b32.gnu.S: Assembler messages:
rijn_k32b32.gnu.S: rijn_k32b32.gnu.S:1: Error: alignment not a power of 2
rijn_k32b32.gnu.S: rijn_k32b32.gnu.S:15: Error: alignment not a power of 2

Number of similar (compiler,implementation) pairs: 8, namely:
Compiler Implementations
gcc-4.7 -m64 -march=barcelona -O2 -fomit-frame-pointer gil
gcc-4.7 -m64 -march=barcelona -O3 -fomit-frame-pointer gil
gcc-4.7 -m64 -march=barcelona -O -fomit-frame-pointer gil
gcc-4.7 -m64 -march=barcelona -Os -fomit-frame-pointer gil
gcc-4.8 -m64 -march=barcelona -O2 -fomit-frame-pointer gil
gcc-4.8 -m64 -march=barcelona -O3 -fomit-frame-pointer gil
gcc-4.8 -m64 -march=barcelona -O -fomit-frame-pointer gil
gcc-4.8 -m64 -march=barcelona -Os -fomit-frame-pointer gil