Implementation notes: amd64, par, crypto_hash/blake2b

Computer: par
Architecture: amd64
CPU ID: GenuineIntel-000406c3-bfebfbff
SUPERCOP version: 20161026
Operation: crypto_hash
Primitive: blake2b
Time   Implementation  Compiler                                           Benchmark date  SUPERCOP version
12900  regs            gcc -march=native -mcpu=native -O2                 20161214        20161026
12920  regs            gcc -funroll-loops -march=native -mcpu=native -O3  20161214        20161026
12920  regs            gcc -march=native -mcpu=native -O3                 20161214        20161026
12940  regs            gcc -funroll-loops -march=native -mcpu=native -O2  20161214        20161026
13060  regs            gcc -funroll-loops -march=native -mcpu=native -Os  20161214        20161026
13120  regs            gcc -march=native -mcpu=native -Os                 20161214        20161026
14680  ref             gcc -funroll-loops -march=native -mcpu=native -O2  20161214        20161026
14860  ref             gcc -march=native -mcpu=native -O3                 20161214        20161026
15000  ref             gcc -funroll-loops -march=native -mcpu=native -O3  20161214        20161026
15760  ref             gcc -march=native -mcpu=native -O2                 20161214        20161026
18060  ref             gcc -funroll-loops -march=native -mcpu=native -Os  20161214        20161026
18120  ref             gcc -march=native -mcpu=native -Os                 20161214        20161026
37860  xmm             gcc -march=native -mcpu=native -Os                 20161214        20161026
38820  xmm             gcc -funroll-loops -march=native -mcpu=native -Os  20161214        20161026
38880  xmm             gcc -funroll-loops -march=native -mcpu=native -O3  20161214        20161026
38900  xmm             gcc -funroll-loops -march=native -mcpu=native -O2  20161214        20161026
40280  xmm             gcc -march=native -mcpu=native -O2                 20161214        20161026
40280  xmm             gcc -march=native -mcpu=native -O3                 20161214        20161026
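
To compare the implementations in the table above, one can take the lowest recorded time per implementation and express it as a multiple of the overall fastest. The sketch below hard-codes the eighteen time values from this page; the names `TIMES`, `best_per_implementation`, and `relative_to_fastest` are illustrative and not part of SUPERCOP.

```python
# Time values copied from the table above, grouped by implementation.
TIMES = {
    "regs": [12900, 12920, 12920, 12940, 13060, 13120],
    "ref": [14680, 14860, 15000, 15760, 18060, 18120],
    "xmm": [37860, 38820, 38880, 38900, 40280, 40280],
}


def best_per_implementation(times):
    """Return the lowest recorded time for each implementation."""
    return {impl: min(values) for impl, values in times.items()}


def relative_to_fastest(best):
    """Express each best time as a multiple of the overall fastest one."""
    fastest = min(best.values())
    return {impl: round(value / fastest, 2) for impl, value in best.items()}


best = best_per_implementation(TIMES)
ratios = relative_to_fastest(best)

# Print implementations from fastest to slowest.
for impl in sorted(best, key=best.get):
    print(f"{impl}: {best[impl]} ({ratios[impl]}x)")
```

On these numbers the best `ref` build takes about 1.14 times as long as the best `regs` build, and the best `xmm` build about 2.93 times as long.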

Test failure

Implementation: crypto_hash/blake2b/avxicc
Compiler: gcc -funroll-loops -march=native -mcpu=native -O2
error 111

Number of similar (compiler,implementation) pairs: 24, namely:
Compiler                                           Implementations
gcc -funroll-loops -march=native -mcpu=native -O2  avxicc moon/avx/64 moon/avx2/64 moon/xop/64
gcc -funroll-loops -march=native -mcpu=native -O3  avxicc moon/avx/64 moon/avx2/64 moon/xop/64
gcc -funroll-loops -march=native -mcpu=native -Os  avxicc moon/avx/64 moon/avx2/64 moon/xop/64
gcc -march=native -mcpu=native -O2                 avxicc moon/avx/64 moon/avx2/64 moon/xop/64
gcc -march=native -mcpu=native -O3                 avxicc moon/avx/64 moon/avx2/64 moon/xop/64
gcc -march=native -mcpu=native -Os                 avxicc moon/avx/64 moon/avx2/64 moon/xop/64

Compiler output

Implementation: crypto_hash/blake2b/ref
Compiler: gcc -funroll-loops -march=native -mcpu=native -O2
blake2b-ref.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
measure.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead

Number of similar (compiler,implementation) pairs: 6, namely:
Compiler                                           Implementations
gcc -funroll-loops -march=native -mcpu=native -O2  ref
gcc -funroll-loops -march=native -mcpu=native -O3  ref
gcc -funroll-loops -march=native -mcpu=native -Os  ref
gcc -march=native -mcpu=native -O2                 ref
gcc -march=native -mcpu=native -O3                 ref
gcc -march=native -mcpu=native -Os                 ref
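
Every compile on this page emits the same warning because each compiler line includes `-mcpu=native`, which this gcc reports as deprecated in favour of `-mtune=` or `-march=`. The following is a minimal sketch of how a compiler line could be rewritten to avoid the warning. The helper name `modernize_flags` is hypothetical, and picking `-mtune=` as the replacement is an assumption based only on the wording of the warning.

```python
def modernize_flags(command):
    """Replace deprecated '-mcpu=X' options with '-mtune=X' in a gcc command line."""
    prefix = "-mcpu="
    tokens = []
    for token in command.split():
        if token.startswith(prefix):
            # Keep the option's value, swap only the option name.
            token = "-mtune=" + token[len(prefix):]
        tokens.append(token)
    return " ".join(tokens)


print(modernize_flags("gcc -funroll-loops -march=native -mcpu=native -O2"))
```

Since `-march=native` already implies the matching `-mtune` setting in gcc, dropping the `-mcpu=` option entirely would be an equally reasonable choice.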

Compiler output

Implementation: crypto_hash/blake2b/avx2-1
Compiler: gcc -funroll-loops -march=native -mcpu=native -O2
blake2b.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
blake2b.c: blake2b.c: In function 'crypto_hash_blake2b_avx2_1':
blake2b.c: blake2b.c:37:17: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
blake2b.c: const __m256i r16 = _mm256_setr_epi8( 2, 3, 4, 5, 6, 7, 0, 1, 10, 11, 12, 13, 14, 15, 8, 9,
blake2b.c: ^~~
blake2b.c: In file included from /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/immintrin.h:41:0,
blake2b.c: from blake2b.c:11:
blake2b.c: /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/avxintrin.h:1270:1: error: inlining failed in call to always_inline '_mm256_set_epi64x': target specific option mismatch
blake2b.c: _mm256_set_epi64x (long long __A, long long __B, long long __C,
blake2b.c: ^~~~~~~~~~~~~~~~~
blake2b.c: blake2b.c:57:8: note: called from here
blake2b.c: ymm1 = _mm256_set_epi64x(0x5BE0CD19137E2179ULL, 0x1F83D9ABFB41BD6BULL, 0x9B05688C2B3E6C1FULL, 0x510E527FADE682D1ULL);
blake2b.c: ~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
blake2b.c: In file included from /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/immintrin.h:41:0,
blake2b.c: from blake2b.c:11:
blake2b.c: /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/avxintrin.h:1270:1: error: inlining failed in call to always_inline '_mm256_set_epi64x': target specific option mismatch
blake2b.c: _mm256_set_epi64x (long long __A, long long __B, long long __C,
blake2b.c: ^~~~~~~~~~~~~~~~~
blake2b.c: blake2b.c:56:8: note: called from here
blake2b.c: ymm0 = _mm256_set_epi64x(0xA54FF53A5F1D36F1ULL, 0x3C6EF372FE94F82BULL, 0xBB67AE8584CAA73BULL, 0x6A09E667F3BCC908ULL ^ 0x0000000001010040ULL);
blake2b.c: ~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
blake2b.c: In file included from /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/immintrin.h:41:0,
blake2b.c: from blake2b.c:11:
blake2b.c: /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/avxintrin.h:1357:1: error: inlining failed in call to always_inline '_mm256_setr_epi8': target specific option mismatch
blake2b.c: _mm256_setr_epi8 (char __q31, char __q30, char __q29, char __q28,
blake2b.c: ...

Number of similar (compiler,implementation) pairs: 6, namely:
Compiler                                           Implementations
gcc -funroll-loops -march=native -mcpu=native -O2  avx2-1
gcc -funroll-loops -march=native -mcpu=native -O3  avx2-1
gcc -funroll-loops -march=native -mcpu=native -Os  avx2-1
gcc -march=native -mcpu=native -O2                 avx2-1
gcc -march=native -mcpu=native -O3                 avx2-1
gcc -march=native -mcpu=native -Os                 avx2-1
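
The compiler output above mixes warnings, errors, and notes. A rough way to summarize such a log is to count diagnostics by severity. The sketch below assumes each line has the `file: message` shape seen on this page; the helper `tally_diagnostics` is hypothetical, and the sample lines are copied from the avx2-1 output.

```python
import re
from collections import Counter

# Matches the severity keyword that gcc and the assembler print before a message.
SEVERITY = re.compile(r"\b(warning|error|note): ", re.IGNORECASE)


def tally_diagnostics(lines):
    """Count log lines by the first severity keyword they contain."""
    counts = Counter()
    for line in lines:
        match = SEVERITY.search(line)
        if match:
            counts[match.group(1).lower()] += 1
    return counts


sample = [
    "blake2b.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead",
    "blake2b.c: blake2b.c:37:17: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]",
    "blake2b.c: /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/avxintrin.h:1270:1: error: inlining failed in call to always_inline '_mm256_set_epi64x': target specific option mismatch",
    "blake2b.c: blake2b.c:57:8: note: called from here",
    "blake2b.c: ^~~",
]

print(dict(tally_diagnostics(sample)))
```

Lines that only carry source excerpts or caret markers contain no severity keyword and are skipped.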

Compiler output

Implementation: crypto_hash/blake2b/avx2-2
Compiler: gcc -funroll-loops -march=native -mcpu=native -O2
blake2b.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
blake2b.c: blake2b.c: In function 'crypto_hash_blake2b_avx2_2':
blake2b.c: blake2b.c:38:17: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
blake2b.c: const __m256i r16 = _mm256_setr_epi8( 2, 3, 4, 5, 6, 7, 0, 1, 10, 11, 12, 13, 14, 15, 8, 9,
blake2b.c: ^~~
blake2b.c: In file included from /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/immintrin.h:41:0,
blake2b.c: from blake2b.c:12:
blake2b.c: /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/avxintrin.h:1270:1: error: inlining failed in call to always_inline '_mm256_set_epi64x': target specific option mismatch
blake2b.c: _mm256_set_epi64x (long long __A, long long __B, long long __C,
blake2b.c: ^~~~~~~~~~~~~~~~~
blake2b.c: blake2b.c:58:8: note: called from here
blake2b.c: ymm1 = _mm256_set_epi64x(0x5BE0CD19137E2179ULL, 0x1F83D9ABFB41BD6BULL, 0x9B05688C2B3E6C1FULL, 0x510E527FADE682D1ULL);
blake2b.c: ~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
blake2b.c: In file included from /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/immintrin.h:41:0,
blake2b.c: from blake2b.c:12:
blake2b.c: /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/avxintrin.h:1270:1: error: inlining failed in call to always_inline '_mm256_set_epi64x': target specific option mismatch
blake2b.c: _mm256_set_epi64x (long long __A, long long __B, long long __C,
blake2b.c: ^~~~~~~~~~~~~~~~~
blake2b.c: blake2b.c:57:8: note: called from here
blake2b.c: ymm0 = _mm256_set_epi64x(0xA54FF53A5F1D36F1ULL, 0x3C6EF372FE94F82BULL, 0xBB67AE8584CAA73BULL, 0x6A09E667F3BCC908ULL ^ 0x0000000001010040ULL);
blake2b.c: ~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
blake2b.c: In file included from /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/immintrin.h:41:0,
blake2b.c: from blake2b.c:12:
blake2b.c: /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/avxintrin.h:1357:1: error: inlining failed in call to always_inline '_mm256_setr_epi8': target specific option mismatch
blake2b.c: _mm256_setr_epi8 (char __q31, char __q30, char __q29, char __q28,
blake2b.c: ...

Number of similar (compiler,implementation) pairs: 6, namely:
Compiler                                           Implementations
gcc -funroll-loops -march=native -mcpu=native -O2  avx2-2
gcc -funroll-loops -march=native -mcpu=native -O3  avx2-2
gcc -funroll-loops -march=native -mcpu=native -Os  avx2-2
gcc -march=native -mcpu=native -O2                 avx2-2
gcc -march=native -mcpu=native -O3                 avx2-2
gcc -march=native -mcpu=native -Os                 avx2-2

Compiler output

Implementation: crypto_hash/blake2b/avx2-3
Compiler: gcc -funroll-loops -march=native -mcpu=native -O2
blake2b.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
blake2b.c: blake2b.c: In function 'crypto_hash_blake2b_avx2_3':
blake2b.c: blake2b.c:38:17: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
blake2b.c: const __m256i r16 = _mm256_setr_epi8( 2, 3, 4, 5, 6, 7, 0, 1, 10, 11, 12, 13, 14, 15, 8, 9,
blake2b.c: ^~~
blake2b.c: In file included from /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/immintrin.h:41:0,
blake2b.c: from blake2b.c:12:
blake2b.c: /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/avxintrin.h:1270:1: error: inlining failed in call to always_inline '_mm256_set_epi64x': target specific option mismatch
blake2b.c: _mm256_set_epi64x (long long __A, long long __B, long long __C,
blake2b.c: ^~~~~~~~~~~~~~~~~
blake2b.c: blake2b.c:58:8: note: called from here
blake2b.c: ymm1 = _mm256_set_epi64x(0x5BE0CD19137E2179ULL, 0x1F83D9ABFB41BD6BULL, 0x9B05688C2B3E6C1FULL, 0x510E527FADE682D1ULL);
blake2b.c: ~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
blake2b.c: In file included from /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/immintrin.h:41:0,
blake2b.c: from blake2b.c:12:
blake2b.c: /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/avxintrin.h:1270:1: error: inlining failed in call to always_inline '_mm256_set_epi64x': target specific option mismatch
blake2b.c: _mm256_set_epi64x (long long __A, long long __B, long long __C,
blake2b.c: ^~~~~~~~~~~~~~~~~
blake2b.c: blake2b.c:57:8: note: called from here
blake2b.c: ymm0 = _mm256_set_epi64x(0xA54FF53A5F1D36F1ULL, 0x3C6EF372FE94F82BULL, 0xBB67AE8584CAA73BULL, 0x6A09E667F3BCC908ULL ^ 0x0000000001010040ULL);
blake2b.c: ~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
blake2b.c: In file included from /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/immintrin.h:41:0,
blake2b.c: from blake2b.c:12:
blake2b.c: /usr/lib/gcc/x86_64-pc-linux-gnu/6.2.1/include/avxintrin.h:1357:1: error: inlining failed in call to always_inline '_mm256_setr_epi8': target specific option mismatch
blake2b.c: _mm256_setr_epi8 (char __q31, char __q30, char __q29, char __q28,
blake2b.c: ...

Number of similar (compiler,implementation) pairs: 6, namely:
Compiler                                           Implementations
gcc -funroll-loops -march=native -mcpu=native -O2  avx2-3
gcc -funroll-loops -march=native -mcpu=native -O3  avx2-3
gcc -funroll-loops -march=native -mcpu=native -Os  avx2-3
gcc -march=native -mcpu=native -O2                 avx2-3
gcc -march=native -mcpu=native -O3                 avx2-3
gcc -march=native -mcpu=native -Os                 avx2-3

Compiler output

Implementation: crypto_hash/blake2b/ymm
Compiler: gcc -funroll-loops -march=native -mcpu=native -O2
blake2b.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
blake2b.c: blake2b.c:15:2: error: #error "This code requires at least AVX."
blake2b.c: #error "This code requires at least AVX."
blake2b.c: ^~~~~
blake2b.c: blake2b.c: In function 'crypto_hash_blake2b_ymm':
blake2b.c: blake2b.c:203:10: error: incompatible types when assigning to type '__m128i {aka __vector(2) long long int}' from type '__m256i {aka __vector(4) long long int}'
blake2b.c: m0 = _mm256_loadu_si256((__m256i*)(in + 00));
blake2b.c: ^
blake2b.c: blake2b.c:204:10: error: incompatible types when assigning to type '__m128i {aka __vector(2) long long int}' from type '__m256i {aka __vector(4) long long int}'
blake2b.c: m1 = _mm256_loadu_si256((__m256i*)(in + 32));
blake2b.c: ^
blake2b.c: blake2b.c:205:10: error: incompatible types when assigning to type '__m128i {aka __vector(2) long long int}' from type '__m256i {aka __vector(4) long long int}'
blake2b.c: m2 = _mm256_loadu_si256((__m256i*)(in + 64));
blake2b.c: ^
blake2b.c: blake2b.c:206:10: error: incompatible types when assigning to type '__m128i {aka __vector(2) long long int}' from type '__m256i {aka __vector(4) long long int}'
blake2b.c: m3 = _mm256_loadu_si256((__m256i*)(in + 96));
blake2b.c: ^
blake2b.c: blake2b.c:225:10: error: incompatible types when assigning to type '__m128i {aka __vector(2) long long int}' from type '__m256i {aka __vector(4) long long int}'
blake2b.c: m0 = _mm256_loadu_si256((__m256i*)(buffer + 00));
blake2b.c: ^
blake2b.c: blake2b.c:226:10: error: incompatible types when assigning to type '__m128i {aka __vector(2) long long int}' from type '__m256i {aka __vector(4) long long int}'
blake2b.c: m1 = _mm256_loadu_si256((__m256i*)(buffer + 32));
blake2b.c: ^
blake2b.c: blake2b.c:227:10: error: incompatible types when assigning to type '__m128i {aka __vector(2) long long int}' from type '__m256i {aka __vector(4) long long int}'
blake2b.c: m2 = _mm256_loadu_si256((__m256i*)(buffer + 64));
blake2b.c: ...

Number of similar (compiler,implementation) pairs: 6, namely:
Compiler                                           Implementations
gcc -funroll-loops -march=native -mcpu=native -O2  ymm
gcc -funroll-loops -march=native -mcpu=native -O3  ymm
gcc -funroll-loops -march=native -mcpu=native -Os  ymm
gcc -march=native -mcpu=native -O2                 ymm
gcc -march=native -mcpu=native -O3                 ymm
gcc -march=native -mcpu=native -Os                 ymm

Compiler output

Implementation: crypto_hash/blake2b/regs
Compiler: gcc -funroll-loops -march=native -mcpu=native -O2
blake2b.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
measure.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead

Number of similar (compiler,implementation) pairs: 12, namely:
Compiler                                           Implementations
gcc -funroll-loops -march=native -mcpu=native -O2  regs xmm
gcc -funroll-loops -march=native -mcpu=native -O3  regs xmm
gcc -funroll-loops -march=native -mcpu=native -Os  regs xmm
gcc -march=native -mcpu=native -O2                 regs xmm
gcc -march=native -mcpu=native -O3                 regs xmm
gcc -march=native -mcpu=native -Os                 regs xmm

Compiler output

Implementation: crypto_hash/blake2b/avxicc
Compiler: gcc -funroll-loops -march=native -mcpu=native -O2
blake2b.s: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead

Number of similar (compiler,implementation) pairs: 6, namely:
Compiler                                           Implementations
gcc -funroll-loops -march=native -mcpu=native -O2  avxicc
gcc -funroll-loops -march=native -mcpu=native -O3  avxicc
gcc -funroll-loops -march=native -mcpu=native -Os  avxicc
gcc -march=native -mcpu=native -O2                 avxicc
gcc -march=native -mcpu=native -O3                 avxicc
gcc -march=native -mcpu=native -Os                 avxicc

Compiler output

Implementation: crypto_hash/blake2b/moon/neon/32
Compiler: gcc -funroll-loops -march=native -mcpu=native -O2
crypto_hash.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
blake2b.S: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
blake2b.S: blake2b.S: Assembler messages:
blake2b.S: blake2b.S:3: Error: no such architecture: `armv7'
blake2b.S: blake2b.S:3: Error: junk at end of line, first unrecognized character is `-'
blake2b.S: blake2b.S:4: Error: unknown pseudo-op: `.fpu'
blake2b.S: blake2b.S:9: Error: no such instruction: `vstmdb sp!,{q4-q7}'
blake2b.S: blake2b.S:10: Error: no such instruction: `stmfd sp!,{r4-r12,r14}'
blake2b.S: blake2b.S:11: Error: too many memory references for `mov'
blake2b.S: blake2b.S:12: Error: too many memory references for `sub'
blake2b.S: blake2b.S:13: Error: too many memory references for `and'
blake2b.S: blake2b.S:14: Error: invalid char '[' beginning operand 2 `[sp'
blake2b.S: blake2b.S:15: Error: too many memory references for `sub'
blake2b.S: blake2b.S:16: Error: expecting operand after ','; got nothing
blake2b.S: blake2b.S:17: Error: expecting operand after ','; got nothing
blake2b.S: blake2b.S:18: Error: no such instruction: `bhs blake2b_blocks_neon_use128'
blake2b.S: blake2b.S:19: Error: too many memory references for `mov'
blake2b.S: blake2b.S:21: Error: no such instruction: `ldr r6,[r0,'
blake2b.S: blake2b.S:22: Error: no such instruction: `ldr r7,[r0,'
blake2b.S: blake2b.S:23: Error: too many memory references for `and'
blake2b.S: blake2b.S:24: Error: no such instruction: `beq blake2b_blocks_neon_nocopymessage'
blake2b.S: blake2b.S:25: Error: expecting operand after ','; got nothing
blake2b.S: blake2b.S:26: Error: no such instruction: `beq blake2b_blocks_neon_nocopymessage'
blake2b.S: blake2b.S:27: Error: no such instruction: `veor q0,q0,q0'
blake2b.S: blake2b.S:28: Error: no such instruction: `veor q1,q1,q1'
blake2b.S: blake2b.S:29: Error: no such instruction: `vst1.64 {q0,q1},[sp,:256]!'
blake2b.S: ...

Number of similar (compiler,implementation) pairs: 6, namely:
Compiler                                           Implementations
gcc -funroll-loops -march=native -mcpu=native -O2  moon/neon/32
gcc -funroll-loops -march=native -mcpu=native -O3  moon/neon/32
gcc -funroll-loops -march=native -mcpu=native -Os  moon/neon/32
gcc -march=native -mcpu=native -O2                 moon/neon/32
gcc -march=native -mcpu=native -O3                 moon/neon/32
gcc -march=native -mcpu=native -Os                 moon/neon/32

Compiler output

Implementation: crypto_hash/blake2b/moon/armv6/32
Compiler: gcc -funroll-loops -march=native -mcpu=native -O2
crypto_hash.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
blake2b.S: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
blake2b.S: blake2b.S: Assembler messages:
blake2b.S: blake2b.S:5: Error: no such architecture: `armv5'
blake2b.S: blake2b.S:34: Error: no such instruction: `stmfd sp!,{r4-r12,r14}'
blake2b.S: blake2b.S:35: Error: too many memory references for `sub'
blake2b.S: blake2b.S:36: Error: expecting operand after ','; got nothing
blake2b.S: blake2b.S:37: Error: expecting operand after ','; got nothing
blake2b.S: blake2b.S:38: Error: no such instruction: `bhs blake2b_blocks_armv6_use128'
blake2b.S: blake2b.S:39: Error: too many memory references for `mov'
blake2b.S: blake2b.S:41: Error: too many memory references for `mov'
blake2b.S: blake2b.S:42: Error: invalid char '[' beginning operand 2 `[sp'
blake2b.S: blake2b.S:43: Error: invalid char '[' beginning operand 2 `[sp'
blake2b.S: blake2b.S:44: Error: invalid char '[' beginning operand 2 `[sp'
blake2b.S: blake2b.S:45: Error: invalid char '[' beginning operand 2 `[sp'
blake2b.S: blake2b.S:46: Error: no such instruction: `ldr r4,[r0,'
blake2b.S: blake2b.S:47: Error: no such instruction: `ldr r5,[r0,'
blake2b.S: blake2b.S:48: Error: too many memory references for `and'
blake2b.S: blake2b.S:49: Error: no such instruction: `beq blake2b_blocks_armv6_nocopymessage'
blake2b.S: blake2b.S:50: Error: expecting operand after ','; got nothing
blake2b.S: blake2b.S:51: Error: no such instruction: `beq blake2b_blocks_armv6_nocopymessage'
blake2b.S: blake2b.S:52: Error: no such instruction: `eor r4,r4,r4'
blake2b.S: blake2b.S:53: Error: no such instruction: `eor r5,r5,r5'
blake2b.S: blake2b.S:54: Error: no such instruction: `eor r6,r6,r6'
blake2b.S: blake2b.S:55: Error: no such instruction: `eor r7,r7,r7'
blake2b.S: blake2b.S:56: Error: no such instruction: `eor r8,r8,r8'
blake2b.S: ...

Number of similar (compiler,implementation) pairs: 6, namely:
Compiler                                           Implementations
gcc -funroll-loops -march=native -mcpu=native -O2  moon/armv6/32
gcc -funroll-loops -march=native -mcpu=native -O3  moon/armv6/32
gcc -funroll-loops -march=native -mcpu=native -Os  moon/armv6/32
gcc -march=native -mcpu=native -O2                 moon/armv6/32
gcc -march=native -mcpu=native -O3                 moon/armv6/32
gcc -march=native -mcpu=native -Os                 moon/armv6/32

Compiler output

Implementation: crypto_hash/blake2b/moon/avx2/32
Compiler: gcc -funroll-loops -march=native -mcpu=native -O2
crypto_hash.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
blake2b.S: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
blake2b.S: blake2b.S: Assembler messages:
blake2b.S: blake2b.S:6: Error: invalid instruction suffix for `push'
blake2b.S: blake2b.S:9: Error: invalid instruction suffix for `push'
blake2b.S: blake2b.S:10: Error: invalid instruction suffix for `push'
blake2b.S: blake2b.S:11: Error: invalid instruction suffix for `push'
blake2b.S: blake2b.S:14: Error: incorrect register `%ecx' used with `q' suffix
blake2b.S: blake2b.S:15: Error: incorrect register `%eax' used with `q' suffix
blake2b.S: blake2b.S:248: Error: invalid instruction suffix for `pop'
blake2b.S: blake2b.S:249: Error: invalid instruction suffix for `pop'
blake2b.S: blake2b.S:250: Error: invalid instruction suffix for `pop'
blake2b.S: blake2b.S:252: Error: invalid instruction suffix for `pop'

Number of similar (compiler,implementation) pairs: 6, namely:
Compiler                                           Implementations
gcc -funroll-loops -march=native -mcpu=native -O2  moon/avx2/32
gcc -funroll-loops -march=native -mcpu=native -O3  moon/avx2/32
gcc -funroll-loops -march=native -mcpu=native -Os  moon/avx2/32
gcc -march=native -mcpu=native -O2                 moon/avx2/32
gcc -march=native -mcpu=native -O3                 moon/avx2/32
gcc -march=native -mcpu=native -Os                 moon/avx2/32

Compiler output

Implementation: crypto_hash/blake2b/moon/ssse3/32
Compiler: gcc -funroll-loops -march=native -mcpu=native -O2
crypto_hash.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
blake2b.S: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
blake2b.S: blake2b.S: Assembler messages:
blake2b.S: blake2b.S:6: Error: invalid instruction suffix for `push'
blake2b.S: blake2b.S:9: Error: invalid instruction suffix for `push'
blake2b.S: blake2b.S:10: Error: invalid instruction suffix for `push'
blake2b.S: blake2b.S:11: Error: invalid instruction suffix for `push'
blake2b.S: blake2b.S:16: Error: incorrect register `%ebx' used with `q' suffix
blake2b.S: blake2b.S:19: Error: incorrect register `%ebx' used with `q' suffix
blake2b.S: blake2b.S:1566: Error: invalid instruction suffix for `pop'
blake2b.S: blake2b.S:1567: Error: invalid instruction suffix for `pop'
blake2b.S: blake2b.S:1568: Error: invalid instruction suffix for `pop'
blake2b.S: blake2b.S:1570: Error: invalid instruction suffix for `pop'

Number of similar (compiler,implementation) pairs: 6, namely:
Compiler                                           Implementations
gcc -funroll-loops -march=native -mcpu=native -O2  moon/ssse3/32
gcc -funroll-loops -march=native -mcpu=native -O3  moon/ssse3/32
gcc -funroll-loops -march=native -mcpu=native -Os  moon/ssse3/32
gcc -march=native -mcpu=native -O2                 moon/ssse3/32
gcc -march=native -mcpu=native -O3                 moon/ssse3/32
gcc -march=native -mcpu=native -Os                 moon/ssse3/32

Compiler output

Implementation: crypto_hash/blake2b/moon/avx/32
Compiler: gcc -funroll-loops -march=native -mcpu=native -O2
crypto_hash.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
blake2b.S: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
blake2b.S: blake2b.S: Assembler messages:
blake2b.S: blake2b.S:6: Error: invalid instruction suffix for `push'
blake2b.S: blake2b.S:9: Error: invalid instruction suffix for `push'
blake2b.S: blake2b.S:10: Error: invalid instruction suffix for `push'
blake2b.S: blake2b.S:11: Error: invalid instruction suffix for `push'
blake2b.S: blake2b.S:16: Error: incorrect register `%ecx' used with `q' suffix
blake2b.S: blake2b.S:19: Error: incorrect register `%ecx' used with `q' suffix
blake2b.S: blake2b.S:927: Error: invalid instruction suffix for `pop'
blake2b.S: blake2b.S:928: Error: invalid instruction suffix for `pop'
blake2b.S: blake2b.S:929: Error: invalid instruction suffix for `pop'
blake2b.S: blake2b.S:931: Error: invalid instruction suffix for `pop'

Number of similar (compiler,implementation) pairs: 6, namely:
Compiler                                           Implementations
gcc -funroll-loops -march=native -mcpu=native -O2  moon/avx/32
gcc -funroll-loops -march=native -mcpu=native -O3  moon/avx/32
gcc -funroll-loops -march=native -mcpu=native -Os  moon/avx/32
gcc -march=native -mcpu=native -O2                 moon/avx/32
gcc -march=native -mcpu=native -O3                 moon/avx/32
gcc -march=native -mcpu=native -Os                 moon/avx/32

Compiler output

Implementation: crypto_hash/blake2b/moon/sse2/32
Compiler: gcc -funroll-loops -march=native -mcpu=native -O2
crypto_hash.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
blake2b.S: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
blake2b.S: blake2b.S: Assembler messages:
blake2b.S: blake2b.S:6: Error: invalid instruction suffix for `push'
blake2b.S: blake2b.S:9: Error: invalid instruction suffix for `push'
blake2b.S: blake2b.S:10: Error: invalid instruction suffix for `push'
blake2b.S: blake2b.S:11: Error: invalid instruction suffix for `push'
blake2b.S: blake2b.S:16: Error: incorrect register `%esi' used with `q' suffix
blake2b.S: blake2b.S:1966: Error: invalid instruction suffix for `pop'
blake2b.S: blake2b.S:1967: Error: invalid instruction suffix for `pop'
blake2b.S: blake2b.S:1968: Error: invalid instruction suffix for `pop'
blake2b.S: blake2b.S:1970: Error: invalid instruction suffix for `pop'

Number of similar (compiler,implementation) pairs: 6, namely:
Compiler                                           Implementations
gcc -funroll-loops -march=native -mcpu=native -O2  moon/sse2/32
gcc -funroll-loops -march=native -mcpu=native -O3  moon/sse2/32
gcc -funroll-loops -march=native -mcpu=native -Os  moon/sse2/32
gcc -march=native -mcpu=native -O2                 moon/sse2/32
gcc -march=native -mcpu=native -O3                 moon/sse2/32
gcc -march=native -mcpu=native -Os                 moon/sse2/32

Compiler output

Implementation: crypto_hash/blake2b/moon/xop/32
Compiler: gcc -funroll-loops -march=native -mcpu=native -O2
crypto_hash.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
blake2b.S: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
blake2b.S: blake2b.S: Assembler messages:
blake2b.S: blake2b.S:8: Error: invalid instruction suffix for `push'
blake2b.S: blake2b.S:9: Error: invalid instruction suffix for `push'
blake2b.S: blake2b.S:11: Error: invalid instruction suffix for `push'
blake2b.S: blake2b.S:12: Error: invalid instruction suffix for `push'
blake2b.S: blake2b.S:13: Error: invalid instruction suffix for `push'
blake2b.S: blake2b.S:14: Error: incorrect register `%ebx' used with `q' suffix
blake2b.S: blake2b.S:15: Error: invalid instruction suffix for `push'
blake2b.S: blake2b.S:287: Error: invalid instruction suffix for `pop'
blake2b.S: blake2b.S:288: Error: invalid instruction suffix for `pop'
blake2b.S: blake2b.S:289: Error: invalid instruction suffix for `pop'
blake2b.S: blake2b.S:290: Error: invalid instruction suffix for `pop'
blake2b.S: blake2b.S:291: Error: invalid instruction suffix for `pop'

Number of similar (compiler,implementation) pairs: 6, namely:
Compiler                                           Implementations
gcc -funroll-loops -march=native -mcpu=native -O2  moon/xop/32
gcc -funroll-loops -march=native -mcpu=native -O3  moon/xop/32
gcc -funroll-loops -march=native -mcpu=native -Os  moon/xop/32
gcc -march=native -mcpu=native -O2                 moon/xop/32
gcc -march=native -mcpu=native -O3                 moon/xop/32
gcc -march=native -mcpu=native -Os                 moon/xop/32

Compiler output

Implementation: crypto_hash/blake2b/moon/avx/64
Compiler: gcc -funroll-loops -march=native -mcpu=native -O2
crypto_hash.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
blake2b.S: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead
try.c: gcc: warning: '-mcpu=' is deprecated; use '-mtune=' or '-march=' instead

Number of similar (compiler,implementation) pairs: 18, namely:
Compiler                                           Implementations
gcc -funroll-loops -march=native -mcpu=native -O2  moon/avx/64 moon/avx2/64 moon/xop/64
gcc -funroll-loops -march=native -mcpu=native -O3  moon/avx/64 moon/avx2/64 moon/xop/64
gcc -funroll-loops -march=native -mcpu=native -Os  moon/avx/64 moon/avx2/64 moon/xop/64
gcc -march=native -mcpu=native -O2                 moon/avx/64 moon/avx2/64 moon/xop/64
gcc -march=native -mcpu=native -O3                 moon/avx/64 moon/avx2/64 moon/xop/64
gcc -march=native -mcpu=native -Os                 moon/avx/64 moon/avx2/64 moon/xop/64