Implementation notes: amd64, waldorf, crypto_hash/echo256

Computer: waldorf
Architecture: amd64
CPU ID: GenuineIntel-000106e5-bfebfbff
SUPERCOP version: 20160715
Operation: crypto_hash
Primitive: echo256
TimeImplementationCompilerBenchmark dateSUPERCOP version
77484core2/core2v64gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv2016071820160715
80236core2/core2v64gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv2016071820160715
83996athlon/athlon64gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv2016071820160715
84272core2/core2v64gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv2016071820160715
86160athlon/athlon64gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv2016071820160715
88676core2/core2v64gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv2016071820160715
90160athlon/athlon64gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv2016071820160715
92568athlon/athlon64gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv2016071820160715
99932generic/opt64gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv2016071820160715
103956sphlibgcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv2016071820160715
104504sphlibgcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv2016071820160715
104516sphlib-smallgcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv2016071820160715
105076sphlibgcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv2016071820160715
107680ccalik/vpermclang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments2016071820160715
107872generic/opt64gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv2016071820160715
108956generic/opt64gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv2016071820160715
109116sphlibgcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv2016071820160715
111844ccalik/vpermgcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv2016071820160715
112632sphlib-smallgcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv2016071820160715
113952pentium/pentium4v64v1gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv2016071820160715
113956ccalik/bitslicedgcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv2016071820160715
114620pentium/pentium4v64v2gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv2016071820160715
114700ccalik/vpermgcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv2016071820160715
115220pentium/pentium4v64v2gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv2016071820160715
116608ccalik/bitslicedgcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv2016071820160715
117072ccalik/bitslicedgcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv2016071820160715
117868pentium/pentium4v64v1gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv2016071820160715
118024sphlib-smallgcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv2016071820160715
119164pentium/pentium4v64v1gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv2016071820160715
119324pentium/pentium4v64v1gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv2016071820160715
119616ccalik/vpermgcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv2016071820160715
120172ccalik/vpermgcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv2016071820160715
120208sphlibclang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments2016071820160715
120408pentium/pentium4v64v2gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv2016071820160715
120888pentium/pentium4v64v2gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv2016071820160715
123028generic/opt64gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv2016071820160715
123364generic/opt32gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv2016071820160715
124956ccalik/bitslicedgcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv2016071820160715
125852sphlib-smallclang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments2016071820160715
126268generic/opt32gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv2016071820160715
126876generic/opt32gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv2016071820160715
131196generic/opt32gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv2016071820160715
131884sphlib-smallgcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv2016071820160715
132900generic/opt32clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments2016071820160715
134428ccalik/bitslicedclang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments2016071820160715
142656powerpc/pp32cv2clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments2016071820160715
165004powerpc/pp32cv2gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv2016071820160715
180544generic/opt64clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments2016071820160715
182296powerpc/pp32cv1clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments2016071820160715
185816powerpc/pp32cv2gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv2016071820160715
194188powerpc/pp32cv2gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv2016071820160715
197908powerpc/pp32cv2gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv2016071820160715
204640powerpc/pp32cv1gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv2016071820160715
243308powerpc/pp32cv1gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv2016071820160715
244216powerpc/pp32cv1gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv2016071820160715
255244powerpc/pp32cv1gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv2016071820160715

Test failure

Implementation: crypto_hash/echo256/aes/aes64
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv
error 111

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv aes/aes64
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv aes/aes64
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv aes/aes64
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv aes/aes64

Compiler output

Implementation: crypto_hash/echo256/pentium/pentium4v64v1
Compiler: clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments
echo32.s: echo32.c:545:2: error: invalid instruction mnemonic 'addd'
echo32.s: addd [SHA3_CNT], 1
echo32.s: ^
echo32.s: echo32.c:545:2: error: invalid instruction mnemonic 'addd'
echo32.s: addd [SHA3_CNT], 1
echo32.s: ^
echo32.s: echo32.c:545:2: error: invalid instruction mnemonic 'addd'
echo32.s: addd [SHA3_CNT], 1
echo32.s: ^
echo32.s: echo32.c:545:2: error: invalid instruction mnemonic 'addd'
echo32.s: addd [SHA3_CNT], 1
echo32.s: ^
echo32.s: echo32.c:546:2: error: invalid instruction mnemonic 'addd'
echo32.s: addd [SHA3_CNT], 1
echo32.s: ^
echo32.s: echo32.c:546:2: error: invalid instruction mnemonic 'addd'
echo32.s: addd [SHA3_CNT], 1
echo32.s: ^
echo32.s: echo32.c:546:2: error: invalid instruction mnemonic 'addd'
echo32.s: addd [SHA3_CNT], 1
echo32.s: ^
echo32.s: echo32.c:546:2: error: invalid instruction mnemonic 'addd'
echo32.s: addd [SHA3_CNT], 1
echo32.s: ^
echo32.s: echo32.c:547:2: error: invalid instruction mnemonic 'addd'
echo32.s: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments pentium/pentium4v64v1

Compiler output

Implementation: crypto_hash/echo256/generic/opt64
Compiler: clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments
echo64.c: echo64.c:501:29: warning: passing 'const unsigned char *' to parameter of type 'BitSequence *' (aka 'unsigned char *') discards qualifiers [-Wincompatible-pointer-types-discards-qualifiers]
echo64.c: if (Hash(CRYPTO_BYTES * 8,in,inlen * 8,out) == SUCCESS) return 0;
echo64.c: ^~
echo64.c: echo64.c:488:46: note: passing argument to parameter 'data' here
echo64.c: HashReturn Hash(int hashbitlen, BitSequence *data, DataLength databitlen, BitSequence *hashval)
echo64.c: ^
echo64.c: 1 warning generated.

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments generic/opt64

Compiler output

Implementation: crypto_hash/echo256/core2/core2v64
Compiler: clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments
echo64.s: echo64.c:473:2: error: invalid instruction mnemonic 'movzbl'
echo64.s: movzbl ebp, ah
echo64.s: ^
echo64.s: echo64.c:473:2: error: invalid instruction mnemonic 'movzbl'
echo64.s: movzbl esi, al
echo64.s: ^
echo64.s: echo64.c:473:2: error: invalid instruction mnemonic 'movzbl'
echo64.s: movzbl ebp, bl
echo64.s: ^
echo64.s: echo64.c:473:2: error: invalid instruction mnemonic 'movzbl'
echo64.s: movzbl esi, bh
echo64.s: ^
echo64.s: echo64.c:473:2: error: invalid instruction mnemonic 'movzbl'
echo64.s: movzbl ebp, al
echo64.s: ^
echo64.s: echo64.c:473:2: error: invalid instruction mnemonic 'movzbl'
echo64.s: movzbl esi, ah
echo64.s: ^
echo64.s: echo64.c:473:2: error: invalid instruction mnemonic 'movzbl'
echo64.s: movzbl ebp, bh
echo64.s: ^
echo64.s: echo64.c:473:2: error: invalid instruction mnemonic 'movzbl'
echo64.s: movzbl esi, bl
echo64.s: ^
echo64.s: echo64.c:473:2: error: invalid instruction mnemonic 'movzbl'
echo64.s: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments core2/core2v64

Compiler output

Implementation: crypto_hash/echo256/athlon/athlon64
Compiler: clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments
echo64.s: echo64.c:519:2: error: invalid instruction mnemonic 'movzbl'
echo64.s: movzbl ebp, ah
echo64.s: ^
echo64.s: echo64.c:519:2: error: invalid instruction mnemonic 'movzbl'
echo64.s: movzbl esi, al
echo64.s: ^
echo64.s: echo64.c:519:2: error: invalid instruction mnemonic 'movzbl'
echo64.s: movzbl ebp, bl
echo64.s: ^
echo64.s: echo64.c:519:2: error: invalid instruction mnemonic 'movzbl'
echo64.s: movzbl esi, bh
echo64.s: ^
echo64.s: echo64.c:519:2: error: invalid instruction mnemonic 'movzbl'
echo64.s: movzbl ebp, al
echo64.s: ^
echo64.s: echo64.c:519:2: error: invalid instruction mnemonic 'movzbl'
echo64.s: movzbl esi, ah
echo64.s: ^
echo64.s: echo64.c:519:2: error: invalid instruction mnemonic 'movzbl'
echo64.s: movzbl ebp, bh
echo64.s: ^
echo64.s: echo64.c:519:2: error: invalid instruction mnemonic 'movzbl'
echo64.s: movzbl esi, bl
echo64.s: ^
echo64.s: echo64.c:519:2: error: invalid instruction mnemonic 'movzbl'
echo64.s: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments athlon/athlon64

Compiler output

Implementation: crypto_hash/echo256/pentium/pentium4v64v2
Compiler: clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments
echo64.s: echo64.c:571:2: error: invalid instruction mnemonic 'addd'
echo64.s: addd [SHA3_CNT], 1
echo64.s: ^
echo64.s: echo64.c:571:2: error: invalid instruction mnemonic 'addd'
echo64.s: addd [SHA3_CNT], 1
echo64.s: ^
echo64.s: echo64.c:571:2: error: invalid instruction mnemonic 'addd'
echo64.s: addd [SHA3_CNT], 1
echo64.s: ^
echo64.s: echo64.c:571:2: error: invalid instruction mnemonic 'addd'
echo64.s: addd [SHA3_CNT], 1
echo64.s: ^
echo64.s: echo64.c:572:2: error: invalid instruction mnemonic 'addd'
echo64.s: addd [SHA3_CNT], 1
echo64.s: ^
echo64.s: echo64.c:572:2: error: invalid instruction mnemonic 'addd'
echo64.s: addd [SHA3_CNT], 1
echo64.s: ^
echo64.s: echo64.c:572:2: error: invalid instruction mnemonic 'addd'
echo64.s: addd [SHA3_CNT], 1
echo64.s: ^
echo64.s: echo64.c:572:2: error: invalid instruction mnemonic 'addd'
echo64.s: addd [SHA3_CNT], 1
echo64.s: ^
echo64.s: echo64.c:573:2: error: invalid instruction mnemonic 'addd'
echo64.s: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments pentium/pentium4v64v2

Compiler output

Implementation: crypto_hash/echo256/aes/aes64
Compiler: clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments
echo64.s: echo64.c:636:14: error: invalid instruction mnemonic 'noprefix'
echo64.s: .att_syntax noprefix
echo64.s: ^

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments aes/aes64

Compiler output

Implementation: crypto_hash/echo256/ccalik/aesni
Compiler: clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments
hash.c: In file included from hash.c:25:
hash.c: /usr/include/clang/3.5.0/include/wmmintrin.h:30:3: error: "AES/PCLMUL instructions not enabled"
hash.c: # error "AES/PCLMUL instructions not enabled"
hash.c: ^
hash.c: hash.c:295:4: warning: implicit declaration of function '_mm_aesenc_si128' is invalid in C99 [-Wimplicit-function-declaration]
hash.c: ECHO_ROUND_UNROLL2;
hash.c: ^
hash.c: hash.c:185:4: note: expanded from macro 'ECHO_ROUND_UNROLL2'
hash.c: ECHO_SUBBYTES(_state, 0, 0);\
hash.c: ^
hash.c: hash.c:114:19: note: expanded from macro 'ECHO_SUBBYTES'
hash.c: state[i][j] = _mm_aesenc_si128(state[i][j], k1);\
hash.c: ^
hash.c: hash.c:295:4: error: assigning to '__m128i' (vector of 2 'long long' values) from incompatible type 'int'
hash.c: ECHO_ROUND_UNROLL2;
hash.c: ^~~~~~~~~~~~~~~~~~
hash.c: hash.c:185:4: note: expanded from macro 'ECHO_ROUND_UNROLL2'
hash.c: ECHO_SUBBYTES(_state, 0, 0);\
hash.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~
hash.c: hash.c:114:17: note: expanded from macro 'ECHO_SUBBYTES'
hash.c: state[i][j] = _mm_aesenc_si128(state[i][j], k1);\
hash.c: ^ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
hash.c: hash.c:295:4: error: assigning to '__m128i' (vector of 2 'long long' values) from incompatible type 'int'
hash.c: ECHO_ROUND_UNROLL2;
hash.c: ^~~~~~~~~~~~~~~~~~
hash.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments ccalik/aesni

Compiler output

Implementation: crypto_hash/echo256/generic/opt64
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv
echo64.c: echo64.c: In function 'crypto_hash_echo256_generic_opt64':
echo64.c: echo64.c:501:29: warning: passing argument 2 of 'Hash' discards 'const' qualifier from pointer target type
echo64.c: if (Hash(CRYPTO_BYTES * 8,in,inlen * 8,out) == SUCCESS) return 0;
echo64.c: ^
echo64.c: echo64.c:488:12: note: expected 'BitSequence *' but argument is of type 'const unsigned char *'
echo64.c: HashReturn Hash(int hashbitlen, BitSequence *data, DataLength databitlen, BitSequence *hashval)
echo64.c: ^

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv generic/opt64
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv generic/opt64
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv generic/opt64
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv generic/opt64

Compiler output

Implementation: crypto_hash/echo256/ccalik/aesni
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv
hash.c: In file included from hash.c:25:0:
hash.c: hash.c: In function 'Compress':
hash.c: /usr/lib/gcc/x86_64-linux-gnu/4.9/include/wmmintrin.h:61:1: error: inlining failed in call to always_inline '_mm_aesenc_si128': target specific option mismatch
hash.c: _mm_aesenc_si128 (__m128i __X, __m128i __Y)
hash.c: ^
hash.c: hash.c:114:19: error: called from here
hash.c: state[i][j] = _mm_aesenc_si128(state[i][j], k1);\
hash.c: ^
hash.c: hash.c:185:4: note: in expansion of macro 'ECHO_SUBBYTES'
hash.c: ECHO_SUBBYTES(_state, 0, 0);\
hash.c: ^
hash.c: hash.c:295:4: note: in expansion of macro 'ECHO_ROUND_UNROLL2'
hash.c: ECHO_ROUND_UNROLL2;
hash.c: ^
hash.c: In file included from hash.c:25:0:
hash.c: /usr/lib/gcc/x86_64-linux-gnu/4.9/include/wmmintrin.h:61:1: error: inlining failed in call to always_inline '_mm_aesenc_si128': target specific option mismatch
hash.c: _mm_aesenc_si128 (__m128i __X, __m128i __Y)
hash.c: ^
hash.c: hash.c:115:19: error: called from here
hash.c: state[i][j] = _mm_aesenc_si128(state[i][j], M128(zero));\
hash.c: ^
hash.c: hash.c:185:4: note: in expansion of macro 'ECHO_SUBBYTES'
hash.c: ECHO_SUBBYTES(_state, 0, 0);\
hash.c: ^
hash.c: hash.c:295:4: note: in expansion of macro 'ECHO_ROUND_UNROLL2'
hash.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv ccalik/aesni
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv ccalik/aesni
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv ccalik/aesni
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv ccalik/aesni