Implementation notes: amd64, h4atom, crypto_hash/echo256

Computer: h4atom
Architecture: amd64
CPU ID: GenuineIntel-000106ca-bfe9fbff
SUPERCOP version: 20160806
Operation: crypto_hash
Primitive: echo256
Time  Implementation  Compiler  Benchmark date  SUPERCOP version
157552  athlon/athlon64  gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv  20160812  20160806
157560  athlon/athlon64  gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv  20160812  20160806
157576  athlon/athlon64  gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv  20160812  20160806
157592  athlon/athlon64  gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv  20160812  20160806
167224  sphlib  clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments  20160812  20160806
168320  sphlib  gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv  20160812  20160806
168704  sphlib  gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv  20160812  20160806
172088  powerpc/pp32cv2  clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments  20160812  20160806
178680  generic/opt64  gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv  20160812  20160806
178920  generic/opt64  gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv  20160812  20160806
181488  sphlib-small  clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments  20160812  20160806
182080  pentium/pentium4v64v2  gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv  20160812  20160806
182096  pentium/pentium4v64v2  gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv  20160812  20160806
182104  pentium/pentium4v64v2  gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv  20160812  20160806
182120  pentium/pentium4v64v2  gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv  20160812  20160806
183480  pentium/pentium4v64v1  gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv  20160812  20160806
183488  pentium/pentium4v64v1  gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv  20160812  20160806
183488  pentium/pentium4v64v1  gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv  20160812  20160806
183568  pentium/pentium4v64v1  gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv  20160812  20160806
186904  sphlib-small  gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv  20160812  20160806
187608  sphlib  gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv  20160812  20160806
190288  powerpc/pp32cv2  gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv  20160812  20160806
191192  ccalik/bitsliced  gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv  20160812  20160806
198224  sphlib  gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv  20160812  20160806
199552  generic/opt64  gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv  20160812  20160806
200088  sphlib-small  gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv  20160812  20160806
200616  generic/opt32  gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv  20160812  20160806
200792  generic/opt64  clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments  20160812  20160806
201568  ccalik/bitsliced  clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments  20160812  20160806
207352  generic/opt32  clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments  20160812  20160806
208784  core2/core2v64  gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv  20160812  20160806
208792  core2/core2v64  gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv  20160812  20160806
208808  core2/core2v64  gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv  20160812  20160806
208864  core2/core2v64  gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv  20160812  20160806
209280  generic/opt32  gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv  20160812  20160806
209320  ccalik/bitsliced  gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv  20160812  20160806
209520  generic/opt32  gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv  20160812  20160806
209872  ccalik/bitsliced  gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv  20160812  20160806
212584  generic/opt64  gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv  20160812  20160806
214032  ccalik/bitsliced  gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv  20160812  20160806
216848  sphlib-small  gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv  20160812  20160806
217336  powerpc/pp32cv2  gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv  20160812  20160806
227768  sphlib-small  gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv  20160812  20160806
230328  powerpc/pp32cv2  gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv  20160812  20160806
230992  powerpc/pp32cv1  clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments  20160812  20160806
235904  generic/opt32  gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv  20160812  20160806
240712  powerpc/pp32cv1  gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv  20160812  20160806
277016  powerpc/pp32cv1  gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv  20160812  20160806
281384  powerpc/pp32cv1  gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv  20160812  20160806
305504  powerpc/pp32cv2  gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv  20160812  20160806
343544  ccalik/vperm  clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments  20160812  20160806
368176  powerpc/pp32cv1  gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv  20160812  20160806
404056  ccalik/vperm  gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv  20160812  20160806
414224  ccalik/vperm  gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv  20160812  20160806
422216  ccalik/vperm  gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv  20160812  20160806
436488  ccalik/vperm  gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv  20160812  20160806

Test failure

Implementation: crypto_hash/echo256/aes/aes64
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv
error 111

Number of similar (compiler,implementation) pairs: 4, namely:
Compiler  Implementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv  aes/aes64
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv  aes/aes64
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv  aes/aes64
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv  aes/aes64

Compiler output

Implementation: crypto_hash/echo256/pentium/pentium4v64v1
Compiler: clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments
echo32.s: echo32.c:545:2: error: invalid instruction mnemonic 'addd'
echo32.s: addd [SHA3_CNT], 1
echo32.s: ^
echo32.s: echo32.c:545:2: error: invalid instruction mnemonic 'addd'
echo32.s: addd [SHA3_CNT], 1
echo32.s: ^
echo32.s: echo32.c:545:2: error: invalid instruction mnemonic 'addd'
echo32.s: addd [SHA3_CNT], 1
echo32.s: ^
echo32.s: echo32.c:545:2: error: invalid instruction mnemonic 'addd'
echo32.s: addd [SHA3_CNT], 1
echo32.s: ^
echo32.s: echo32.c:546:2: error: invalid instruction mnemonic 'addd'
echo32.s: addd [SHA3_CNT], 1
echo32.s: ^
echo32.s: echo32.c:546:2: error: invalid instruction mnemonic 'addd'
echo32.s: addd [SHA3_CNT], 1
echo32.s: ^
echo32.s: echo32.c:546:2: error: invalid instruction mnemonic 'addd'
echo32.s: addd [SHA3_CNT], 1
echo32.s: ^
echo32.s: echo32.c:546:2: error: invalid instruction mnemonic 'addd'
echo32.s: addd [SHA3_CNT], 1
echo32.s: ^
echo32.s: echo32.c:547:2: error: invalid instruction mnemonic 'addd'
echo32.s: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler  Implementations
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments  pentium/pentium4v64v1

Compiler output

Implementation: crypto_hash/echo256/generic/opt64
Compiler: clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments
echo64.c: echo64.c:501:29: warning: passing 'const unsigned char *' to parameter of type 'BitSequence *' (aka 'unsigned char *') discards qualifiers [-Wincompatible-pointer-types-discards-qualifiers]
echo64.c: if (Hash(CRYPTO_BYTES * 8,in,inlen * 8,out) == SUCCESS) return 0;
echo64.c: ^~
echo64.c: echo64.c:488:46: note: passing argument to parameter 'data' here
echo64.c: HashReturn Hash(int hashbitlen, BitSequence *data, DataLength databitlen, BitSequence *hashval)
echo64.c: ^
echo64.c: 1 warning generated.

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler  Implementations
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments  generic/opt64
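
The warning above arises because the wrapper passes a `const unsigned char *` input to `Hash`, whose `data` parameter is declared as a non-const `BitSequence *`. A minimal sketch of one way to silence it, assuming the hash routine only reads its input, is to const-qualify that parameter. The names `hash_sketch` and `crypto_hash_sketch` and the toy byte-sum body are hypothetical stand-ins, not the ECHO code; only the `BitSequence` typedef mirrors the diagnostic.

```c
#include <stddef.h>

/* Mirrors the typedef named in the diagnostic (aka 'unsigned char'). */
typedef unsigned char BitSequence;

/* Hypothetical stand-in for Hash(): declaring 'data' as pointer-to-const
   lets callers pass const input without discarding qualifiers.
   The body is a toy byte sum, not the ECHO algorithm. */
static int hash_sketch(const BitSequence *data, size_t len, BitSequence *hashval)
{
    unsigned int sum = 0;
    for (size_t i = 0; i < len; i++)
        sum += data[i];
    hashval[0] = (BitSequence)(sum & 0xff);
    return 0; /* 0 signals success in this sketch */
}

/* Hypothetical wrapper shaped like a crypto_hash entry point:
   'in' is const, and no cast is needed to forward it. */
int crypto_hash_sketch(unsigned char *out, const unsigned char *in,
                       unsigned long long inlen)
{
    return hash_sketch(in, (size_t)inlen, out);
}
```

With the parameter const-qualified, the call compiles without the qualifier warning under both gcc and clang; whether the real `Hash` can be changed this way depends on code not shown in this report.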

Compiler output

Implementation: crypto_hash/echo256/core2/core2v64
Compiler: clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments
echo64.s: echo64.c:473:2: error: invalid instruction mnemonic 'movzbl'
echo64.s: movzbl ebp, ah
echo64.s: ^
echo64.s: echo64.c:473:2: error: invalid instruction mnemonic 'movzbl'
echo64.s: movzbl esi, al
echo64.s: ^
echo64.s: echo64.c:473:2: error: invalid instruction mnemonic 'movzbl'
echo64.s: movzbl ebp, bl
echo64.s: ^
echo64.s: echo64.c:473:2: error: invalid instruction mnemonic 'movzbl'
echo64.s: movzbl esi, bh
echo64.s: ^
echo64.s: echo64.c:473:2: error: invalid instruction mnemonic 'movzbl'
echo64.s: movzbl ebp, al
echo64.s: ^
echo64.s: echo64.c:473:2: error: invalid instruction mnemonic 'movzbl'
echo64.s: movzbl esi, ah
echo64.s: ^
echo64.s: echo64.c:473:2: error: invalid instruction mnemonic 'movzbl'
echo64.s: movzbl ebp, bh
echo64.s: ^
echo64.s: echo64.c:473:2: error: invalid instruction mnemonic 'movzbl'
echo64.s: movzbl esi, bl
echo64.s: ^
echo64.s: echo64.c:473:2: error: invalid instruction mnemonic 'movzbl'
echo64.s: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler  Implementations
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments  core2/core2v64

Compiler output

Implementation: crypto_hash/echo256/athlon/athlon64
Compiler: clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments
echo64.s: echo64.c:519:2: error: invalid instruction mnemonic 'movzbl'
echo64.s: movzbl ebp, ah
echo64.s: ^
echo64.s: echo64.c:519:2: error: invalid instruction mnemonic 'movzbl'
echo64.s: movzbl esi, al
echo64.s: ^
echo64.s: echo64.c:519:2: error: invalid instruction mnemonic 'movzbl'
echo64.s: movzbl ebp, bl
echo64.s: ^
echo64.s: echo64.c:519:2: error: invalid instruction mnemonic 'movzbl'
echo64.s: movzbl esi, bh
echo64.s: ^
echo64.s: echo64.c:519:2: error: invalid instruction mnemonic 'movzbl'
echo64.s: movzbl ebp, al
echo64.s: ^
echo64.s: echo64.c:519:2: error: invalid instruction mnemonic 'movzbl'
echo64.s: movzbl esi, ah
echo64.s: ^
echo64.s: echo64.c:519:2: error: invalid instruction mnemonic 'movzbl'
echo64.s: movzbl ebp, bh
echo64.s: ^
echo64.s: echo64.c:519:2: error: invalid instruction mnemonic 'movzbl'
echo64.s: movzbl esi, bl
echo64.s: ^
echo64.s: echo64.c:519:2: error: invalid instruction mnemonic 'movzbl'
echo64.s: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler  Implementations
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments  athlon/athlon64

Compiler output

Implementation: crypto_hash/echo256/pentium/pentium4v64v2
Compiler: clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments
echo64.s: echo64.c:571:2: error: invalid instruction mnemonic 'addd'
echo64.s: addd [SHA3_CNT], 1
echo64.s: ^
echo64.s: echo64.c:571:2: error: invalid instruction mnemonic 'addd'
echo64.s: addd [SHA3_CNT], 1
echo64.s: ^
echo64.s: echo64.c:571:2: error: invalid instruction mnemonic 'addd'
echo64.s: addd [SHA3_CNT], 1
echo64.s: ^
echo64.s: echo64.c:571:2: error: invalid instruction mnemonic 'addd'
echo64.s: addd [SHA3_CNT], 1
echo64.s: ^
echo64.s: echo64.c:572:2: error: invalid instruction mnemonic 'addd'
echo64.s: addd [SHA3_CNT], 1
echo64.s: ^
echo64.s: echo64.c:572:2: error: invalid instruction mnemonic 'addd'
echo64.s: addd [SHA3_CNT], 1
echo64.s: ^
echo64.s: echo64.c:572:2: error: invalid instruction mnemonic 'addd'
echo64.s: addd [SHA3_CNT], 1
echo64.s: ^
echo64.s: echo64.c:572:2: error: invalid instruction mnemonic 'addd'
echo64.s: addd [SHA3_CNT], 1
echo64.s: ^
echo64.s: echo64.c:573:2: error: invalid instruction mnemonic 'addd'
echo64.s: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler  Implementations
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments  pentium/pentium4v64v2

Compiler output

Implementation: crypto_hash/echo256/aes/aes64
Compiler: clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments
echo64.s: echo64.c:636:2: error: '.att_syntax noprefix' is not supported: registers must have a '%' prefix in .att_syntax
echo64.s: .att_syntax noprefix
echo64.s: ^
echo64.s: echo64.c:636:2: error: unknown directive
echo64.s: .att_syntax noprefix
echo64.s: ^
echo64.s: echo64.s:1053:10: error: invalid token in expression
echo64.s: movabsq $-9187201950435737472, %rax
echo64.s: ^
echo64.s: echo64.s:1053:11: error: unknown token in expression
echo64.s: movabsq $-9187201950435737472, %rax
echo64.s: ^
echo64.s: echo64.s:1054:10: error: invalid token in expression
echo64.s: movabsq $-9187201950435737472, %r11
echo64.s: ^
echo64.s: echo64.s:1054:11: error: unknown token in expression
echo64.s: movabsq $-9187201950435737472, %r11
echo64.s: ^
echo64.s: echo64.s:1055:10: error: invalid token in expression
echo64.s: movabsq $7599940337074203, %r10
echo64.s: ^
echo64.s: echo64.s:1055:11: error: unknown token in expression
echo64.s: movabsq $7599940337074203, %r10
echo64.s: ^
echo64.s: echo64.s:1056:7: error: unknown token in expression
echo64.s: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler  Implementations
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments  aes/aes64

Compiler output

Implementation: crypto_hash/echo256/ccalik/aesni
Compiler: clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments
hash.c: hash.c:295:4: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'Compress' that is compiled without support for 'aes'
hash.c: ECHO_ROUND_UNROLL2;
hash.c: ^
hash.c: hash.c:185:4: note: expanded from macro 'ECHO_ROUND_UNROLL2'
hash.c: ECHO_SUBBYTES(_state, 0, 0);\
hash.c: ^
hash.c: hash.c:114:19: note: expanded from macro 'ECHO_SUBBYTES'
hash.c: state[i][j] = _mm_aesenc_si128(state[i][j], k1);\
hash.c: ^
hash.c: hash.c:295:4: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'Compress' that is compiled without support for 'aes'
hash.c: hash.c:185:4: note: expanded from macro 'ECHO_ROUND_UNROLL2'
hash.c: ECHO_SUBBYTES(_state, 0, 0);\
hash.c: ^
hash.c: hash.c:115:19: note: expanded from macro 'ECHO_SUBBYTES'
hash.c: state[i][j] = _mm_aesenc_si128(state[i][j], M128(zero));\
hash.c: ^
hash.c: hash.c:295:4: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'Compress' that is compiled without support for 'aes'
hash.c: hash.c:186:4: note: expanded from macro 'ECHO_ROUND_UNROLL2'
hash.c: ECHO_SUBBYTES(_state, 1, 0);\
hash.c: ^
hash.c: hash.c:114:19: note: expanded from macro 'ECHO_SUBBYTES'
hash.c: state[i][j] = _mm_aesenc_si128(state[i][j], k1);\
hash.c: ^
hash.c: hash.c:295:4: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'Compress' that is compiled without support for 'aes'
hash.c: hash.c:186:4: note: expanded from macro 'ECHO_ROUND_UNROLL2'
hash.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler  Implementations
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments  ccalik/aesni
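
The compiler rejects `_mm_aesenc_si128` here because the AES target feature is not enabled for this build; `-march=native` enables it only when the host CPU reports it. A minimal sketch of a compile-time guard follows, assuming the GCC/clang convention that the macro `__AES__` is predefined when AES instructions are enabled (for example via `-maes`). The function name `aesni_compiled_in` is hypothetical and is not part of SUPERCOP or the ECHO sources.

```c
/* Returns 1 if this translation unit was compiled with AES instructions
   enabled (e.g. via -maes), 0 otherwise. An AES-NI code path could be
   wrapped in the same #ifdef so it is skipped on builds without it. */
int aesni_compiled_in(void)
{
#ifdef __AES__
    return 1;
#else
    return 0;
#endif
}
```

This only reflects how the file was compiled; it does not check at run time whether the CPU supports the instructions.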

Compiler output

Implementation: crypto_hash/echo256/generic/opt64
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv
echo64.c: echo64.c: In function 'crypto_hash_echo256_generic_opt64':
echo64.c: echo64.c:501:29: warning: passing argument 2 of 'Hash' discards 'const' qualifier from pointer target type [-Wdiscarded-qualifiers]
echo64.c: if (Hash(CRYPTO_BYTES * 8,in,inlen * 8,out) == SUCCESS) return 0;
echo64.c: ^
echo64.c: echo64.c:488:12: note: expected 'BitSequence * {aka unsigned char *}' but argument is of type 'const unsigned char *'
echo64.c: HashReturn Hash(int hashbitlen, BitSequence *data, DataLength databitlen, BitSequence *hashval)
echo64.c: ^

Number of similar (compiler,implementation) pairs: 4, namely:
Compiler  Implementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv  generic/opt64
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv  generic/opt64
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv  generic/opt64
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv  generic/opt64

Compiler output

Implementation: crypto_hash/echo256/ccalik/aesni
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv
hash.c: In file included from hash.c:25:0:
hash.c: hash.c: In function 'Compress':
hash.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:61:1: error: inlining failed in call to always_inline '_mm_aesenc_si128': target specific option mismatch
hash.c: _mm_aesenc_si128 (__m128i __X, __m128i __Y)
hash.c: ^
hash.c: hash.c:115:19: error: called from here
hash.c: state[i][j] = _mm_aesenc_si128(state[i][j], M128(zero));\
hash.c: ^
hash.c: hash.c:220:4: note: in expansion of macro 'ECHO_SUBBYTES'
hash.c: ECHO_SUBBYTES(_state2, 3, 3);\
hash.c: ^
hash.c: hash.c:295:4: note: in expansion of macro 'ECHO_ROUND_UNROLL2'
hash.c: ECHO_ROUND_UNROLL2;
hash.c: ^
hash.c: In file included from hash.c:25:0:
hash.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/wmmintrin.h:61:1: error: inlining failed in call to always_inline '_mm_aesenc_si128': target specific option mismatch
hash.c: _mm_aesenc_si128 (__m128i __X, __m128i __Y)
hash.c: ^
hash.c: hash.c:114:19: error: called from here
hash.c: state[i][j] = _mm_aesenc_si128(state[i][j], k1);\
hash.c: ^
hash.c: hash.c:220:4: note: in expansion of macro 'ECHO_SUBBYTES'
hash.c: ECHO_SUBBYTES(_state2, 3, 3);\
hash.c: ^
hash.c: hash.c:295:4: note: in expansion of macro 'ECHO_ROUND_UNROLL2'
hash.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
Compiler  Implementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv  ccalik/aesni
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv  ccalik/aesni
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv  ccalik/aesni
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv  ccalik/aesni