Implementation notes: amd64, waldorf, crypto_hash/simd512

Computer: waldorf
Architecture: amd64
CPU ID: GenuineIntel-000106e5-bfebfbff
SUPERCOP version: 20160715
Operation: crypto_hash
Primitive: simd512
TimeImplementationCompilerBenchmark dateSUPERCOP version
32884vect128gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv2016071820160715
34144vect128gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv2016071820160715
36604vect128gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv2016071820160715
36828vect128gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv2016071820160715
123620optgcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv2016071820160715
135344sphlibgcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv2016071820160715
147216sphlibclang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments2016071820160715
160028sphlibgcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv2016071820160715
166640sphlibgcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv2016071820160715
171280sphlib-smallgcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv2016071820160715
176828sphlibgcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv2016071820160715
200616sphlib-smallclang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments2016071820160715
215872sphlib-smallgcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv2016071820160715
218532sphlib-smallgcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv2016071820160715
223044optgcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv2016071820160715
235308optclang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments2016071820160715
244724sphlib-smallgcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv2016071820160715
250044optgcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv2016071820160715
264136optgcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv2016071820160715
23118244refclang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments2016071820160715
24842412refgcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv2016071820160715
27117752refgcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv2016071820160715
27177816refgcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv2016071820160715
28214268refgcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv2016071820160715

Compiler output

Implementation: crypto_hash/simd512/ref
Compiler: clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments
reference.c: reference.c:69:82: warning: expression result unused [-Wunused-value]
reference.c: state->gt;A[j] = state->gt;D[j] + w[j] + F(state->gt;A[j], state->gt;B[j], state->gt;C[j]), s;
reference.c: ^
reference.c: 1 warning generated.

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments ref

Compiler output

Implementation: crypto_hash/simd512/vect128
Compiler: clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments
vector.c: vector.c:73:9: warning: 'X' macro redefined [-Wmacro-redefined]
vector.c: #define X(i) X##i
vector.c: ^
vector.c: vector.c:68:9: note: previous definition is here
vector.c: #define X(i) A[i]
vector.c: ^
vector.c: vector.c:129:3: error: use of unknown builtin '__builtin_ia32_pcmpgtw128' [-Wimplicit-function-declaration]
vector.c: DO_REDUCE_FULL_S(0);
vector.c: ^
vector.c: vector.c:56:12: note: expanded from macro 'DO_REDUCE_FULL_S'
vector.c: X(i) = EXTRA_REDUCE_S(X(i)); \
vector.c: ^
vector.c: vector.c:42:32: note: expanded from macro 'EXTRA_REDUCE_S'
vector.c: v16_sub(x, v16_and(V257.v16, v16_cmp(x, V128.v16)))
vector.c: ^
vector.c: ./vector.h:92:22: note: expanded from macro 'v16_cmp'
vector.c: #define v16_cmp __builtin_ia32_pcmpgtw128
vector.c: ^
vector.c: ./vector.h:68:28: note: expanded from macro 'vec_and'
vector.c: #define vec_and(x,y) ((x)&(y))
vector.c: ^
vector.c: ./vector.h:87:28: note: expanded from macro 'v16_sub'
vector.c: #define v16_sub(x,y) ((x)-(y))
vector.c: ^
vector.c: vector.c:129:3: error: can't convert between vector values of different size ('v16' (aka 'v8hi') and 'int')
vector.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments vect128

Compiler output

Implementation: crypto_hash/simd512/vect128
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv
vector.c: vector.c: In function 'fft64':
vector.c: vector.c:73:0: warning: "X" redefined
vector.c: #define X(i) X##i
vector.c: ^
vector.c: vector.c:68:0: note: this is the location of the previous definition
vector.c: #define X(i) A[i]
vector.c: ^
vector.c: vector.c: In function 'rounds512':
vector.c: vector.c:796:0: warning: "STEP_1" redefined
vector.c: #define STEP_1(a,b,c,d,w,fun,r,s,z) \
vector.c: ^
vector.c: vector.c:542:0: note: this is the location of the previous definition
vector.c: #define STEP_1(a,b,c,d,w,fun,r,s,z) \
vector.c: ^
vector.c: vector.c:805:0: warning: "STEP_2" redefined
vector.c: #define STEP_2(a,b,c,d,w,fun,r,s) \
vector.c: ^
vector.c: vector.c:566:0: note: this is the location of the previous definition
vector.c: #define STEP_2(a,b,c,d,w,fun,r,s) \
vector.c: ^
vector.c: vector.c:808:0: warning: "STEP" redefined
vector.c: #define STEP(a,b,c,d,w1,w2,fun,r,s,z) \
vector.c: ^
vector.c: vector.c:571:0: note: this is the location of the previous definition
vector.c: #define STEP(a,b,c,d,w,fun,r,s,z) \
vector.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv vect128
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv vect128
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv vect128
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv vect128