Implementation notes: amd64, scw1b63b1, crypto_hash/keccakc512treed2

Computer: scw1b63b1
Architecture: amd64
CPU ID: GenuineIntel-000506f1-0f8bfbff
SUPERCOP version: 20191017
Operation: crypto_hash
Primitive: keccakc512treed2
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
17724149250 0 0158399 784 776sseu24gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2019121520191017
1813635206 0 046153 784 736sseu4clang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2019121520191017
1813835206 0 045329 784 736sseu4clang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2019121520191017
1814035206 0 045329 784 736sseu4clang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2019121520191017
1817234627 0 043819 776 736sseu4clang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2019121520191017
1868237413 0 049908 808 808sseu4gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2019121520191017
1871236976 0 047307 800 808sseu4gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2019121520191017
1963831980 0 041135 784 776sseu4gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2019121520191017
2021436912 0 047035 800 808sseu4gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2019121520191017
22194179318 0 0188001 784 736sseu24clang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2019121520191017
22338179318 0 0188001 784 736sseu24clang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2019121520191017
22386178728 0 0186507 776 736sseu24clang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2019121520191017
22722179318 0 0188825 784 736sseu24clang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2019121520191017
26808191440 0 0201771 800 808sseu24gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2019121520191017
26824191369 0 0201483 800 808sseu24gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2019121520191017
26866191877 0 0204372 808 808sseu24gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2019121520191017

Compiler output

Implementation: sseu24
Security model: unknown
Compiler: clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
DoublePermutation.c: DoublePermutation.c:507:5: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'KeccakDoublePermutationOnWords' that is compiled without support for 'ssse3'
DoublePermutation.c: rounds
DoublePermutation.c: ^
DoublePermutation.c: ./KeccakF-1600-unrolling.macros:17:5: note: expanded from macro 'rounds'
DoublePermutation.c: thetaRhoPiChiIotaPrepareTheta( 0, A, E) ^
DoublePermutation.c: DoublePermutation.c:136:11: note: expanded from macro 'thetaRhoPiChiIotaPrepareTheta'
DoublePermutation.c: Bko = ROL64in128_8(A##mu); ^
DoublePermutation.c: DoublePermutation.c:38:37: note: expanded from macro 'ROL64in128_8'
DoublePermutation.c: ...
DoublePermutation.c: thetaRhoPiChiIotaPrepareTheta( 9, E, A) ^
DoublePermutation.c: DoublePermutation.c:136:11: note: expanded from macro 'thetaRhoPiChiIotaPrepareTheta'
DoublePermutation.c: Bko = ROL64in128_8(A##mu); ^
DoublePermutation.c: DoublePermutation.c:38:37: note: expanded from macro 'ROL64in128_8'
DoublePermutation.c: #define ROL64in128_8(a) _mm_shuffle_epi8(a, CONST128(rho8))
DoublePermutation.c: ^
DoublePermutation.c: fatal error: too many errors emitted, stopping now [-ferror-limit=]
DoublePermutation.c: 20 errors generated.

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE sseu24

Compiler output

Implementation: sseu4
Security model: unknown
Compiler: clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
DoublePermutation.c: DoublePermutation.c:507:5: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'KeccakDoublePermutationOnWords' that is compiled without support for 'ssse3'
DoublePermutation.c: rounds
DoublePermutation.c: ^
DoublePermutation.c: ./KeccakF-1600-unrolling.macros:90:9: note: expanded from macro 'rounds'
DoublePermutation.c: thetaRhoPiChiIotaPrepareTheta(i , A, E) ^
DoublePermutation.c: DoublePermutation.c:136:11: note: expanded from macro 'thetaRhoPiChiIotaPrepareTheta'
DoublePermutation.c: Bko = ROL64in128_8(A##mu); ^
DoublePermutation.c: DoublePermutation.c:38:37: note: expanded from macro 'ROL64in128_8'
DoublePermutation.c: ...
DoublePermutation.c: ./KeccakF-1600-unrolling.macros:93:9: note: expanded from macro 'rounds'
DoublePermutation.c: thetaRhoPiChiIotaPrepareTheta(i+3, E, A) ^
DoublePermutation.c: DoublePermutation.c:161:11: note: expanded from macro 'thetaRhoPiChiIotaPrepareTheta'
DoublePermutation.c: Bmu = ROL64in128_56(A##so); ^
DoublePermutation.c: DoublePermutation.c:39:37: note: expanded from macro 'ROL64in128_56'
DoublePermutation.c: #define ROL64in128_56(a) _mm_shuffle_epi8(a, CONST128(rho56))
DoublePermutation.c: ^
DoublePermutation.c: 8 errors generated.

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE sseu4

Compiler output

Implementation: xopu24
Security model: unknown
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
DoublePermutation.c: DoublePermutation.c:507:5: error: '__builtin_ia32_vprotqi' needs target feature xop
DoublePermutation.c: rounds
DoublePermutation.c: ^
DoublePermutation.c: ./KeccakF-1600-unrolling.macros:17:5: note: expanded from macro 'rounds'
DoublePermutation.c: thetaRhoPiChiIotaPrepareTheta( 0, A, E) ^
DoublePermutation.c: DoublePermutation.c:78:21: note: expanded from macro 'thetaRhoPiChiIotaPrepareTheta'
DoublePermutation.c: Da = XOR128(Cu, ROL64in128(Ce, 1)); ^
DoublePermutation.c: DoublePermutation.c:33:37: note: expanded from macro 'ROL64in128'
DoublePermutation.c: ...
DoublePermutation.c: Bku = ROL64in128(A##sa, 18); ^
DoublePermutation.c: DoublePermutation.c:33:37: note: expanded from macro 'ROL64in128'
DoublePermutation.c: #define ROL64in128(a, o) _mm_roti_epi64(a, o)
DoublePermutation.c: ^
DoublePermutation.c: /usr/lib/llvm-3.8/bin/../lib/clang/3.8.1/include/xopintrin.h:250:12: note: expanded from macro '_mm_roti_epi64'
DoublePermutation.c: (__m128i)__builtin_ia32_vprotqi((__v2di)(__m128i)(A), (N)); })
DoublePermutation.c: ^
DoublePermutation.c: fatal error: too many errors emitted, stopping now [-ferror-limit=]
DoublePermutation.c: 20 errors generated.

Number of similar (compiler,implementation) pairs: 5, namely:
CompilerImplementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE xopu24
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE xopu24
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE xopu24
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE xopu24
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE xopu24

Compiler output

Implementation: xopu24
Security model: unknown
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE
DoublePermutation.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/6/include/x86intrin.h:54:0,
DoublePermutation.c: from DoublePermutation.c:17:
DoublePermutation.c: DoublePermutation.c: In function ‘KeccakDoublePermutationOnWords’:
DoublePermutation.c: /usr/lib/gcc/x86_64-linux-gnu/6/include/xopintrin.h:266:1: error: inlining failed in call to always_inline ‘_mm_roti_epi64’: target specific option mismatch
DoublePermutation.c: _mm_roti_epi64(__m128i __A, const int __B)
DoublePermutation.c: ^~~~~~~~~~~~~~
DoublePermutation.c: DoublePermutation.c:276:9: note: called from here
DoublePermutation.c: Bsu = ROL64in128(A##se, 2); ^
DoublePermutation.c: KeccakF-1600-unrolling.macros:40:5: note: in expansion of macro ‘thetaRhoPiChiIota’
DoublePermutation.c: ...
DoublePermutation.c: ^~~~~~~~~~~~~~
DoublePermutation.c: DoublePermutation.c:276:9: note: called from here
DoublePermutation.c: Bsu = ROL64in128(A##se, 2); ^
DoublePermutation.c: KeccakF-1600-unrolling.macros:40:5: note: in expansion of macro ‘thetaRhoPiChiIota’
DoublePermutation.c: thetaRhoPiChiIota(23, E, A) ^~~~~~~~~~~~~~~~~
DoublePermutation.c: DoublePermutation.c:507:5: note: in expansion of macro ‘rounds’
DoublePermutation.c: rounds
DoublePermutation.c: ^~~~~~

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE xopu24
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE xopu24
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE xopu24
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE xopu24

Namespace violations

Implementation: sseu24
Security model: unknown
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
DoublePermutation.o KeccakDoublePermutationOnWords T
DoublePermutation.o KeccakDoublePermutationOnWordsAfterXoring2x1024bits T
DoublePermutation.o KeccakDoublePermutationOnWordsAfterXoring2x1088bits T
DoublePermutation.o KeccakDoublePermutationOnWordsAfterXoring2x1344bits T
DoublePermutation.o KeccakDoublePermutationOnWordsAfterXoring2x512bits T
DoublePermutation.o KeccakF1600DoubleRoundConstants R
DoublePermutation.o KeccakInitializeDoubleState T
DoublePermutation.o rho56 R
DoublePermutation.o rho8 R

Number of similar (compiler,implementation) pairs: 16, namely:
CompilerImplementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE sseu24
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE sseu24
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE sseu24
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE sseu24
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE sseu24
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE sseu24
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE sseu24
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE sseu24
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE sseu4
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE sseu4
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE sseu4
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE sseu4
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE sseu4
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE sseu4
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE sseu4
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE sseu4