Implementation notes: aarch64, hikey960, crypto_sign/sphincss128shake256simple

Computer: hikey960
Architecture: aarch64
CPU ID: 410fd034
SUPERCOP version: 20190816
Operation: crypto_sign
Primitive: sphincss128shake256simple
Time         Object size  Test size       Implementation  Compiler  Benchmark date  SUPERCOP version
2661835365   15704 0 0    35162 952 1592  ref             gcc_-funroll-loops_-fno-schedule-insns_-O2_-fomit-frame-pointer  20191206  20190816
3677570883   27268 0 0    45362 952 1624  ref             gcc_-O3_-fomit-frame-pointer  20191206  20190816
3987949725   11028 0 0    28042 832 1608  ref             clang_-O3_-fwrapv_-mavx_-maes_-mpclmul_-fomit-frame-pointer_-Qunused-arguments  20191206  20190816
4109253300   11028 0 0    28018 832 1608  ref             clang_-O3_-fomit-frame-pointer_-Qunused-arguments  20191206  20190816
4118061150   11028 0 0    28042 832 1608  ref             clang_-O3_-fwrapv_-mavx_-fomit-frame-pointer_-Qunused-arguments  20191206  20190816
4215838275   27216 0 0    45250 952 1624  ref             gcc_-fno-schedule-insns_-O3_-fomit-frame-pointer  20191206  20190816
5045742552   15948 0 0    35634 952 1592  ref             gcc_-funroll-loops_-fno-schedule-insns_-O_-fomit-frame-pointer  20191206  20190816
5726962461   27244 0 0    45338 952 1624  ref             gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv  20191206  20190816
5822155350   8768 0 0     25242 952 1592  ref             gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv  20191206  20190816
6163213950   8780 0 0     25250 952 1592  ref             gcc_-O2_-fomit-frame-pointer  20191206  20190816
6236207550   8156 0 0     23766 936 1592  ref             gcc_-Os_-fomit-frame-pointer  20191206  20190816
6489254250   8156 0 0     23790 936 1592  ref             gcc_-funroll-loops_-Os_-fomit-frame-pointer  20191206  20190816
6860707425   11028 0 0    28042 832 1608  ref             clang_-O3_-fwrapv_-mavx2_-fomit-frame-pointer_-Qunused-arguments  20191206  20190816
6912663750   9004 0 0     25490 952 1592  ref             gcc_-fno-schedule-insns_-O_-fomit-frame-pointer  20191206  20190816
6916087649   8156 0 0     23790 936 1592  ref             gcc_-funroll-loops_-fno-schedule-insns_-Os_-fomit-frame-pointer  20191206  20190816
7025795658   8784 0 0     25178 952 1592  ref             gcc_-fno-schedule-insns_-O2_-fomit-frame-pointer  20191206  20190816
7085516067   9012 0 0     25498 952 1592  ref             gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv  20191206  20190816
7137447075   30996 0 0    50738 952 1624  ref             gcc_-funroll-loops_-O3_-fomit-frame-pointer  20191206  20190816
7992790875   15760 0 0    35218 952 1592  ref             gcc_-funroll-loops_-O2_-fomit-frame-pointer  20191206  20190816
7992939185   31056 0 0    50730 952 1624  ref             gcc_-funroll-loops_-fno-schedule-insns_-O3_-fomit-frame-pointer  20191206  20190816
10848757050  15948 0 0    35634 952 1592  ref             gcc_-funroll-loops_-O_-fomit-frame-pointer  20191206  20190816
12131556300  8148 0 0     23758 936 1592  ref             gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv  20191206  20190816
13313772900  8156 0 0     23766 936 1592  ref             gcc_-fno-schedule-insns_-Os_-fomit-frame-pointer  20191206  20190816
13450672548  9004 0 0     25490 952 1592  ref             gcc_-O_-fomit-frame-pointer  20191206  20190816
22748303925  15932 0 0    36174 936 1592  ref             gcc  20191206  20190816

Test failure

Implementation: ref
Security model: unknown
Compiler: cc
error 142
Alarm clock

Number of similar (compiler,implementation) pairs: 2, namely:
Compiler  Implementations
cc ref
gcc -funroll-loops ref
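
The "error 142" and "Alarm clock" lines in the test failure above are consistent with the test being killed by a timer signal. The sketch below assumes the reported number follows the common shell convention of 128 plus the signal number; the helper name signal_from_exit_status is hypothetical and is not part of SUPERCOP.

```python
import signal


def signal_from_exit_status(status):
    """Return the signal number implied by a shell-style exit status.

    Assumption: the status follows the "128 + signal number" convention
    used by common shells. Returns None when the status is not in that range.
    """
    if 128 < status < 128 + signal.NSIG:
        return status - 128
    return None


signum = signal_from_exit_status(142)
print(signum)  # 14

# On Linux, signal 14 is SIGALRM; the name lookup below is platform dependent.
try:
    print(signal.Signals(signum).name)
except ValueError:
    print("signal number not known on this platform")
```

Under that assumption, 142 - 128 = 14, which is SIGALRM on Linux and matches the "Alarm clock" message. That is consistent with the two unoptimized builds (cc, gcc -funroll-loops) exceeding a time limit rather than producing wrong output.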

Compiler output

Implementation: avx2
Security model: unknown
Compiler: cc
KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:19:23: fatal error: smmintrin.h: No such file or directory
KeccakP-1600-times4-SIMD256.c: #include <smmintrin.h>
KeccakP-1600-times4-SIMD256.c: ^
KeccakP-1600-times4-SIMD256.c: compilation terminated.

Number of similar (compiler,implementation) pairs: 23, namely:
Compiler  Implementations
cc avx2
gcc avx2
gcc -O2 -fomit-frame-pointer avx2
gcc -O3 -fomit-frame-pointer avx2
gcc -O -fomit-frame-pointer avx2
gcc -Os -fomit-frame-pointer avx2
gcc -fno-schedule-insns -O2 -fomit-frame-pointer avx2
gcc -fno-schedule-insns -O3 -fomit-frame-pointer avx2
gcc -fno-schedule-insns -O -fomit-frame-pointer avx2
gcc -fno-schedule-insns -Os -fomit-frame-pointer avx2
gcc -funroll-loops avx2
gcc -funroll-loops -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -O -fomit-frame-pointer avx2
gcc -funroll-loops -Os -fomit-frame-pointer avx2
gcc -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer avx2
gcc -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer avx2
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv avx2
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv avx2
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv avx2
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv avx2

Compiler output

Implementation: avx2
Security model: unknown
Compiler: clang -O3 -fomit-frame-pointer -Qunused-arguments
KeccakP-1600-times4-SIMD256.c: In file included from KeccakP-1600-times4-SIMD256.c:19:
KeccakP-1600-times4-SIMD256.c: In file included from /usr/lib/llvm-3.8/bin/../lib/clang/3.8.1/include/smmintrin.h:27:
KeccakP-1600-times4-SIMD256.c: In file included from /usr/lib/llvm-3.8/bin/../lib/clang/3.8.1/include/tmmintrin.h:27:
KeccakP-1600-times4-SIMD256.c: In file included from /usr/lib/llvm-3.8/bin/../lib/clang/3.8.1/include/pmmintrin.h:27:
KeccakP-1600-times4-SIMD256.c: In file included from /usr/lib/llvm-3.8/bin/../lib/clang/3.8.1/include/emmintrin.h:27:
KeccakP-1600-times4-SIMD256.c: In file included from /usr/lib/llvm-3.8/bin/../lib/clang/3.8.1/include/xmmintrin.h:27:
KeccakP-1600-times4-SIMD256.c: /usr/lib/llvm-3.8/bin/../lib/clang/3.8.1/include/mmintrin.h:45:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
KeccakP-1600-times4-SIMD256.c: return (__m64)__builtin_ia32_vec_init_v2si(__i, 0);
KeccakP-1600-times4-SIMD256.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
KeccakP-1600-times4-SIMD256.c: /usr/lib/llvm-3.8/bin/../lib/clang/3.8.1/include/mmintrin.h:69:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
KeccakP-1600-times4-SIMD256.c: return (__m64)__builtin_ia32_packsswb((__v4hi)__m1, (__v4hi)__m2);
KeccakP-1600-times4-SIMD256.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
KeccakP-1600-times4-SIMD256.c: /usr/lib/llvm-3.8/bin/../lib/clang/3.8.1/include/mmintrin.h:75:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
KeccakP-1600-times4-SIMD256.c: return (__m64)__builtin_ia32_packssdw((__v2si)__m1, (__v2si)__m2);
KeccakP-1600-times4-SIMD256.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
KeccakP-1600-times4-SIMD256.c: /usr/lib/llvm-3.8/bin/../lib/clang/3.8.1/include/mmintrin.h:81:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
KeccakP-1600-times4-SIMD256.c: return (__m64)__builtin_ia32_packuswb((__v4hi)__m1, (__v4hi)__m2);
KeccakP-1600-times4-SIMD256.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
KeccakP-1600-times4-SIMD256.c: /usr/lib/llvm-3.8/bin/../lib/clang/3.8.1/include/mmintrin.h:87:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
KeccakP-1600-times4-SIMD256.c: return (__m64)__builtin_ia32_punpckhbw((__v8qi)__m1, (__v8qi)__m2);
KeccakP-1600-times4-SIMD256.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
KeccakP-1600-times4-SIMD256.c: /usr/lib/llvm-3.8/bin/../lib/clang/3.8.1/include/mmintrin.h:93:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
KeccakP-1600-times4-SIMD256.c: return (__m64)__builtin_ia32_punpckhwd((__v4hi)__m1, (__v4hi)__m2);
KeccakP-1600-times4-SIMD256.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
KeccakP-1600-times4-SIMD256.c: /usr/lib/llvm-3.8/bin/../lib/clang/3.8.1/include/mmintrin.h:99:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
KeccakP-1600-times4-SIMD256.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
Compiler  Implementations
clang -O3 -fomit-frame-pointer -Qunused-arguments avx2
clang -O3 -fwrapv -mavx2 -fomit-frame-pointer -Qunused-arguments avx2
clang -O3 -fwrapv -mavx -fomit-frame-pointer -Qunused-arguments avx2
clang -O3 -fwrapv -mavx -maes -mpclmul -fomit-frame-pointer -Qunused-arguments avx2
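
Both avx2 failure groups above come from x86 intrinsic headers (smmintrin.h, mmintrin.h) being compiled for aarch64, where they are either absent (gcc) or unusable (clang). A minimal sketch of filtering implementations by machine type before building follows. The function names, the set of x86 machine strings, and the assumption that avx2 is the only x86-only implementation are all illustrative and do not describe how SUPERCOP selects implementations.

```python
import platform

# Assumption: machine strings that indicate an x86 target.
X86_MACHINES = {"x86_64", "amd64", "i386", "i486", "i586", "i686"}

# Assumption: implementations that require x86 SIMD intrinsic headers.
X86_ONLY_IMPLEMENTATIONS = {"avx2"}


def is_x86(machine):
    """Return True if the machine string names an x86 architecture."""
    return machine.lower() in X86_MACHINES


def applicable_implementations(implementations, machine):
    """Keep only the implementations expected to compile on this machine."""
    return [
        impl
        for impl in implementations
        if is_x86(machine) or impl not in X86_ONLY_IMPLEMENTATIONS
    ]


print(applicable_implementations(["ref", "avx2"], "aarch64"))  # ['ref']
print(applicable_implementations(["ref", "avx2"], "x86_64"))   # ['ref', 'avx2']

# The same check against the machine running this script.
print(applicable_implementations(["ref", "avx2"], platform.machine()))
```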