Implementation notes: aarch64, supercoplxc, crypto_aead/deoxysi256v141

Computer: supercoplxc
Architecture: aarch64
CPU ID: 410fd034
SUPERCOP version: 20190816
Operation: crypto_aead
Primitive: deoxysi256v141
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
34456035997 0 56049833 920 1448tablegcc_-O3_-fomit-frame-pointer2019090520190816
34992037701 0 56052369 920 1424tablegcc_-funroll-loops_-O2_-fomit-frame-pointer2019090520190816
35248036325 0 56051481 920 1448tablegcc_-funroll-loops_-fno-schedule-insns_-O3_-fomit-frame-pointer2019090520190816
35352036257 0 56051433 920 1448tablegcc_-funroll-loops_-O3_-fomit-frame-pointer2019090520190816
35640034396 0 55649118 800 1448tableclang_-mcpu=native_-mfpu=neon_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments2019090520190816
35840037945 0 56052713 920 1424tablegcc_-funroll-loops_-fno-schedule-insns_-O2_-fomit-frame-pointer2019090520190816
38568035109 0 56047737 920 1424tablegcc_-O2_-fomit-frame-pointer2019090520190816
40344034460 0 55649166 800 1448tableclang_-O3_-fwrapv_-mavx_-fomit-frame-pointer_-Qunused-arguments2019090520190816
40408034460 0 55649166 800 1448tableclang_-O3_-fwrapv_-mavx2_-fomit-frame-pointer_-Qunused-arguments2019090520190816
40464034464 0 55649118 800 1448tableclang_-O3_-fomit-frame-pointer_-Qunused-arguments2019090520190816
40488034460 0 55649166 800 1448tableclang_-O3_-fwrapv_-mavx_-maes_-mpclmul_-fomit-frame-pointer_-Qunused-arguments2019090520190816
42080033473 0 54845393 904 1400tablegcc_-funroll-loops_-Os_-fomit-frame-pointer2019090520190816
42168033473 0 54845393 904 1400tablegcc_-funroll-loops_-fno-schedule-insns_-Os_-fomit-frame-pointer2019090520190816
50680036069 0 56049889 920 1448tablegcc_-fno-schedule-insns_-O3_-fomit-frame-pointer2019090520190816
54656035161 0 56047737 920 1424tablegcc_-fno-schedule-insns_-O2_-fomit-frame-pointer2019090520190816
57976033425 0 54845289 904 1400tablegcc_-Os_-fomit-frame-pointer2019090520190816
58096033425 0 54845289 904 1400tablegcc_-fno-schedule-insns_-Os_-fomit-frame-pointer2019090520190816
70232038429 0 56054481 920 1424tablegcc_-funroll-loops_-O_-fomit-frame-pointer2019090520190816
70232038429 0 56054481 920 1424tablegcc_-funroll-loops_-fno-schedule-insns_-O_-fomit-frame-pointer2019090520190816
72912034773 0 56047593 920 1424tablegcc_-O_-fomit-frame-pointer2019090520190816
72952034773 0 56047593 920 1424tablegcc_-fno-schedule-insns_-O_-fomit-frame-pointer2019090520190816
207360026477 0 041665 920 880refgcc_-funroll-loops_-O3_-fomit-frame-pointer2019090520190816
209280026297 0 040137 920 880refgcc_-O3_-fomit-frame-pointer2019090520190816
213608026485 0 041657 920 880refgcc_-funroll-loops_-fno-schedule-insns_-O3_-fomit-frame-pointer2019090520190816
216904055249 0 54772337 904 1416tablecc2019090520190816
218744055249 0 54772337 904 1416tablegcc_-funroll-loops2019090520190816
218896055249 0 54772337 904 1416tablegcc2019090520190816
256632024404 0 039126 800 888refclang_-mcpu=native_-mfpu=neon_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments2019090520190816
258672024260 0 038918 800 888refclang_-O3_-fomit-frame-pointer_-Qunused-arguments2019090520190816
258680024264 0 038974 800 888refclang_-O3_-fwrapv_-mavx_-maes_-mpclmul_-fomit-frame-pointer_-Qunused-arguments2019090520190816
258784024264 0 038974 800 888refclang_-O3_-fwrapv_-mavx_-fomit-frame-pointer_-Qunused-arguments2019090520190816
258992024264 0 038974 800 888refclang_-O3_-fwrapv_-mavx2_-fomit-frame-pointer_-Qunused-arguments2019090520190816
299040026261 0 040097 920 880refgcc_-fno-schedule-insns_-O3_-fomit-frame-pointer2019090520190816
446208026905 0 041561 920 864refgcc_-funroll-loops_-O2_-fomit-frame-pointer2019090520190816
449584027113 0 041881 920 864refgcc_-funroll-loops_-fno-schedule-insns_-O2_-fomit-frame-pointer2019090520190816
698480024369 0 036985 920 864refgcc_-O2_-fomit-frame-pointer2019090520190816
758808024433 0 037001 920 864refgcc_-fno-schedule-insns_-O2_-fomit-frame-pointer2019090520190816
959744026525 0 042569 920 864refgcc_-funroll-loops_-O_-fomit-frame-pointer2019090520190816
963120026525 0 042569 920 864refgcc_-funroll-loops_-fno-schedule-insns_-O_-fomit-frame-pointer2019090520190816
1124464023060 0 034985 904 856refgcc_-funroll-loops_-fno-schedule-insns_-Os_-fomit-frame-pointer2019090520190816
1128512023060 0 034985 904 856refgcc_-funroll-loops_-Os_-fomit-frame-pointer2019090520190816
1172864023737 0 036553 920 864refgcc_-O_-fomit-frame-pointer2019090520190816
1174176023068 0 034937 904 856refgcc_-fno-schedule-insns_-Os_-fomit-frame-pointer2019090520190816
1177784023737 0 036553 920 864refgcc_-fno-schedule-insns_-O_-fomit-frame-pointer2019090520190816
1179336023068 0 034937 904 856refgcc_-Os_-fomit-frame-pointer2019090520190816
3132960038005 0 055089 904 864refgcc2019090520190816
3133328038005 0 055089 904 864refgcc_-funroll-loops2019090520190816
3209720038005 0 055089 904 864refcc2019090520190816

Compiler output

Implementation: aesni
Security model: unknown
Compiler: cc
deoxys.c: deoxys.c:27:10: fatal error: xmmintrin.h: No such file or directory
deoxys.c: #include <xmmintrin.h> /* SSE instructions and _mm_malloc */
deoxys.c: ^~~~~~~~~~~~~
deoxys.c: compilation terminated.

Number of similar (compiler,implementation) pairs: 19, namely:
CompilerImplementations
cc aesni
gcc aesni
gcc -O2 -fomit-frame-pointer aesni
gcc -O3 -fomit-frame-pointer aesni
gcc -O -fomit-frame-pointer aesni
gcc -Os -fomit-frame-pointer aesni
gcc -fno-schedule-insns -O2 -fomit-frame-pointer aesni
gcc -fno-schedule-insns -O3 -fomit-frame-pointer aesni
gcc -fno-schedule-insns -O -fomit-frame-pointer aesni
gcc -fno-schedule-insns -Os -fomit-frame-pointer aesni
gcc -funroll-loops aesni
gcc -funroll-loops -O2 -fomit-frame-pointer aesni
gcc -funroll-loops -O3 -fomit-frame-pointer aesni
gcc -funroll-loops -O -fomit-frame-pointer aesni
gcc -funroll-loops -Os -fomit-frame-pointer aesni
gcc -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer aesni
gcc -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer aesni
gcc -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer aesni
gcc -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer aesni

Compiler output

Implementation: aesni
Security model: unknown
Compiler: clang -O3 -fomit-frame-pointer -Qunused-arguments
deoxys.c: In file included from deoxys.c:27:
deoxys.c: In file included from /usr/lib/llvm-7/lib/clang/7.0.1/include/xmmintrin.h:27:
deoxys.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:64:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
deoxys.c: return (__m64)__builtin_ia32_vec_init_v2si(__i, 0);
deoxys.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
deoxys.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:143:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
deoxys.c: return (__m64)__builtin_ia32_packsswb((__v4hi)__m1, (__v4hi)__m2);
deoxys.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
deoxys.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:173:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
deoxys.c: return (__m64)__builtin_ia32_packssdw((__v2si)__m1, (__v2si)__m2);
deoxys.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
deoxys.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:203:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
deoxys.c: return (__m64)__builtin_ia32_packuswb((__v4hi)__m1, (__v4hi)__m2);
deoxys.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
deoxys.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:230:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
deoxys.c: return (__m64)__builtin_ia32_punpckhbw((__v8qi)__m1, (__v8qi)__m2);
deoxys.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
deoxys.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:253:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
deoxys.c: return (__m64)__builtin_ia32_punpckhwd((__v4hi)__m1, (__v4hi)__m2);
deoxys.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
deoxys.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:274:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
deoxys.c: return (__m64)__builtin_ia32_punpckhdq((__v2si)__m1, (__v2si)__m2);
deoxys.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
deoxys.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:301:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
deoxys.c: return (__m64)__builtin_ia32_punpcklbw((__v8qi)__m1, (__v8qi)__m2);
deoxys.c: ...

Number of similar (compiler,implementation) pairs: 5, namely:
CompilerImplementations
clang -O3 -fomit-frame-pointer -Qunused-arguments aesni
clang -O3 -fwrapv -mavx2 -fomit-frame-pointer -Qunused-arguments aesni
clang -O3 -fwrapv -mavx -fomit-frame-pointer -Qunused-arguments aesni
clang -O3 -fwrapv -mavx -maes -mpclmul -fomit-frame-pointer -Qunused-arguments aesni
clang -mcpu=native -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments aesni

Compiler output

Implementation: aesnis
Security model: unknown
Compiler: cc
deoxys.c: deoxys.c:26:10: fatal error: xmmintrin.h: No such file or directory
deoxys.c: #include <xmmintrin.h> /* SSE instructions and _mm_malloc */
deoxys.c: ^~~~~~~~~~~~~
deoxys.c: compilation terminated.

Number of similar (compiler,implementation) pairs: 19, namely:
CompilerImplementations
cc aesnis
gcc aesnis
gcc -O2 -fomit-frame-pointer aesnis
gcc -O3 -fomit-frame-pointer aesnis
gcc -O -fomit-frame-pointer aesnis
gcc -Os -fomit-frame-pointer aesnis
gcc -fno-schedule-insns -O2 -fomit-frame-pointer aesnis
gcc -fno-schedule-insns -O3 -fomit-frame-pointer aesnis
gcc -fno-schedule-insns -O -fomit-frame-pointer aesnis
gcc -fno-schedule-insns -Os -fomit-frame-pointer aesnis
gcc -funroll-loops aesnis
gcc -funroll-loops -O2 -fomit-frame-pointer aesnis
gcc -funroll-loops -O3 -fomit-frame-pointer aesnis
gcc -funroll-loops -O -fomit-frame-pointer aesnis
gcc -funroll-loops -Os -fomit-frame-pointer aesnis
gcc -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer aesnis
gcc -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer aesnis
gcc -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer aesnis
gcc -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer aesnis

Compiler output

Implementation: aesnis
Security model: unknown
Compiler: clang -O3 -fomit-frame-pointer -Qunused-arguments
deoxys.c: In file included from deoxys.c:26:
deoxys.c: In file included from /usr/lib/llvm-7/lib/clang/7.0.1/include/xmmintrin.h:27:
deoxys.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:64:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
deoxys.c: return (__m64)__builtin_ia32_vec_init_v2si(__i, 0);
deoxys.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
deoxys.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:143:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
deoxys.c: return (__m64)__builtin_ia32_packsswb((__v4hi)__m1, (__v4hi)__m2);
deoxys.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
deoxys.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:173:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
deoxys.c: return (__m64)__builtin_ia32_packssdw((__v2si)__m1, (__v2si)__m2);
deoxys.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
deoxys.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:203:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
deoxys.c: return (__m64)__builtin_ia32_packuswb((__v4hi)__m1, (__v4hi)__m2);
deoxys.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
deoxys.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:230:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
deoxys.c: return (__m64)__builtin_ia32_punpckhbw((__v8qi)__m1, (__v8qi)__m2);
deoxys.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
deoxys.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:253:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
deoxys.c: return (__m64)__builtin_ia32_punpckhwd((__v4hi)__m1, (__v4hi)__m2);
deoxys.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
deoxys.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:274:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
deoxys.c: return (__m64)__builtin_ia32_punpckhdq((__v2si)__m1, (__v2si)__m2);
deoxys.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
deoxys.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:301:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
deoxys.c: return (__m64)__builtin_ia32_punpcklbw((__v8qi)__m1, (__v8qi)__m2);
deoxys.c: ...

Number of similar (compiler,implementation) pairs: 5, namely:
CompilerImplementations
clang -O3 -fomit-frame-pointer -Qunused-arguments aesnis
clang -O3 -fwrapv -mavx2 -fomit-frame-pointer -Qunused-arguments aesnis
clang -O3 -fwrapv -mavx -fomit-frame-pointer -Qunused-arguments aesnis
clang -O3 -fwrapv -mavx -maes -mpclmul -fomit-frame-pointer -Qunused-arguments aesnis
clang -mcpu=native -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments aesnis

Compiler output

Implementation: bitslice
Security model: unknown
Compiler: cc
deoxys256.c: deoxys256.c:29:10: fatal error: tmmintrin.h: No such file or directory
deoxys256.c: #include <tmmintrin.h>
deoxys256.c: ^~~~~~~~~~~~~
deoxys256.c: compilation terminated.

Number of similar (compiler,implementation) pairs: 19, namely:
CompilerImplementations
cc bitslice
gcc bitslice
gcc -O2 -fomit-frame-pointer bitslice
gcc -O3 -fomit-frame-pointer bitslice
gcc -O -fomit-frame-pointer bitslice
gcc -Os -fomit-frame-pointer bitslice
gcc -fno-schedule-insns -O2 -fomit-frame-pointer bitslice
gcc -fno-schedule-insns -O3 -fomit-frame-pointer bitslice
gcc -fno-schedule-insns -O -fomit-frame-pointer bitslice
gcc -fno-schedule-insns -Os -fomit-frame-pointer bitslice
gcc -funroll-loops bitslice
gcc -funroll-loops -O2 -fomit-frame-pointer bitslice
gcc -funroll-loops -O3 -fomit-frame-pointer bitslice
gcc -funroll-loops -O -fomit-frame-pointer bitslice
gcc -funroll-loops -Os -fomit-frame-pointer bitslice
gcc -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer bitslice
gcc -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer bitslice
gcc -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer bitslice
gcc -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer bitslice

Compiler output

Implementation: bitslice
Security model: unknown
Compiler: clang -O3 -fomit-frame-pointer -Qunused-arguments
deoxys256.c: In file included from deoxys256.c:29:
deoxys256.c: In file included from /usr/lib/llvm-7/lib/clang/7.0.1/include/tmmintrin.h:27:
deoxys256.c: In file included from /usr/lib/llvm-7/lib/clang/7.0.1/include/pmmintrin.h:27:
deoxys256.c: In file included from /usr/lib/llvm-7/lib/clang/7.0.1/include/emmintrin.h:27:
deoxys256.c: In file included from /usr/lib/llvm-7/lib/clang/7.0.1/include/xmmintrin.h:27:
deoxys256.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:64:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
deoxys256.c: return (__m64)__builtin_ia32_vec_init_v2si(__i, 0);
deoxys256.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
deoxys256.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:143:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
deoxys256.c: return (__m64)__builtin_ia32_packsswb((__v4hi)__m1, (__v4hi)__m2);
deoxys256.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
deoxys256.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:173:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
deoxys256.c: return (__m64)__builtin_ia32_packssdw((__v2si)__m1, (__v2si)__m2);
deoxys256.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
deoxys256.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:203:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
deoxys256.c: return (__m64)__builtin_ia32_packuswb((__v4hi)__m1, (__v4hi)__m2);
deoxys256.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
deoxys256.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:230:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
deoxys256.c: return (__m64)__builtin_ia32_punpckhbw((__v8qi)__m1, (__v8qi)__m2);
deoxys256.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
deoxys256.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:253:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
deoxys256.c: return (__m64)__builtin_ia32_punpckhwd((__v4hi)__m1, (__v4hi)__m2);
deoxys256.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
deoxys256.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:274:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
deoxys256.c: return (__m64)__builtin_ia32_punpckhdq((__v2si)__m1, (__v2si)__m2);
deoxys256.c: ...

Number of similar (compiler,implementation) pairs: 5, namely:
CompilerImplementations
clang -O3 -fomit-frame-pointer -Qunused-arguments bitslice
clang -O3 -fwrapv -mavx2 -fomit-frame-pointer -Qunused-arguments bitslice
clang -O3 -fwrapv -mavx -fomit-frame-pointer -Qunused-arguments bitslice
clang -O3 -fwrapv -mavx -maes -mpclmul -fomit-frame-pointer -Qunused-arguments bitslice
clang -mcpu=native -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments bitslice