Implementation notes: amd64, saber214, crypto_hash/lsh256

Computer: saber214
Microarchitecture: amd64; Bulldozer (600f20)
Architecture: amd64
CPU ID: AuthenticAMD-00600f20-1789c3f5
SUPERCOP version: 20240625
Operation: crypto_hash
Primitive: lsh256
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
591410531 0 018676 816 744ssse3clang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024062720240625
592010441 0 016228 816 728ssse3clang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024062720240625
592010441 0 016228 816 728xopclang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024062720240625
59219159 0 014150 808 728xopclang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024062720240625
592710531 0 018676 816 744xopclang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024062720240625
59349159 0 014150 808 728ssse3clang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024062720240625
594010582 0 015084 816 728ssse3clang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024062720240625
594010582 0 015084 816 728xopclang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024062720240625
664211138 0 016381 768 800xopgcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062720240625
770110367 0 014977 752 768xopgcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062720240625
781912013 0 018606 776 800xopgcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062720240625
784812029 0 017118 776 800xopgcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062720240625
856712020 0 016829 768 800ssse3gcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062720240625
885813117 0 019158 776 800ssse3gcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062720240625
891613005 0 017614 776 800ssse3gcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062720240625
924411253 0 015425 752 768ssse3gcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062720240625
1009014811 0 020900 816 744sse2clang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024062720240625
1009314721 0 018468 816 728sse2clang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024062720240625
1024912953 0 016054 808 728sse2clang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024062720240625
1037614862 0 017276 816 728sse2clang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024062720240625
1614617632 0 019629 768 800sse2gcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062720240625
1620819213 0 020718 776 800sse2gcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062720240625
1623719325 0 022262 776 800sse2gcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062720240625
1681017059 0 018321 752 768sse2gcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062720240625
1681521483 0 022852 816 728sse2clang_-mcpu=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024062720240625
2252020695 0 023900 816 744refclang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024062720240625
2298420695 0 021508 816 728refclang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024062720240625
2310520649 0 019846 808 728refclang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024062720240625
2614823191 0 024076 816 728refclang_-mcpu=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024062720240625
3050623079 0 021329 752 768refgcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062720240625
3082730477 0 027846 776 800refgcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062720240625
3116829821 0 026022 776 800refgcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062720240625
3168726596 0 024109 768 800refgcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062720240625
352952362 336 01749400 144704 10136cryptoppg++_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062720240625
353612786 336 01752482 144688 10168cryptoppg++_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062720240625
356223013 336 01750856 144696 10168cryptoppg++_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062720240625
457292877 336 01750830 144696 10168cryptoppg++_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062720240625
5131936378 0 028284 816 728refclang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024062720240625

Compiler output


lsh-impl.c: lsh-impl.c:190:10: error: always_inline function '_mm256_add_epi32' requires target feature 'avx2', but would be inlined into function 'add_blk' that is compiled without support for 'avx2'
lsh-impl.c:         *cv_l = ADD(*cv_l, *cv_r);
lsh-impl.c:                 ^
lsh-impl.c: lsh-impl.c:152:18: note: expanded from macro 'ADD'
lsh-impl.c: #define ADD(x,y) _mm256_add_epi32(x,y)
lsh-impl.c:                  ^
lsh-impl.c: lsh-impl.c:194:11: error: always_inline function '_mm256_slli_epi32' requires target feature 'avx2', but would be inlined into function 'rotate_blk_even_alpha' that is compiled without support for 'avx2'
lsh-impl.c:         *cv = OR(SHIFT_L(*cv, ROT_EVEN_ALPHA), SHIFT_R(*cv, WORD_BIT_LEN - ROT_EVEN_ALPHA));
lsh-impl.c:                  ^
lsh-impl.c: lsh-impl.c:153:22: note: expanded from macro 'SHIFT_L'
lsh-impl.c: #define SHIFT_L(x,r) _mm256_slli_epi32(x,r)
lsh-impl.c:                      ^
lsh-impl.c: lsh-impl.c:194:41: error: always_inline function '_mm256_srli_epi32' requires target feature 'avx2', but would be inlined into function 'rotate_blk_even_alpha' that is compiled without support for 'avx2'
lsh-impl.c:         *cv = OR(SHIFT_L(*cv, ROT_EVEN_ALPHA), SHIFT_R(*cv, WORD_BIT_LEN - ROT_EVEN_ALPHA));
lsh-impl.c:                                                ^
lsh-impl.c: lsh-impl.c:154:22: note: expanded from macro 'SHIFT_R'
lsh-impl.c: #define SHIFT_R(x,r) _mm256_srli_epi32(x,r)
lsh-impl.c:                      ^
lsh-impl.c: lsh-impl.c:194:8: error: always_inline function '_mm256_or_si256' requires target feature 'avx2', but would be inlined into function 'rotate_blk_even_alpha' that is compiled without support for 'avx2'
lsh-impl.c:         *cv = OR(SHIFT_L(*cv, ROT_EVEN_ALPHA), SHIFT_R(*cv, WORD_BIT_LEN - ROT_EVEN_ALPHA));
lsh-impl.c:               ^
lsh-impl.c: lsh-impl.c:148:17: note: expanded from macro 'OR'
lsh-impl.c: #define OR(x,y) _mm256_or_si256(x,y)
lsh-impl.c:                 ^
lsh-impl.c: lsh-impl.c:210:10: error: always_inline function '_mm256_xor_si256' requires target feature 'avx2', but would be inlined into function 'xor_with_const' that is compiled without support for 'avx2'
lsh-impl.c: ...

Number of similar (implementation,compiler) pairs: 4, namely:
ImplementationCompiler
avx2clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
avx2clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
avx2clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
avx2clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)

Compiler output


lsh-impl.c: lsh-impl.c:367:10: error: always_inline function '_mm256_set_epi32' requires target feature 'avx', but would be inlined into function 'lsh256_init' that is compiled without support for 'avx'
lsh-impl.c:         *cv_l = _mm256_set_epi32(0, 0, 0, 0, 0, 0, LSH_GET_HASHBIT(algtype), LSH256_HASH_VAL_MAX_BYTE_LEN);
lsh-impl.c:                 ^
lsh-impl.c: lsh-impl.c:367:10: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
lsh-impl.c: lsh-impl.c:368:10: error: always_inline function '_mm256_setzero_si256' requires target feature 'avx', but would be inlined into function 'lsh256_init' that is compiled without support for 'avx'
lsh-impl.c:         *cv_r = _mm256_setzero_si256();
lsh-impl.c:                 ^
lsh-impl.c: lsh-impl.c:368:10: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
lsh-impl.c: lsh-impl.c:369:19: error: always_inline function '_mm256_loadu_si256' requires target feature 'avx', but would be inlined into function 'lsh256_init' that is compiled without support for 'avx'
lsh-impl.c:         byte_perm_step = LOAD(g_BytePermInfo);
lsh-impl.c:                          ^
lsh-impl.c: lsh-impl.c:145:17: note: expanded from macro 'LOAD'
lsh-impl.c: #define LOAD(x) _mm256_loadu_si256((__m256i*)x)
lsh-impl.c:                 ^
lsh-impl.c: lsh-impl.c:369:19: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
lsh-impl.c: lsh-impl.c:145:17: note: expanded from macro 'LOAD'
lsh-impl.c: #define LOAD(x) _mm256_loadu_si256((__m256i*)x)
lsh-impl.c:                 ^
lsh-impl.c: lsh-impl.c:375:3: warning: AVX vector argument of type 'const __m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI [-Wpsabi]
lsh-impl.c:                 mix_even(cv_l, cv_r, const_v, byte_perm_step);
lsh-impl.c:                 ^
lsh-impl.c: lsh-impl.c:379:3: warning: AVX vector argument of type 'const __m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI [-Wpsabi]
lsh-impl.c:                 mix_odd(cv_l, cv_r, const_v, byte_perm_step);
lsh-impl.c:                 ^
lsh-impl.c: 2 warnings and 6 errors generated.

Number of similar (implementation,compiler) pairs: 1, namely:
ImplementationCompiler
avx2clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)

Compiler output


lsh-impl.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/immintrin.h:47,
lsh-impl.c:                  from lsh-impl.c:35:
lsh-impl.c: lsh-impl.c: In function 'msg_exp_even':
lsh-impl.c: /usr/lib/gcc/x86_64-linux-gnu/11/include/avx2intrin.h:119:1: error: inlining failed in call to 'always_inline' '_mm256_add_epi32': target specific option mismatch
lsh-impl.c:   119 | _mm256_add_epi32 (__m256i __A, __m256i __B)
lsh-impl.c:       | ^~~~~~~~~~~~~~~~
lsh-impl.c: lsh-impl.c:152:18: note: called from here
lsh-impl.c:   152 | #define ADD(x,y) _mm256_add_epi32(x,y)
lsh-impl.c:       |                  ^~~~~~~~~~~~~~~~~~~~~
lsh-impl.c: lsh-impl.c:172:34: note: in expansion of macro 'ADD'
lsh-impl.c:   172 |         i_state->submsg_e_r[0] = ADD(i_state->submsg_o_r[0], SHUFFLE8(i_state->submsg_e_r[0], perm_step));
lsh-impl.c:       |                                  ^~~
lsh-impl.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/immintrin.h:47,
lsh-impl.c:                  from lsh-impl.c:35:
lsh-impl.c: /usr/lib/gcc/x86_64-linux-gnu/11/include/avx2intrin.h:588:1: error: inlining failed in call to 'always_inline' '_mm256_shuffle_epi8': target specific option mismatch
lsh-impl.c:   588 | _mm256_shuffle_epi8 (__m256i __X, __m256i __Y)
lsh-impl.c:       | ^~~~~~~~~~~~~~~~~~~
lsh-impl.c: lsh-impl.c:152:18: note: called from here
lsh-impl.c:   152 | #define ADD(x,y) _mm256_add_epi32(x,y)
lsh-impl.c:       |                  ^~~~~~~~~~~~~~~~~~~~~
lsh-impl.c: lsh-impl.c:172:34: note: in expansion of macro 'ADD'
lsh-impl.c:   172 |         i_state->submsg_e_r[0] = ADD(i_state->submsg_o_r[0], SHUFFLE8(i_state->submsg_e_r[0], perm_step));
lsh-impl.c:       |                                  ^~~
lsh-impl.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/immintrin.h:47,
lsh-impl.c:                  from lsh-impl.c:35:
lsh-impl.c: ...

Number of similar (implementation,compiler) pairs: 4, namely:
ImplementationCompiler
avx2gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
avx2gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
avx2gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
avx2gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)

Compiler output


lsh-impl.c: lsh-impl.c:126:39: warning: unused variable 'g_MsgWordPermInfo' [-Wunused-const-variable]
lsh-impl.c: static const LSH_ALIGNED_(32) lsh_u32 g_MsgWordPermInfo[8] = {
lsh-impl.c:                                       ^
lsh-impl.c: 1 warning generated.
lsh.c: lsh.c:126:39: warning: unused variable 'g_MsgWordPermInfo' [-Wunused-const-variable]
lsh.c: static const LSH_ALIGNED_(32) lsh_u32 g_MsgWordPermInfo[8] = {
lsh.c:                                       ^
lsh.c: 1 warning generated.

Number of similar (implementation,compiler) pairs: 4, namely:
ImplementationCompiler
ssse3clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
ssse3clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
ssse3clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
ssse3clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)

Compiler output


lsh-impl.c: lsh-impl.c:126:39: warning: unused variable 'g_MsgWordPermInfo' [-Wunused-const-variable]
lsh-impl.c: static const LSH_ALIGNED_(32) lsh_u32 g_MsgWordPermInfo[8] = {
lsh-impl.c:                                       ^
lsh-impl.c: lsh-impl.c:221:12: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'rotate_msg_gamma' that is compiled without support for 'ssse3'
lsh-impl.c:         cv_r[0] = SHUFFLE8(cv_r[0], perm_step[0]);
lsh-impl.c:                   ^
lsh-impl.c: lsh-impl.c:143:23: note: expanded from macro 'SHUFFLE8'
lsh-impl.c: #define SHUFFLE8(x,y) _mm_shuffle_epi8(x,y)
lsh-impl.c:                       ^
lsh-impl.c: lsh-impl.c:222:12: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'rotate_msg_gamma' that is compiled without support for 'ssse3'
lsh-impl.c:         cv_r[1] = SHUFFLE8(cv_r[1], perm_step[1]);
lsh-impl.c:                   ^
lsh-impl.c: lsh-impl.c:143:23: note: expanded from macro 'SHUFFLE8'
lsh-impl.c: #define SHUFFLE8(x,y) _mm_shuffle_epi8(x,y)
lsh-impl.c:                       ^
lsh-impl.c: 1 warning and 2 errors generated.

Number of similar (implementation,compiler) pairs: 1, namely:
ImplementationCompiler
ssse3clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)

Compiler output


lsh-impl.c: lsh-impl.c:126:39: warning: 'g_MsgWordPermInfo' defined but not used [-Wunused-const-variable=]
lsh-impl.c:   126 | static const LSH_ALIGNED_(32) lsh_u32 g_MsgWordPermInfo[8] = {
lsh-impl.c:       |                                       ^~~~~~~~~~~~~~~~~
lsh.c: lsh.c:126:39: warning: 'g_MsgWordPermInfo' defined but not used [-Wunused-const-variable=]
lsh.c:   126 | static const LSH_ALIGNED_(32) lsh_u32 g_MsgWordPermInfo[8] = {
lsh.c:       |                                       ^~~~~~~~~~~~~~~~~

Number of similar (implementation,compiler) pairs: 4, namely:
ImplementationCompiler
ssse3gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
ssse3gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
ssse3gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
ssse3gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)

Compiler output


lsh-impl.c: lsh-impl.c:125:39: warning: unused variable 'g_MsgWordPermInfo' [-Wunused-const-variable]
lsh-impl.c: static const LSH_ALIGNED_(32) lsh_u32 g_MsgWordPermInfo[8] = {
lsh-impl.c:                                       ^
lsh-impl.c: 1 warning generated.
lsh.c: lsh.c:125:39: warning: unused variable 'g_MsgWordPermInfo' [-Wunused-const-variable]
lsh.c: static const LSH_ALIGNED_(32) lsh_u32 g_MsgWordPermInfo[8] = {
lsh.c:                                       ^
lsh.c: 1 warning generated.

Number of similar (implementation,compiler) pairs: 4, namely:
ImplementationCompiler
xopclang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
xopclang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
xopclang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
xopclang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)

Compiler output


lsh-impl.c: lsh-impl.c:125:39: warning: unused variable 'g_MsgWordPermInfo' [-Wunused-const-variable]
lsh-impl.c: static const LSH_ALIGNED_(32) lsh_u32 g_MsgWordPermInfo[8] = {
lsh-impl.c:                                       ^
lsh-impl.c: lsh-impl.c:203:10: error: '__builtin_ia32_vprotdi' needs target feature xop
lsh-impl.c:         cv[0] = _mm_roti_epi32(cv[0], ROT_EVEN_ALPHA);
lsh-impl.c:                 ^
lsh-impl.c: /usr/lib/llvm-14/lib/clang/14.0.0/include/xopintrin.h:234:13: note: expanded from macro '_mm_roti_epi32'
lsh-impl.c:   ((__m128i)__builtin_ia32_vprotdi((__v4si)(__m128i)(A), (N)))
lsh-impl.c:             ^
lsh-impl.c: lsh-impl.c:204:10: error: '__builtin_ia32_vprotdi' needs target feature xop
lsh-impl.c:         cv[1] = _mm_roti_epi32(cv[1], ROT_EVEN_ALPHA);
lsh-impl.c:                 ^
lsh-impl.c: /usr/lib/llvm-14/lib/clang/14.0.0/include/xopintrin.h:234:13: note: expanded from macro '_mm_roti_epi32'
lsh-impl.c:   ((__m128i)__builtin_ia32_vprotdi((__v4si)(__m128i)(A), (N)))
lsh-impl.c:             ^
lsh-impl.c: lsh-impl.c:207:10: error: '__builtin_ia32_vprotdi' needs target feature xop
lsh-impl.c:         cv[0] = _mm_roti_epi32(cv[0], ROT_EVEN_BETA);
lsh-impl.c:                 ^
lsh-impl.c: /usr/lib/llvm-14/lib/clang/14.0.0/include/xopintrin.h:234:13: note: expanded from macro '_mm_roti_epi32'
lsh-impl.c:   ((__m128i)__builtin_ia32_vprotdi((__v4si)(__m128i)(A), (N)))
lsh-impl.c:             ^
lsh-impl.c: lsh-impl.c:208:10: error: '__builtin_ia32_vprotdi' needs target feature xop
lsh-impl.c:         cv[1] = _mm_roti_epi32(cv[1], ROT_EVEN_BETA);
lsh-impl.c:                 ^
lsh-impl.c: /usr/lib/llvm-14/lib/clang/14.0.0/include/xopintrin.h:234:13: note: expanded from macro '_mm_roti_epi32'
lsh-impl.c: ...

Number of similar (implementation,compiler) pairs: 1, namely:
ImplementationCompiler
xopclang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)

Compiler output


lsh-impl.c: lsh-impl.c:125:39: warning: 'g_MsgWordPermInfo' defined but not used [-Wunused-const-variable=]
lsh-impl.c:   125 | static const LSH_ALIGNED_(32) lsh_u32 g_MsgWordPermInfo[8] = {
lsh-impl.c:       |                                       ^~~~~~~~~~~~~~~~~
lsh.c: lsh.c:125:39: warning: 'g_MsgWordPermInfo' defined but not used [-Wunused-const-variable=]
lsh.c:   125 | static const LSH_ALIGNED_(32) lsh_u32 g_MsgWordPermInfo[8] = {
lsh.c:       |                                       ^~~~~~~~~~~~~~~~~

Number of similar (implementation,compiler) pairs: 4, namely:
ImplementationCompiler
xopgcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
xopgcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
xopgcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
xopgcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)

TIMECOP error (can be valgrind bug)


Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x2637A4
   at 0x...: CryptoPP::DetectX86Features() (cpu.cpp:605)
   by 0x...: call_init (libc-start.c:145)
   by 0x...: __libc_start_main@@GLIBC_2.34 (libc-start.c:379)
   by 0x...: (below main) (in /home/djb/supercop-data/saber214/amd64/try/cpp/g++_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall/constbranchindex/crypto_hash/lsh256/cryptopp/work/try-timecop)

Number of similar (implementation,compiler) pairs: 1, namely:
ImplementationCompiler
cryptoppg++ -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)

TIMECOP error (can be valgrind bug)


Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x263CC4
   at 0x...: CryptoPP::DetectX86Features() (cpu.cpp:605)
   by 0x...: call_init (libc-start.c:145)
   by 0x...: __libc_start_main@@GLIBC_2.34 (libc-start.c:379)
   by 0x...: (below main) (in /home/djb/supercop-data/saber214/amd64/try/cpp/g++_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall/constbranchindex/crypto_hash/lsh256/cryptopp/work/try-timecop)

Number of similar (implementation,compiler) pairs: 1, namely:
ImplementationCompiler
cryptoppg++ -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)

TIMECOP error (can be valgrind bug)


Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x2635A4
   at 0x...: CryptoPP::DetectX86Features() (cpu.cpp:605)
   by 0x...: call_init (libc-start.c:145)
   by 0x...: __libc_start_main@@GLIBC_2.34 (libc-start.c:379)
   by 0x...: (below main) (in /home/djb/supercop-data/saber214/amd64/try/cpp/g++_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall/constbranchindex/crypto_hash/lsh256/cryptopp/work/try-timecop)

Number of similar (implementation,compiler) pairs: 1, namely:
ImplementationCompiler
cryptoppg++ -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)

TIMECOP error (can be valgrind bug)


Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x2630A4
   at 0x...: CryptoPP::DetectX86Features() (cpu.cpp:605)
   by 0x...: call_init (libc-start.c:145)
   by 0x...: __libc_start_main@@GLIBC_2.34 (libc-start.c:379)
   by 0x...: (below main) (in /home/djb/supercop-data/saber214/amd64/try/cpp/g++_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall/constbranchindex/crypto_hash/lsh256/cryptopp/work/try-timecop)

Number of similar (implementation,compiler) pairs: 1, namely:
ImplementationCompiler
cryptoppg++ -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)

TIMECOP error (can be valgrind bug)


Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x10B577
   at 0x...: rotate_blk (lsh-impl.c:240)
   by 0x...: mix (lsh-impl.c:295)
   by 0x...: compress (lsh-impl.c:319)
   by 0x...: lsh256_update (lsh-impl.c:526)
   by 0x...: lsh256_digest (lsh-impl.c:586)
   by 0x...: test (try.c:90)
   by 0x...: main (try-anything.c:345)

Number of similar (implementation,compiler) pairs: 1, namely:
ImplementationCompiler
refclang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)

TIMECOP error (can be valgrind bug)


Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x10B617
   at 0x...: rotate_blk (lsh-impl.c:240)
   by 0x...: mix (lsh-impl.c:295)
   by 0x...: compress (lsh-impl.c:319)
   by 0x...: lsh256_update (lsh-impl.c:526)
   by 0x...: lsh256_digest (lsh-impl.c:586)
   by 0x...: test (try.c:90)
   by 0x...: main (try-anything.c:345)

Number of similar (implementation,compiler) pairs: 1, namely:
ImplementationCompiler
refclang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)

TIMECOP error (can be valgrind bug)


Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x10AE77
   at 0x...: rotate_blk (lsh-impl.c:240)
   by 0x...: mix (lsh-impl.c:295)
   by 0x...: compress (lsh-impl.c:319)
   by 0x...: lsh256_update (lsh-impl.c:526)
   by 0x...: lsh256_digest (lsh-impl.c:586)
   by 0x...: test (try.c:90)
   by 0x...: main (try-anything.c:345)

Number of similar (implementation,compiler) pairs: 1, namely:
ImplementationCompiler
refclang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)

TIMECOP error (can be valgrind bug)


Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x109DDD
   at 0x...: salsa20.part.0 (try-anything.c:102)
   by 0x...: salsa20 (try-anything.c:85)
   by 0x...: testvector (try-anything.c:124)
   by 0x...: myrandom (try-anything.c:132)
   by 0x...: test (try.c:85)
   by 0x...: main (try-anything.c:345)

Number of similar (implementation,compiler) pairs: 4, namely:
ImplementationCompiler
refgcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
sse2gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
ssse3gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
xopgcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)

TIMECOP error (can be valgrind bug)


Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x109A5A
   at 0x...: core (try-anything.c:53)
   by 0x...: salsa20 (try-anything.c:101)
   by 0x...: salsa20 (try-anything.c:81)
   by 0x...: testvector (try-anything.c:124)
   by 0x...: myrandom (try-anything.c:132)
   by 0x...: test (try.c:85)
   by 0x...: main (try-anything.c:345)

Number of similar (implementation,compiler) pairs: 4, namely:
ImplementationCompiler
refgcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
sse2gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
ssse3gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
xopgcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)

TIMECOP error (can be valgrind bug)


Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x10D252
   at 0x...: memset (string_fortified.h:59)
   by 0x...: lsh256_final (lsh-impl.c:574)
   by 0x...: lsh256_digest (lsh-impl.c:589)
   by 0x...: crypto_hash_lsh256_ref_constbranchindex (wrapper.c:10)
   by 0x...: test (try.c:90)
   by 0x...: main (try-anything.c:345)

Number of similar (implementation,compiler) pairs: 1, namely:
ImplementationCompiler
refgcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)

TIMECOP error (can be valgrind bug)


Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x10AF2D
   at 0x...: rotate_blk_even_alpha (lsh-impl.c:187)
   by 0x...: mix_even (lsh-impl.c:252)
   by 0x...: compress (lsh-impl.c:284)
   by 0x...: lsh256_update (lsh-impl.c:463)
   by 0x...: lsh256_digest (lsh-impl.c:534)
   by 0x...: test (try.c:90)
   by 0x...: main (try-anything.c:345)

Number of similar (implementation,compiler) pairs: 1, namely:
ImplementationCompiler
sse2clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)

TIMECOP error (can be valgrind bug)


Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x10AFDD
   at 0x...: rotate_blk_even_alpha (lsh-impl.c:187)
   by 0x...: mix_even (lsh-impl.c:252)
   by 0x...: compress (lsh-impl.c:284)
   by 0x...: lsh256_update (lsh-impl.c:463)
   by 0x...: lsh256_digest (lsh-impl.c:534)
   by 0x...: test (try.c:90)
   by 0x...: main (try-anything.c:345)

Number of similar (implementation,compiler) pairs: 1, namely:
ImplementationCompiler
sse2clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)

TIMECOP error (can be valgrind bug)


Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x10AACD
   at 0x...: rotate_blk_even_alpha (lsh-impl.c:187)
   by 0x...: mix_even (lsh-impl.c:252)
   by 0x...: compress (lsh-impl.c:284)
   by 0x...: lsh256_update (lsh-impl.c:463)
   by 0x...: lsh256_digest (lsh-impl.c:534)
   by 0x...: crypto_hash_lsh256_sse2_constbranchindex (wrapper.c:10)
   by 0x...: test (try.c:90)
   by 0x...: main (try-anything.c:345)

Number of similar (implementation,compiler) pairs: 1, namely:
ImplementationCompiler
sse2clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)

TIMECOP error (can be valgrind bug)


Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x10A832
   at 0x...: rotate_blk_even_alpha (lsh-impl.c:187)
   by 0x...: mix_even (lsh-impl.c:252)
   by 0x...: compress (lsh-impl.c:284)
   by 0x...: lsh256_update (lsh-impl.c:463)
   by 0x...: lsh256_digest (lsh-impl.c:534)
   by 0x...: test (try.c:90)
   by 0x...: main (try-anything.c:345)

Number of similar (implementation,compiler) pairs: 1, namely:
ImplementationCompiler
sse2clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)

TIMECOP error (can be valgrind bug)


Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x10BFD4
   at 0x...: memset (string_fortified.h:59)
   by 0x...: lsh256_final (lsh-impl.c:521)
   by 0x...: lsh256_digest (lsh-impl.c:537)
   by 0x...: crypto_hash_lsh256_sse2_constbranchindex (wrapper.c:10)
   by 0x...: test (try.c:90)
   by 0x...: main (try-anything.c:345)

Number of similar (implementation,compiler) pairs: 1, namely:
ImplementationCompiler
sse2gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)

TIMECOP error (can be valgrind bug)


Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x10ABFB
   at 0x...: rotate_blk_even_alpha (lsh-impl.c:201)
   by 0x...: mix_even (lsh-impl.c:239)
   by 0x...: compress (lsh-impl.c:270)
   by 0x...: lsh256_update (lsh-impl.c:453)
   by 0x...: lsh256_digest (lsh-impl.c:523)
   by 0x...: test (try.c:90)
   by 0x...: main (try-anything.c:345)

Number of similar (implementation,compiler) pairs: 1, namely:
ImplementationCompiler
ssse3clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)

TIMECOP error (can be valgrind bug)


Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x10ACAB
   at 0x...: rotate_blk_even_alpha (lsh-impl.c:201)
   by 0x...: mix_even (lsh-impl.c:239)
   by 0x...: compress (lsh-impl.c:270)
   by 0x...: lsh256_update (lsh-impl.c:453)
   by 0x...: lsh256_digest (lsh-impl.c:523)
   by 0x...: test (try.c:90)
   by 0x...: main (try-anything.c:345)

Number of similar (implementation,compiler) pairs: 1, namely:
ImplementationCompiler
ssse3clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)

TIMECOP error (can be valgrind bug)


Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x10A7B1
   at 0x...: rotate_blk_even_alpha (lsh-impl.c:201)
   by 0x...: mix_even (lsh-impl.c:239)
   by 0x...: compress (lsh-impl.c:270)
   by 0x...: lsh256_update (lsh-impl.c:453)
   by 0x...: lsh256_digest (lsh-impl.c:523)
   by 0x...: crypto_hash_lsh256_ssse3_constbranchindex (wrapper.c:10)
   by 0x...: test (try.c:90)
   by 0x...: main (try-anything.c:345)

Number of similar (implementation,compiler) pairs: 1, namely:
ImplementationCompiler
ssse3clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)

TIMECOP error (can be valgrind bug)


Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x10A50E
   at 0x...: rotate_blk_even_alpha (lsh-impl.c:201)
   by 0x...: mix_even (lsh-impl.c:239)
   by 0x...: compress (lsh-impl.c:270)
   by 0x...: lsh256_update (lsh-impl.c:453)
   by 0x...: lsh256_digest (lsh-impl.c:523)
   by 0x...: test (try.c:90)
   by 0x...: main (try-anything.c:345)

Number of similar (implementation,compiler) pairs: 1, namely:
ImplementationCompiler
ssse3clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)

TIMECOP error (can be valgrind bug)


Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x10B51E
   at 0x...: memset (string_fortified.h:59)
   by 0x...: lsh256_final (lsh-impl.c:510)
   by 0x...: lsh256_digest (lsh-impl.c:526)
   by 0x...: crypto_hash_lsh256_ssse3_constbranchindex (wrapper.c:10)
   by 0x...: test (try.c:90)
   by 0x...: main (try-anything.c:345)

Number of similar (implementation,compiler) pairs: 1, namely:
ImplementationCompiler
ssse3gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)

TIMECOP error (can be valgrind bug)


Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x10ABFB
   at 0x...: rotate_blk_even_alpha (lsh-impl.c:203)
   by 0x...: mix_even (lsh-impl.c:245)
   by 0x...: compress (lsh-impl.c:280)
   by 0x...: lsh256_update (lsh-impl.c:462)
   by 0x...: lsh256_digest (lsh-impl.c:532)
   by 0x...: test (try.c:90)
   by 0x...: main (try-anything.c:345)

Number of similar (implementation,compiler) pairs: 1, namely:
ImplementationCompiler
xopclang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)

TIMECOP error (can be valgrind bug)


Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x10ACAB
   at 0x...: rotate_blk_even_alpha (lsh-impl.c:203)
   by 0x...: mix_even (lsh-impl.c:245)
   by 0x...: compress (lsh-impl.c:280)
   by 0x...: lsh256_update (lsh-impl.c:462)
   by 0x...: lsh256_digest (lsh-impl.c:532)
   by 0x...: test (try.c:90)
   by 0x...: main (try-anything.c:345)

Number of similar (implementation,compiler) pairs: 1, namely:
ImplementationCompiler
xopclang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)

TIMECOP error (can be valgrind bug)


Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x10A7B1
   at 0x...: rotate_blk_even_alpha (lsh-impl.c:203)
   by 0x...: mix_even (lsh-impl.c:245)
   by 0x...: compress (lsh-impl.c:280)
   by 0x...: lsh256_update (lsh-impl.c:462)
   by 0x...: lsh256_digest (lsh-impl.c:532)
   by 0x...: crypto_hash_lsh256_xop_constbranchindex (wrapper.c:10)
   by 0x...: test (try.c:90)
   by 0x...: main (try-anything.c:345)

Number of similar (implementation,compiler) pairs: 1, namely:
ImplementationCompiler
xopclang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)

TIMECOP error (can be valgrind bug)


Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x10A50E
   at 0x...: rotate_blk_even_alpha (lsh-impl.c:203)
   by 0x...: mix_even (lsh-impl.c:245)
   by 0x...: compress (lsh-impl.c:280)
   by 0x...: lsh256_update (lsh-impl.c:462)
   by 0x...: lsh256_digest (lsh-impl.c:532)
   by 0x...: test (try.c:90)
   by 0x...: main (try-anything.c:345)

Number of similar (implementation,compiler) pairs: 1, namely:
ImplementationCompiler
xopclang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)

TIMECOP error (can be valgrind bug)


Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x10A702
   at 0x...: _mm_roti_epi32 (xopintrin.h:268)
   by 0x...: rotate_blk_even_alpha (lsh-impl.c:203)
   by 0x...: mix_even (lsh-impl.c:245)
   by 0x...: compress (lsh-impl.c:280)
   by 0x...: lsh256_update (lsh-impl.c:462)
   by 0x...: lsh256_digest (lsh-impl.c:532)
   by 0x...: crypto_hash_lsh256_xop_constbranchindex (wrapper.c:10)
   by 0x...: test (try.c:90)
   by 0x...: main (try-anything.c:345)

Number of similar (implementation,compiler) pairs: 1, namely:
ImplementationCompiler
xopgcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)

TIMECOP error (can be valgrind bug)


Process terminating with default action of signal 4 (SIGILL)
 Illegal opcode at address 0x10A7D3
   at 0x...: _mm_roti_epi32 (xopintrin.h:268)
   by 0x...: rotate_blk_even_alpha (lsh-impl.c:203)
   by 0x...: mix_even (lsh-impl.c:245)
   by 0x...: compress (lsh-impl.c:280)
   by 0x...: lsh256_update (lsh-impl.c:462)
   by 0x...: lsh256_digest (lsh-impl.c:532)
   by 0x...: test (try.c:90)
   by 0x...: main (try-anything.c:345)

Number of similar (implementation,compiler) pairs: 1, namely:
ImplementationCompiler
xopgcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)

Passed TIMECOP


TIMECOP iterations: 10

Number of similar (implementation,compiler) pairs: 6, namely:
ImplementationCompiler
refclang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
refclang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
refgcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
sse2clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
sse2gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
ssse3gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)