Implementation notes: amd64, r24000, crypto_kem/sikep434comp

Computer: r24000
Microarchitecture: amd64; Coffee Lake (906ea)
Architecture: amd64
CPU ID: GenuineIntel-000906ea-bfebfbff
SUPERCOP version: 20240107
Operation: crypto_kem
Primitive: sikep434comp
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
25821543444366 0 8467853 812 1744T:mulxadxclang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2024011920240107
25893792448102 0 8471453 812 1744T:mulxadxclang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2024011920240107
25940555455247 0 8477781 812 1744T:mulxadxclang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2024011920240107
25948927444366 0 8467853 812 1744T:mulxadxclang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2024011920240107
26267131370866 0 8391463 804 1744T:mulxadxclang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2024011920240107
26772906447713 0 8471069 812 1744T:mulxclang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2024011920240107
26939751443977 0 8467469 812 1744T:mulxclang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2024011920240107
26939843454858 0 8477397 812 1744T:mulxclang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2024011920240107
26977218443977 0 8467469 812 1744T:mulxclang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2024011920240107
27019809461006 0 8486408 812 1808T:mulxadxgcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2024011920240107
27377420370477 0 8391063 804 1744T:mulxclang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2024011920240107
27715683391617 0 8414408 812 1808T:mulxadxgcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2024011920240107
28063826460617 0 8486024 812 1808T:mulxgcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2024011920240107
28403139378778 0 8401504 812 1808T:mulxadxgcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2024011920240107
28631907348646 0 8369728 804 1776T:mulxadxgcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2024011920240107
28992026391228 0 8414024 812 1808T:mulxgcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2024011920240107
29150904378389 0 8401120 812 1808T:mulxgcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2024011920240107
29685206348257 0 8369328 804 1776T:mulxgcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2024011920240107
240132964425103 0 8448597 812 1744T:amd64clang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2024011920240107
240232805425103 0 8448597 812 1744T:amd64clang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2024011920240107
246726717432135 0 8455485 812 1744T:amd64clang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2024011920240107
247372470438460 0 8460981 812 1744T:amd64clang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2024011920240107
257469955468085 0 8493520 812 1808T:amd64gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2024011920240107
283480193374782 0 8395383 804 1744T:amd64clang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2024011920240107
294679254387962 0 8410752 812 1808T:amd64gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2024011920240107
299248699379813 0 8402544 812 1808T:amd64gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2024011920240107
468016377347458 0 8368552 804 1776T:amd64gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2024011920240107

Compiler output

Implementation: T:amd64asm
Security model: timingleaks
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
fp_x64_asm.S: fp_x64_asm.S:635:3: error: "CONFIGURATION NOT SUPPORTED. TRY USE_MULX=TRUE"
fp_x64_asm.S: # error "CONFIGURATION NOT SUPPORTED. TRY USE_MULX=TRUE"
fp_x64_asm.S: ^
fp_x64_asm.S: fp_x64_asm.S:826:5: error: "CONFIGURATION NOT SUPPORTED. TRY USE_MULX=TRUE"
fp_x64_asm.S: # error "CONFIGURATION NOT SUPPORTED. TRY USE_MULX=TRUE"
fp_x64_asm.S: ^
fp_x64_asm.S: 2 errors generated.

Number of similar (compiler,implementation) pairs: 5, namely:
CompilerImplementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:amd64asm
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:amd64asm
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:amd64asm
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:amd64asm
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:amd64asm

Compiler output

Implementation: T:amd64asm
Security model: timingleaks
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE
fp_x64_asm.S: fp_x64_asm.S:635:3: error: #error "CONFIGURATION NOT SUPPORTED. TRY USE_MULX=TRUE"
fp_x64_asm.S: 635 | # error "CONFIGURATION NOT SUPPORTED. TRY USE_MULX=TRUE"
fp_x64_asm.S: | ^~~~~
fp_x64_asm.S: fp_x64_asm.S:826:5: error: #error "CONFIGURATION NOT SUPPORTED. TRY USE_MULX=TRUE"
fp_x64_asm.S: 826 | # error "CONFIGURATION NOT SUPPORTED. TRY USE_MULX=TRUE"
fp_x64_asm.S: | ^~~~~

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:amd64asm
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:amd64asm
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE T:amd64asm
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE T:amd64asm

Namespace violations

Implementation: T:amd64
Security model: timingleaks
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
P434_compressed.o A_basis_zero R
P434_compressed.o A_gen R
P434_compressed.o Alice_order R
P434_compressed.o B_basis_zero R
P434_compressed.o B_gen R
P434_compressed.o B_gen_3_tors R
P434_compressed.o Bob_order R
P434_compressed.o CompleteMPoint T
P434_compressed.o Double T
P434_compressed.o EphemeralKeyGeneration_A_SIDHp434_Compressed T
P434_compressed.o EphemeralKeyGeneration_B_SIDHp434_Compressed T
P434_compressed.o EphemeralSecretAgreement_A_SIDHp434_Compressed T
P434_compressed.o EphemeralSecretAgreement_B_SIDHp434_Compressed T
P434_compressed.o FormatPrivKey_B T
P434_compressed.o Ladder T
P434_compressed.o Montgomery_R2 R
P434_compressed.o Montgomery_RB1 R
P434_compressed.o Montgomery_RB2 R
P434_compressed.o Montgomery_inversion_mod_order_bingcd T
P434_compressed.o Montgomery_multiply_mod_order T
P434_compressed.o Montgomery_neg T
P434_compressed.o Montgomery_one R
P434_compressed.o T_tate2_P R
P434_compressed.o T_tate2_Q R
P434_compressed.o T_tate2_firststep_P R
P434_compressed.o T_tate2_firststep_Q R
P434_compressed.o T_tate3 R
P434_compressed.o Tate2_pairings T
P434_compressed.o Tate3_pairings T
P434_compressed.o Traverse_w_div_e_fullsigned T
P434_compressed.o Traverse_w_div_e_torus T
P434_compressed.o Traverse_w_notdiv_e_fullsigned T
P434_compressed.o U3 R
P434_compressed.o XQB3 R
P434_compressed.o clear_words T
P434_compressed.o cmp_f2elm T
P434_compressed.o copy_words T
P434_compressed.o ct_cmov T
P434_compressed.o ct_compare T
P434_compressed.o cube_Fp2_cycl T
P434_compressed.o eval_3_isog T
P434_compressed.o eval_4_isog T
P434_compressed.o fp2_conj T
P434_compressed.o fp2add434 T
P434_compressed.o fp2copy434 T
P434_compressed.o fp2correction434 T
P434_compressed.o fp2div2_434 T
P434_compressed.o fp2inv434_mont T
P434_compressed.o fp2inv434_mont_bingcd T
P434_compressed.o fp2mul434_mont T
P434_compressed.o fp2neg434 T
P434_compressed.o fp2shl T
P434_compressed.o fp2sqr434_mont T
P434_compressed.o fp2sub434 T
P434_compressed.o fp2zero434 T
P434_compressed.o fpcopy434 T
P434_compressed.o fpinv434_chain_mont T
P434_compressed.o fpinv434_mont T
P434_compressed.o fpinv434_mont_bingcd T
P434_compressed.o fpmul434_mont T
P434_compressed.o fpsqr434_mont T
P434_compressed.o fpzero434 T
P434_compressed.o from_Montgomery_mod_order T
P434_compressed.o from_base T
P434_compressed.o from_fp2mont T
P434_compressed.o from_mont T
P434_compressed.o fromproj T
P434_compressed.o g_R_S_im R
P434_compressed.o g_phiR_phiS_im R
P434_compressed.o g_phiR_phiS_re R
P434_compressed.o get_3_isog T
P434_compressed.o get_4_isog T
P434_compressed.o get_A T
P434_compressed.o highest_2t T
P434_compressed.o highest_t T
P434_compressed.o inv_3_way T
P434_compressed.o inv_Fp2_cycl_proj T
P434_compressed.o inv_mod_orderA T
P434_compressed.o is_sqr_fp2 T
P434_compressed.o j_inv T
P434_compressed.o mod T
P434_compressed.o mod3 T
P434_compressed.o mont_n_way_inv T
P434_compressed.o mp_add T
P434_compressed.o mp_shiftl1 T
P434_compressed.o mp_shiftleft T
P434_compressed.o mp_shiftr1 T
P434_compressed.o mp_sub T
P434_compressed.o mul3 T
P434_compressed.o mulmixed_montproj T
P434_compressed.o multiply T
P434_compressed.o ord2w_dlog T
P434_compressed.o ord2w_dloghyb T
P434_compressed.o p434 R
P434_compressed.o p434p1 R
P434_compressed.o p434x16p R
P434_compressed.o p434x2 R
P434_compressed.o p434x4 R
P434_compressed.o ph2_CT R
P434_compressed.o ph2_G R
P434_compressed.o ph2_Log R
P434_compressed.o ph2_Texp R
P434_compressed.o ph2_path R
P434_compressed.o ph3_T B
P434_compressed.o ph3_T1 R
P434_compressed.o ph3_T2 R
P434_compressed.o ph3_path R
P434_compressed.o random_mod_order_A_SIDHp434 T
P434_compressed.o random_mod_order_B_SIDHp434 T
P434_compressed.o recover_os T
P434_compressed.o reverse_bits T
P434_compressed.o solve_dlog T
P434_compressed.o sqr_Fp2_cycl T
P434_compressed.o sqr_Fp2_cycl_proj T
P434_compressed.o sqrt_Fp2 T
P434_compressed.o strat_Alice R
P434_compressed.o strat_Bob R
P434_compressed.o table_r_qnr R
P434_compressed.o table_r_qr R
P434_compressed.o table_v_qnr R
P434_compressed.o table_v_qr R
P434_compressed.o threeinv R
P434_compressed.o to_Montgomery_mod_order T
P434_compressed.o to_fp2mont T
P434_compressed.o to_mont T
P434_compressed.o toproj T
P434_compressed.o u_entang R
P434_compressed.o v_3_torsion R
P434_compressed.o validate_ciphertext T
P434_compressed.o xDBL T
P434_compressed.o xDBLADD T
P434_compressed.o xDBL_e T
P434_compressed.o xDBLe T
P434_compressed.o xTPL T
P434_compressed.o xTPL_fast T
P434_compressed.o xTPLe T
P434_compressed.o xTPLe_fast T
fips202.o KeccakF1600_StatePermute T
fips202.o shake128 T
fips202.o shake128_absorb T
fips202.o shake128_squeezeblocks T
fips202.o shake256 T
fips202.o shake256_absorb T
fips202.o shake256_squeezeblocks T
fp_generic.o digit_x_digit T
fp_generic.o fpadd434 T
fp_generic.o fpcorrection434 T
fp_generic.o fpdiv2_434 T
fp_generic.o fpneg434 T
fp_generic.o fpsub434 T
fp_generic.o mp_mul T
fp_generic.o mp_sub434_p2 T
fp_generic.o mp_sub434_p4 T
fp_generic.o rdc_mont T

Number of similar (compiler,implementation) pairs: 9, namely:
CompilerImplementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:amd64
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:amd64
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:amd64
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:amd64
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:amd64
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:amd64
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:amd64
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE T:amd64
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE T:amd64

Namespace violations

Implementation: T:mulx
Security model: timingleaks
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
P434_compressed.o A_basis_zero R
P434_compressed.o A_gen R
P434_compressed.o Alice_order R
P434_compressed.o B_basis_zero R
P434_compressed.o B_gen R
P434_compressed.o B_gen_3_tors R
P434_compressed.o Bob_order R
P434_compressed.o CompleteMPoint T
P434_compressed.o Double T
P434_compressed.o EphemeralKeyGeneration_A_SIDHp434_Compressed T
P434_compressed.o EphemeralKeyGeneration_B_SIDHp434_Compressed T
P434_compressed.o EphemeralSecretAgreement_A_SIDHp434_Compressed T
P434_compressed.o EphemeralSecretAgreement_B_SIDHp434_Compressed T
P434_compressed.o FormatPrivKey_B T
P434_compressed.o Ladder T
P434_compressed.o Montgomery_R2 R
P434_compressed.o Montgomery_RB1 R
P434_compressed.o Montgomery_RB2 R
P434_compressed.o Montgomery_inversion_mod_order_bingcd T
P434_compressed.o Montgomery_multiply_mod_order T
P434_compressed.o Montgomery_neg T
P434_compressed.o Montgomery_one R
P434_compressed.o T_tate2_P R
P434_compressed.o T_tate2_Q R
P434_compressed.o T_tate2_firststep_P R
P434_compressed.o T_tate2_firststep_Q R
P434_compressed.o T_tate3 R
P434_compressed.o Tate2_pairings T
P434_compressed.o Tate3_pairings T
P434_compressed.o Traverse_w_div_e_fullsigned T
P434_compressed.o Traverse_w_div_e_torus T
P434_compressed.o Traverse_w_notdiv_e_fullsigned T
P434_compressed.o U3 R
P434_compressed.o XQB3 R
P434_compressed.o clear_words T
P434_compressed.o cmp_f2elm T
P434_compressed.o copy_words T
P434_compressed.o ct_cmov T
P434_compressed.o ct_compare T
P434_compressed.o cube_Fp2_cycl T
P434_compressed.o eval_3_isog T
P434_compressed.o eval_4_isog T
P434_compressed.o fp2_conj T
P434_compressed.o fp2add434 T
P434_compressed.o fp2copy434 T
P434_compressed.o fp2correction434 T
P434_compressed.o fp2div2_434 T
P434_compressed.o fp2inv434_mont T
P434_compressed.o fp2inv434_mont_bingcd T
P434_compressed.o fp2mul434_mont T
P434_compressed.o fp2neg434 T
P434_compressed.o fp2shl T
P434_compressed.o fp2sqr434_mont T
P434_compressed.o fp2sub434 T
P434_compressed.o fp2zero434 T
P434_compressed.o fpcopy434 T
P434_compressed.o fpinv434_chain_mont T
P434_compressed.o fpinv434_mont T
P434_compressed.o fpinv434_mont_bingcd T
P434_compressed.o fpmul434_mont T
P434_compressed.o fpsqr434_mont T
P434_compressed.o fpzero434 T
P434_compressed.o from_Montgomery_mod_order T
P434_compressed.o from_base T
P434_compressed.o from_fp2mont T
P434_compressed.o from_mont T
P434_compressed.o fromproj T
P434_compressed.o g_R_S_im R
P434_compressed.o g_phiR_phiS_im R
P434_compressed.o g_phiR_phiS_re R
P434_compressed.o get_3_isog T
P434_compressed.o get_4_isog T
P434_compressed.o get_A T
P434_compressed.o highest_2t T
P434_compressed.o highest_t T
P434_compressed.o inv_3_way T
P434_compressed.o inv_Fp2_cycl_proj T
P434_compressed.o inv_mod_orderA T
P434_compressed.o is_sqr_fp2 T
P434_compressed.o j_inv T
P434_compressed.o mod T
P434_compressed.o mod3 T
P434_compressed.o mont_n_way_inv T
P434_compressed.o mp_add T
P434_compressed.o mp_shiftl1 T
P434_compressed.o mp_shiftleft T
P434_compressed.o mp_shiftr1 T
P434_compressed.o mp_sub T
P434_compressed.o mul3 T
P434_compressed.o mulmixed_montproj T
P434_compressed.o multiply T
P434_compressed.o ord2w_dlog T
P434_compressed.o ord2w_dloghyb T
P434_compressed.o p434 R
P434_compressed.o p434p1 R
P434_compressed.o p434x16p R
P434_compressed.o p434x2 R
P434_compressed.o p434x4 R
P434_compressed.o ph2_CT R
P434_compressed.o ph2_G R
P434_compressed.o ph2_Log R
P434_compressed.o ph2_Texp R
P434_compressed.o ph2_path R
P434_compressed.o ph3_T B
P434_compressed.o ph3_T1 R
P434_compressed.o ph3_T2 R
P434_compressed.o ph3_path R
P434_compressed.o random_mod_order_A_SIDHp434 T
P434_compressed.o random_mod_order_B_SIDHp434 T
P434_compressed.o recover_os T
P434_compressed.o reverse_bits T
P434_compressed.o solve_dlog T
P434_compressed.o sqr_Fp2_cycl T
P434_compressed.o sqr_Fp2_cycl_proj T
P434_compressed.o sqrt_Fp2 T
P434_compressed.o strat_Alice R
P434_compressed.o strat_Bob R
P434_compressed.o table_r_qnr R
P434_compressed.o table_r_qr R
P434_compressed.o table_v_qnr R
P434_compressed.o table_v_qr R
P434_compressed.o threeinv R
P434_compressed.o to_Montgomery_mod_order T
P434_compressed.o to_fp2mont T
P434_compressed.o to_mont T
P434_compressed.o toproj T
P434_compressed.o u_entang R
P434_compressed.o v_3_torsion R
P434_compressed.o validate_ciphertext T
P434_compressed.o xDBL T
P434_compressed.o xDBLADD T
P434_compressed.o xDBL_e T
P434_compressed.o xDBLe T
P434_compressed.o xTPL T
P434_compressed.o xTPL_fast T
P434_compressed.o xTPLe T
P434_compressed.o xTPLe_fast T
fips202.o KeccakF1600_StatePermute T
fips202.o shake128 T
fips202.o shake128_absorb T
fips202.o shake128_squeezeblocks T
fips202.o shake256 T
fips202.o shake256_absorb T
fips202.o shake256_squeezeblocks T
fp_x64.o fpadd434 T
fp_x64.o fpcorrection434 T
fp_x64.o fpdiv2_434 T
fp_x64.o fpneg434 T
fp_x64.o fpsub434 T
fp_x64.o mp_mul T
fp_x64.o mp_sub434_p2 T
fp_x64.o mp_sub434_p4 T
fp_x64.o rdc_mont T
fp_x64_asm.o fpadd434_asm T
fp_x64_asm.o fpsub434_asm T
fp_x64_asm.o mp_add434_asm T
fp_x64_asm.o mp_dblsub434x2_asm T
fp_x64_asm.o mp_sub434_p2_asm T
fp_x64_asm.o mp_sub434_p4_asm T
fp_x64_asm.o mp_subadd434x2_asm T
fp_x64_asm.o mul434_asm T
fp_x64_asm.o rdc434_asm T

Number of similar (compiler,implementation) pairs: 18, namely:
CompilerImplementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:mulx
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:mulx
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:mulx
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:mulx
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:mulx
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:mulx
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:mulx
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE T:mulx
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE T:mulx
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:mulxadx
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:mulxadx
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:mulxadx
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:mulxadx
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:mulxadx
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:mulxadx
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:mulxadx
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE T:mulxadx
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE T:mulxadx