Implementation notes: amd64, renoir, crypto_sign/falcon1024dyn

Computer: renoir
Microarchitecture: amd64; Zen 2 (860f01)
Architecture: amd64
CPU ID: AuthenticAMD-00860f01-178bfbff
SUPERCOP version: 20240716
Operation: crypto_sign
Primitive: falcon1024dyn
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
1656129144447 0 0165126 836 1752T:avx2clang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
1685655135671 0 0156206 836 1752T:avx2clang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
1733644144999 0 0166022 836 1720T:avx2clang_-mcpu=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
185507580915 0 099756 828 1720T:avx2clang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
186804789654 0 0110090 796 1784T:avx2gcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
1892688158829 0 0180842 796 1784T:avx2gcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
190437383449 0 0103450 796 1784T:avx2gcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
2233560131214 0 0151934 836 1752T:fpuclang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
2250519139958 0 0160790 836 1752T:fpuclang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
2250928156346 0 0178394 796 1784T:fpugcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
234965372918 0 091930 788 1752T:avx2gcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
2405557140706 0 0161942 836 1720T:fpuclang_-mcpu=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
253163781117 0 0101202 796 1784T:fpugcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
255716587193 0 0107658 796 1784T:fpugcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
259522278278 0 097132 820 1720T:fpuclang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
274449285913 0 0106124 828 1720T:avx2clang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
294770070857 0 089922 788 1752T:fpugcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
567439384336 0 0104548 820 1720T:fpuclang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
35000605147512 0 0169642 796 1784T:refgcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
3532813093628 0 0114138 796 1784T:refgcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
35678898145781 0 0167022 828 1720T:refclang_-mcpu=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
3576755077789 0 097002 788 1752T:refgcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
3600499389665 0 0109746 796 1784T:refgcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
3698197088709 0 0107676 820 1720T:refclang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
3956276885650 0 0105956 820 1720T:refclang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
39601278134332 0 0155326 828 1752T:refclang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
39653998143050 0 0164238 828 1752T:refclang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716

Namespace violations


codec.o falcon1024dyn_avx2_comp_decode T
codec.o falcon1024dyn_avx2_comp_encode T
codec.o falcon1024dyn_avx2_max_FG_bits R
codec.o falcon1024dyn_avx2_max_fg_bits R
codec.o falcon1024dyn_avx2_max_sig_bits R
codec.o falcon1024dyn_avx2_modq_decode T
codec.o falcon1024dyn_avx2_modq_encode T
codec.o falcon1024dyn_avx2_trim_i16_decode T
codec.o falcon1024dyn_avx2_trim_i16_encode T
codec.o falcon1024dyn_avx2_trim_i8_decode T
codec.o falcon1024dyn_avx2_trim_i8_encode T
common.o falcon1024dyn_avx2_hash_to_point_ct T
common.o falcon1024dyn_avx2_hash_to_point_vartime T
common.o falcon1024dyn_avx2_is_short T
common.o falcon1024dyn_avx2_is_short_half T
fft.o falcon1024dyn_avx2_FFT T
fft.o falcon1024dyn_avx2_iFFT T
fft.o falcon1024dyn_avx2_poly_LDL_fft T
fft.o falcon1024dyn_avx2_poly_LDLmv_fft T
fft.o falcon1024dyn_avx2_poly_add T
fft.o falcon1024dyn_avx2_poly_add_muladj_fft T
fft.o falcon1024dyn_avx2_poly_adj_fft T
fft.o falcon1024dyn_avx2_poly_div_autoadj_fft T
fft.o falcon1024dyn_avx2_poly_div_fft T
fft.o falcon1024dyn_avx2_poly_invnorm2_fft T
fft.o falcon1024dyn_avx2_poly_merge_fft T
fft.o falcon1024dyn_avx2_poly_mul_autoadj_fft T
fft.o falcon1024dyn_avx2_poly_mul_fft T
fft.o falcon1024dyn_avx2_poly_muladj_fft T
fft.o falcon1024dyn_avx2_poly_mulconst T
fft.o falcon1024dyn_avx2_poly_mulselfadj_fft T
fft.o falcon1024dyn_avx2_poly_neg T
fft.o falcon1024dyn_avx2_poly_split_fft T
fft.o falcon1024dyn_avx2_poly_sub T
fpr.o falcon1024dyn_avx2_fpr_gm_tab R
fpr.o falcon1024dyn_avx2_fpr_p2_tab R
keygen.o falcon1024dyn_avx2_keygen T
rng.o falcon1024dyn_avx2_get_seed T
rng.o falcon1024dyn_avx2_prng_get_bytes T
rng.o falcon1024dyn_avx2_prng_init T
rng.o falcon1024dyn_avx2_prng_refill T
shake.o falcon1024dyn_avx2_i_shake256_extract T
shake.o falcon1024dyn_avx2_i_shake256_flip T
shake.o falcon1024dyn_avx2_i_shake256_init T
shake.o falcon1024dyn_avx2_i_shake256_inject T
sign.o falcon1024dyn_avx2_expand_privkey T
sign.o falcon1024dyn_avx2_gaussian0_sampler T
sign.o falcon1024dyn_avx2_sampler T
sign.o falcon1024dyn_avx2_sign_dyn T
sign.o falcon1024dyn_avx2_sign_tree T
vrfy.o falcon1024dyn_avx2_complete_private T
vrfy.o falcon1024dyn_avx2_compute_public T
vrfy.o falcon1024dyn_avx2_count_nttzero T
vrfy.o falcon1024dyn_avx2_is_invertible T
vrfy.o falcon1024dyn_avx2_to_ntt_monty T
vrfy.o falcon1024dyn_avx2_verify_raw T
vrfy.o falcon1024dyn_avx2_verify_recover T

Number of similar (implementation,compiler) pairs: 9, namely:
ImplementationCompiler
T:avx2clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:avx2clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:avx2clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:avx2clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:avx2clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:avx2gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:avx2gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:avx2gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:avx2gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)

Namespace violations


codec.o falcon1024dyn_fpu_comp_decode T
codec.o falcon1024dyn_fpu_comp_encode T
codec.o falcon1024dyn_fpu_max_FG_bits R
codec.o falcon1024dyn_fpu_max_fg_bits R
codec.o falcon1024dyn_fpu_max_sig_bits R
codec.o falcon1024dyn_fpu_modq_decode T
codec.o falcon1024dyn_fpu_modq_encode T
codec.o falcon1024dyn_fpu_trim_i16_decode T
codec.o falcon1024dyn_fpu_trim_i16_encode T
codec.o falcon1024dyn_fpu_trim_i8_decode T
codec.o falcon1024dyn_fpu_trim_i8_encode T
common.o falcon1024dyn_fpu_hash_to_point_ct T
common.o falcon1024dyn_fpu_hash_to_point_vartime T
common.o falcon1024dyn_fpu_is_short T
common.o falcon1024dyn_fpu_is_short_half T
fft.o falcon1024dyn_fpu_FFT T
fft.o falcon1024dyn_fpu_iFFT T
fft.o falcon1024dyn_fpu_poly_LDL_fft T
fft.o falcon1024dyn_fpu_poly_LDLmv_fft T
fft.o falcon1024dyn_fpu_poly_add T
fft.o falcon1024dyn_fpu_poly_add_muladj_fft T
fft.o falcon1024dyn_fpu_poly_adj_fft T
fft.o falcon1024dyn_fpu_poly_div_autoadj_fft T
fft.o falcon1024dyn_fpu_poly_div_fft T
fft.o falcon1024dyn_fpu_poly_invnorm2_fft T
fft.o falcon1024dyn_fpu_poly_merge_fft T
fft.o falcon1024dyn_fpu_poly_mul_autoadj_fft T
fft.o falcon1024dyn_fpu_poly_mul_fft T
fft.o falcon1024dyn_fpu_poly_muladj_fft T
fft.o falcon1024dyn_fpu_poly_mulconst T
fft.o falcon1024dyn_fpu_poly_mulselfadj_fft T
fft.o falcon1024dyn_fpu_poly_neg T
fft.o falcon1024dyn_fpu_poly_split_fft T
fft.o falcon1024dyn_fpu_poly_sub T
fpr.o falcon1024dyn_fpu_fpr_gm_tab R
fpr.o falcon1024dyn_fpu_fpr_p2_tab R
keygen.o falcon1024dyn_fpu_keygen T
rng.o falcon1024dyn_fpu_get_seed T
rng.o falcon1024dyn_fpu_prng_get_bytes T
rng.o falcon1024dyn_fpu_prng_init T
rng.o falcon1024dyn_fpu_prng_refill T
shake.o falcon1024dyn_fpu_i_shake256_extract T
shake.o falcon1024dyn_fpu_i_shake256_flip T
shake.o falcon1024dyn_fpu_i_shake256_init T
shake.o falcon1024dyn_fpu_i_shake256_inject T
sign.o falcon1024dyn_fpu_expand_privkey T
sign.o falcon1024dyn_fpu_gaussian0_sampler T
sign.o falcon1024dyn_fpu_sampler T
sign.o falcon1024dyn_fpu_sign_dyn T
sign.o falcon1024dyn_fpu_sign_tree T
vrfy.o falcon1024dyn_fpu_complete_private T
vrfy.o falcon1024dyn_fpu_compute_public T
vrfy.o falcon1024dyn_fpu_count_nttzero T
vrfy.o falcon1024dyn_fpu_is_invertible T
vrfy.o falcon1024dyn_fpu_to_ntt_monty T
vrfy.o falcon1024dyn_fpu_verify_raw T
vrfy.o falcon1024dyn_fpu_verify_recover T

Number of similar (implementation,compiler) pairs: 9, namely:
ImplementationCompiler
T:fpuclang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:fpuclang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:fpuclang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:fpuclang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:fpuclang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:fpugcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:fpugcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:fpugcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:fpugcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)

Namespace violations


codec.o falcon1024dyn_ref_comp_decode T
codec.o falcon1024dyn_ref_comp_encode T
codec.o falcon1024dyn_ref_max_FG_bits R
codec.o falcon1024dyn_ref_max_fg_bits R
codec.o falcon1024dyn_ref_max_sig_bits R
codec.o falcon1024dyn_ref_modq_decode T
codec.o falcon1024dyn_ref_modq_encode T
codec.o falcon1024dyn_ref_trim_i16_decode T
codec.o falcon1024dyn_ref_trim_i16_encode T
codec.o falcon1024dyn_ref_trim_i8_decode T
codec.o falcon1024dyn_ref_trim_i8_encode T
common.o falcon1024dyn_ref_hash_to_point_ct T
common.o falcon1024dyn_ref_hash_to_point_vartime T
common.o falcon1024dyn_ref_is_short T
common.o falcon1024dyn_ref_is_short_half T
fft.o falcon1024dyn_ref_FFT T
fft.o falcon1024dyn_ref_iFFT T
fft.o falcon1024dyn_ref_poly_LDL_fft T
fft.o falcon1024dyn_ref_poly_LDLmv_fft T
fft.o falcon1024dyn_ref_poly_add T
fft.o falcon1024dyn_ref_poly_add_muladj_fft T
fft.o falcon1024dyn_ref_poly_adj_fft T
fft.o falcon1024dyn_ref_poly_div_autoadj_fft T
fft.o falcon1024dyn_ref_poly_div_fft T
fft.o falcon1024dyn_ref_poly_invnorm2_fft T
fft.o falcon1024dyn_ref_poly_merge_fft T
fft.o falcon1024dyn_ref_poly_mul_autoadj_fft T
fft.o falcon1024dyn_ref_poly_mul_fft T
fft.o falcon1024dyn_ref_poly_muladj_fft T
fft.o falcon1024dyn_ref_poly_mulconst T
fft.o falcon1024dyn_ref_poly_mulselfadj_fft T
fft.o falcon1024dyn_ref_poly_neg T
fft.o falcon1024dyn_ref_poly_split_fft T
fft.o falcon1024dyn_ref_poly_sub T
fpr.o falcon1024dyn_ref_fpr_add T
fpr.o falcon1024dyn_ref_fpr_div T
fpr.o falcon1024dyn_ref_fpr_expm_p63 T
fpr.o falcon1024dyn_ref_fpr_gm_tab R
fpr.o falcon1024dyn_ref_fpr_mul T
fpr.o falcon1024dyn_ref_fpr_p2_tab R
fpr.o falcon1024dyn_ref_fpr_scaled T
fpr.o falcon1024dyn_ref_fpr_sqrt T
keygen.o falcon1024dyn_ref_keygen T
rng.o falcon1024dyn_ref_get_seed T
rng.o falcon1024dyn_ref_prng_get_bytes T
rng.o falcon1024dyn_ref_prng_init T
rng.o falcon1024dyn_ref_prng_refill T
shake.o falcon1024dyn_ref_i_shake256_extract T
shake.o falcon1024dyn_ref_i_shake256_flip T
shake.o falcon1024dyn_ref_i_shake256_init T
shake.o falcon1024dyn_ref_i_shake256_inject T
sign.o falcon1024dyn_ref_expand_privkey T
sign.o falcon1024dyn_ref_gaussian0_sampler T
sign.o falcon1024dyn_ref_sampler T
sign.o falcon1024dyn_ref_sign_dyn T
sign.o falcon1024dyn_ref_sign_tree T
vrfy.o falcon1024dyn_ref_complete_private T
vrfy.o falcon1024dyn_ref_compute_public T
vrfy.o falcon1024dyn_ref_count_nttzero T
vrfy.o falcon1024dyn_ref_is_invertible T
vrfy.o falcon1024dyn_ref_to_ntt_monty T
vrfy.o falcon1024dyn_ref_verify_raw T
vrfy.o falcon1024dyn_ref_verify_recover T

Number of similar (implementation,compiler) pairs: 9, namely:
ImplementationCompiler
T:refclang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:refclang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:refclang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:refclang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:refclang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:refgcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:refgcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:refgcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:refgcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)