Implementation notes: amd64, renoir, crypto_sign/falcon512dyn

Computer: renoir
Microarchitecture: amd64; Zen 2 (860f01)
Architecture: amd64
CPU ID: AuthenticAMD-00860f01-178bfbff
SUPERCOP version: 20240716
Operation: crypto_sign
Primitive: falcon512dyn
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
839036144447 0 0165126 836 1752T:avx2clang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
844313135671 0 0156206 836 1752T:avx2clang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
869633144999 0 0166022 836 1720T:avx2clang_-mcpu=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
92334080915 0 099756 828 1720T:avx2clang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
93428789654 0 0110090 796 1784T:avx2gcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
947269158829 0 0180842 796 1784T:avx2gcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
94824783449 0 0103450 796 1784T:avx2gcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
1130125156346 0 0178394 796 1784T:fpugcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
1136823131214 0 0151934 836 1752T:fpuclang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
1142598139958 0 0160758 836 1752T:fpuclang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
116947272918 0 091930 788 1752T:avx2gcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
1214420140706 0 0161926 836 1720T:fpuclang_-mcpu=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
126613381117 0 0101202 796 1784T:fpugcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
127797787193 0 0107658 796 1784T:fpugcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
129098678278 0 097132 820 1720T:fpuclang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
135972885913 0 0106124 828 1720T:avx2clang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
150463070857 0 089890 788 1752T:fpugcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
269329684336 0 0104532 820 1720T:fpuclang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
16051851147512 0 0169642 796 1784T:refgcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
1620771693628 0 0114138 796 1784T:refgcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
16349471145781 0 0167006 828 1720T:refclang_-mcpu=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
1641856277789 0 096970 788 1752T:refgcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
1651255989665 0 0109746 796 1784T:refgcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
1694842488709 0 0107676 820 1720T:refclang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
18105722134332 0 0155326 828 1752T:refclang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
1812172085650 0 0105940 820 1720T:refclang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
18127824143050 0 0164206 828 1752T:refclang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716

Namespace violations


codec.o falcon512dyn_avx2_comp_decode T
codec.o falcon512dyn_avx2_comp_encode T
codec.o falcon512dyn_avx2_max_FG_bits R
codec.o falcon512dyn_avx2_max_fg_bits R
codec.o falcon512dyn_avx2_max_sig_bits R
codec.o falcon512dyn_avx2_modq_decode T
codec.o falcon512dyn_avx2_modq_encode T
codec.o falcon512dyn_avx2_trim_i16_decode T
codec.o falcon512dyn_avx2_trim_i16_encode T
codec.o falcon512dyn_avx2_trim_i8_decode T
codec.o falcon512dyn_avx2_trim_i8_encode T
common.o falcon512dyn_avx2_hash_to_point_ct T
common.o falcon512dyn_avx2_hash_to_point_vartime T
common.o falcon512dyn_avx2_is_short T
common.o falcon512dyn_avx2_is_short_half T
fft.o falcon512dyn_avx2_FFT T
fft.o falcon512dyn_avx2_iFFT T
fft.o falcon512dyn_avx2_poly_LDL_fft T
fft.o falcon512dyn_avx2_poly_LDLmv_fft T
fft.o falcon512dyn_avx2_poly_add T
fft.o falcon512dyn_avx2_poly_add_muladj_fft T
fft.o falcon512dyn_avx2_poly_adj_fft T
fft.o falcon512dyn_avx2_poly_div_autoadj_fft T
fft.o falcon512dyn_avx2_poly_div_fft T
fft.o falcon512dyn_avx2_poly_invnorm2_fft T
fft.o falcon512dyn_avx2_poly_merge_fft T
fft.o falcon512dyn_avx2_poly_mul_autoadj_fft T
fft.o falcon512dyn_avx2_poly_mul_fft T
fft.o falcon512dyn_avx2_poly_muladj_fft T
fft.o falcon512dyn_avx2_poly_mulconst T
fft.o falcon512dyn_avx2_poly_mulselfadj_fft T
fft.o falcon512dyn_avx2_poly_neg T
fft.o falcon512dyn_avx2_poly_split_fft T
fft.o falcon512dyn_avx2_poly_sub T
fpr.o falcon512dyn_avx2_fpr_gm_tab R
fpr.o falcon512dyn_avx2_fpr_p2_tab R
keygen.o falcon512dyn_avx2_keygen T
rng.o falcon512dyn_avx2_get_seed T
rng.o falcon512dyn_avx2_prng_get_bytes T
rng.o falcon512dyn_avx2_prng_init T
rng.o falcon512dyn_avx2_prng_refill T
shake.o falcon512dyn_avx2_i_shake256_extract T
shake.o falcon512dyn_avx2_i_shake256_flip T
shake.o falcon512dyn_avx2_i_shake256_init T
shake.o falcon512dyn_avx2_i_shake256_inject T
sign.o falcon512dyn_avx2_expand_privkey T
sign.o falcon512dyn_avx2_gaussian0_sampler T
sign.o falcon512dyn_avx2_sampler T
sign.o falcon512dyn_avx2_sign_dyn T
sign.o falcon512dyn_avx2_sign_tree T
vrfy.o falcon512dyn_avx2_complete_private T
vrfy.o falcon512dyn_avx2_compute_public T
vrfy.o falcon512dyn_avx2_count_nttzero T
vrfy.o falcon512dyn_avx2_is_invertible T
vrfy.o falcon512dyn_avx2_to_ntt_monty T
vrfy.o falcon512dyn_avx2_verify_raw T
vrfy.o falcon512dyn_avx2_verify_recover T

Number of similar (implementation,compiler) pairs: 9, namely:
ImplementationCompiler
T:avx2clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:avx2clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:avx2clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:avx2clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:avx2clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:avx2gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:avx2gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:avx2gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:avx2gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)

Namespace violations


codec.o falcon512dyn_fpu_comp_decode T
codec.o falcon512dyn_fpu_comp_encode T
codec.o falcon512dyn_fpu_max_FG_bits R
codec.o falcon512dyn_fpu_max_fg_bits R
codec.o falcon512dyn_fpu_max_sig_bits R
codec.o falcon512dyn_fpu_modq_decode T
codec.o falcon512dyn_fpu_modq_encode T
codec.o falcon512dyn_fpu_trim_i16_decode T
codec.o falcon512dyn_fpu_trim_i16_encode T
codec.o falcon512dyn_fpu_trim_i8_decode T
codec.o falcon512dyn_fpu_trim_i8_encode T
common.o falcon512dyn_fpu_hash_to_point_ct T
common.o falcon512dyn_fpu_hash_to_point_vartime T
common.o falcon512dyn_fpu_is_short T
common.o falcon512dyn_fpu_is_short_half T
fft.o falcon512dyn_fpu_FFT T
fft.o falcon512dyn_fpu_iFFT T
fft.o falcon512dyn_fpu_poly_LDL_fft T
fft.o falcon512dyn_fpu_poly_LDLmv_fft T
fft.o falcon512dyn_fpu_poly_add T
fft.o falcon512dyn_fpu_poly_add_muladj_fft T
fft.o falcon512dyn_fpu_poly_adj_fft T
fft.o falcon512dyn_fpu_poly_div_autoadj_fft T
fft.o falcon512dyn_fpu_poly_div_fft T
fft.o falcon512dyn_fpu_poly_invnorm2_fft T
fft.o falcon512dyn_fpu_poly_merge_fft T
fft.o falcon512dyn_fpu_poly_mul_autoadj_fft T
fft.o falcon512dyn_fpu_poly_mul_fft T
fft.o falcon512dyn_fpu_poly_muladj_fft T
fft.o falcon512dyn_fpu_poly_mulconst T
fft.o falcon512dyn_fpu_poly_mulselfadj_fft T
fft.o falcon512dyn_fpu_poly_neg T
fft.o falcon512dyn_fpu_poly_split_fft T
fft.o falcon512dyn_fpu_poly_sub T
fpr.o falcon512dyn_fpu_fpr_gm_tab R
fpr.o falcon512dyn_fpu_fpr_p2_tab R
keygen.o falcon512dyn_fpu_keygen T
rng.o falcon512dyn_fpu_get_seed T
rng.o falcon512dyn_fpu_prng_get_bytes T
rng.o falcon512dyn_fpu_prng_init T
rng.o falcon512dyn_fpu_prng_refill T
shake.o falcon512dyn_fpu_i_shake256_extract T
shake.o falcon512dyn_fpu_i_shake256_flip T
shake.o falcon512dyn_fpu_i_shake256_init T
shake.o falcon512dyn_fpu_i_shake256_inject T
sign.o falcon512dyn_fpu_expand_privkey T
sign.o falcon512dyn_fpu_gaussian0_sampler T
sign.o falcon512dyn_fpu_sampler T
sign.o falcon512dyn_fpu_sign_dyn T
sign.o falcon512dyn_fpu_sign_tree T
vrfy.o falcon512dyn_fpu_complete_private T
vrfy.o falcon512dyn_fpu_compute_public T
vrfy.o falcon512dyn_fpu_count_nttzero T
vrfy.o falcon512dyn_fpu_is_invertible T
vrfy.o falcon512dyn_fpu_to_ntt_monty T
vrfy.o falcon512dyn_fpu_verify_raw T
vrfy.o falcon512dyn_fpu_verify_recover T

Number of similar (implementation,compiler) pairs: 9, namely:
ImplementationCompiler
T:fpuclang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:fpuclang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:fpuclang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:fpuclang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:fpuclang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:fpugcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:fpugcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:fpugcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:fpugcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)

Namespace violations


codec.o falcon512dyn_ref_comp_decode T
codec.o falcon512dyn_ref_comp_encode T
codec.o falcon512dyn_ref_max_FG_bits R
codec.o falcon512dyn_ref_max_fg_bits R
codec.o falcon512dyn_ref_max_sig_bits R
codec.o falcon512dyn_ref_modq_decode T
codec.o falcon512dyn_ref_modq_encode T
codec.o falcon512dyn_ref_trim_i16_decode T
codec.o falcon512dyn_ref_trim_i16_encode T
codec.o falcon512dyn_ref_trim_i8_decode T
codec.o falcon512dyn_ref_trim_i8_encode T
common.o falcon512dyn_ref_hash_to_point_ct T
common.o falcon512dyn_ref_hash_to_point_vartime T
common.o falcon512dyn_ref_is_short T
common.o falcon512dyn_ref_is_short_half T
fft.o falcon512dyn_ref_FFT T
fft.o falcon512dyn_ref_iFFT T
fft.o falcon512dyn_ref_poly_LDL_fft T
fft.o falcon512dyn_ref_poly_LDLmv_fft T
fft.o falcon512dyn_ref_poly_add T
fft.o falcon512dyn_ref_poly_add_muladj_fft T
fft.o falcon512dyn_ref_poly_adj_fft T
fft.o falcon512dyn_ref_poly_div_autoadj_fft T
fft.o falcon512dyn_ref_poly_div_fft T
fft.o falcon512dyn_ref_poly_invnorm2_fft T
fft.o falcon512dyn_ref_poly_merge_fft T
fft.o falcon512dyn_ref_poly_mul_autoadj_fft T
fft.o falcon512dyn_ref_poly_mul_fft T
fft.o falcon512dyn_ref_poly_muladj_fft T
fft.o falcon512dyn_ref_poly_mulconst T
fft.o falcon512dyn_ref_poly_mulselfadj_fft T
fft.o falcon512dyn_ref_poly_neg T
fft.o falcon512dyn_ref_poly_split_fft T
fft.o falcon512dyn_ref_poly_sub T
fpr.o falcon512dyn_ref_fpr_add T
fpr.o falcon512dyn_ref_fpr_div T
fpr.o falcon512dyn_ref_fpr_expm_p63 T
fpr.o falcon512dyn_ref_fpr_gm_tab R
fpr.o falcon512dyn_ref_fpr_mul T
fpr.o falcon512dyn_ref_fpr_p2_tab R
fpr.o falcon512dyn_ref_fpr_scaled T
fpr.o falcon512dyn_ref_fpr_sqrt T
keygen.o falcon512dyn_ref_keygen T
rng.o falcon512dyn_ref_get_seed T
rng.o falcon512dyn_ref_prng_get_bytes T
rng.o falcon512dyn_ref_prng_init T
rng.o falcon512dyn_ref_prng_refill T
shake.o falcon512dyn_ref_i_shake256_extract T
shake.o falcon512dyn_ref_i_shake256_flip T
shake.o falcon512dyn_ref_i_shake256_init T
shake.o falcon512dyn_ref_i_shake256_inject T
sign.o falcon512dyn_ref_expand_privkey T
sign.o falcon512dyn_ref_gaussian0_sampler T
sign.o falcon512dyn_ref_sampler T
sign.o falcon512dyn_ref_sign_dyn T
sign.o falcon512dyn_ref_sign_tree T
vrfy.o falcon512dyn_ref_complete_private T
vrfy.o falcon512dyn_ref_compute_public T
vrfy.o falcon512dyn_ref_count_nttzero T
vrfy.o falcon512dyn_ref_is_invertible T
vrfy.o falcon512dyn_ref_to_ntt_monty T
vrfy.o falcon512dyn_ref_verify_raw T
vrfy.o falcon512dyn_ref_verify_recover T

Number of similar (implementation,compiler) pairs: 9, namely:
ImplementationCompiler
T:refclang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:refclang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:refclang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:refclang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:refclang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:refgcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:refgcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:refgcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:refgcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)