Implementation notes: amd64, rumba7, crypto_sign/falcon1024dyn

Computer: rumba7
Microarchitecture: amd64; Zen (800f11)
Architecture: amd64
CPU ID: AuthenticAMD-00800f11-178bfbff
SUPERCOP version: 20240716
Operation: crypto_sign
Primitive: falcon1024dyn
Time      Object size  Test size        Implementation  Compiler  Benchmark date  SUPERCOP version
1992905   156761 0 0   178333 868 1760  T:avx2          clang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall  20240717  20240716
2005508   140567 0 0   161949 868 1760  T:avx2          clang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall  20240717  20240716
2044801   165414 0 0   185973 868 1728  T:avx2          clang_-mcpu=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall  20240717  20240716
2152703   95643 0 0    115005 868 1728  T:avx2          clang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall  20240717  20240716
2169181   167630 0 0   189495 820 1792  T:avx2          gcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall  20240717  20240716
2200375   80873 0 0    100983 820 1792  T:avx2          gcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall  20240717  20240716
2219289   80844 0 0    100695 860 1792  T:avx2          clang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall  20240717  20240716
2280587   86406 0 0    107103 820 1792  T:avx2          gcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall  20240717  20240716
2299059   164394 0 0   186247 820 1792  T:fpu           gcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall  20240717  20240716
2474255   137356 0 0   158997 868 1760  T:fpu           clang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall  20240717  20240716
2518152   154526 0 0   176309 868 1760  T:fpu           clang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall  20240717  20240716
2526674   164479 0 0   185221 868 1728  T:fpu           clang_-mcpu=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall  20240717  20240716
2734705   73396 0 0    92471 812 1760   T:avx2          gcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall  20240717  20240716
2782762   92553 0 0    112005 860 1728  T:fpu           clang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall  20240717  20240716
2784527   78073 0 0    97999 852 1792   T:fpu           clang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall  20240717  20240716
2792679   78453 0 0    98607 820 1792   T:fpu           gcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall  20240717  20240716
2795486   83857 0 0    104607 820 1792  T:fpu           gcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall  20240717  20240716
3251822   71427 0 0    90567 812 1760   T:fpu           gcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall  20240717  20240716
33160727  165924 0 0   186677 860 1728  T:ref           clang_-mcpu=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall  20240717  20240716
33218690  89023 0 0    108999 852 1792  T:ref           clang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall  20240717  20240716
34241794  101248 0 0   120997 860 1728  T:ref           clang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall  20240717  20240716
34659719  140600 0 0   162469 860 1760  T:ref           clang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall  20240717  20240716
34661730  157451 0 0   179453 860 1760  T:ref           clang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall  20240717  20240716
36878371  160115 0 0   182103 820 1792  T:ref           gcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall  20240717  20240716
37257936  90342 0 0    111143 820 1792  T:ref           gcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall  20240717  20240716
37795190  78245 0 0    97455 812 1760   T:ref           gcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall  20240717  20240716
37799389  87136 0 0    107343 820 1792  T:ref           gcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall  20240717  20240716
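
The timings above can be compared across implementations with a few lines of Python. This is only a sketch: it assumes the leading digits of each row are the Time value, that Time is measured in CPU cycles (the table itself does not state the unit), and it uses only the fastest row for each of avx2, fpu, and ref. The names fastest_cycles and speedup_over are hypothetical helpers, not part of SUPERCOP.

```python
# Fastest Time value per implementation, read off the table above.
# Assumption: Time is a cycle count, so a smaller value means faster code.
fastest_cycles = {"avx2": 1992905, "fpu": 2299059, "ref": 33160727}


def speedup_over(baseline, cycles):
    """Return how many times faster each implementation is than `baseline`."""
    base = cycles[baseline]
    return {impl: base / value for impl, value in cycles.items()}


speedups = speedup_over("ref", fastest_cycles)

# Print the implementations from largest to smallest speedup.
for impl, factor in sorted(speedups.items(), key=lambda kv: -kv[1]):
    print(f"{impl}: {factor:.2f}x relative to ref")
```

Under these assumptions, both the avx2 and fpu builds come out more than an order of magnitude faster than ref on this machine, and the choice of compiler and optimization level matters far less than the choice of implementation.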

Namespace violations


codec.o falcon1024dyn_avx2_comp_decode T
codec.o falcon1024dyn_avx2_comp_encode T
codec.o falcon1024dyn_avx2_max_FG_bits R
codec.o falcon1024dyn_avx2_max_fg_bits R
codec.o falcon1024dyn_avx2_max_sig_bits R
codec.o falcon1024dyn_avx2_modq_decode T
codec.o falcon1024dyn_avx2_modq_encode T
codec.o falcon1024dyn_avx2_trim_i16_decode T
codec.o falcon1024dyn_avx2_trim_i16_encode T
codec.o falcon1024dyn_avx2_trim_i8_decode T
codec.o falcon1024dyn_avx2_trim_i8_encode T
common.o falcon1024dyn_avx2_hash_to_point_ct T
common.o falcon1024dyn_avx2_hash_to_point_vartime T
common.o falcon1024dyn_avx2_is_short T
common.o falcon1024dyn_avx2_is_short_half T
fft.o falcon1024dyn_avx2_FFT T
fft.o falcon1024dyn_avx2_iFFT T
fft.o falcon1024dyn_avx2_poly_LDL_fft T
fft.o falcon1024dyn_avx2_poly_LDLmv_fft T
fft.o falcon1024dyn_avx2_poly_add T
fft.o falcon1024dyn_avx2_poly_add_muladj_fft T
fft.o falcon1024dyn_avx2_poly_adj_fft T
fft.o falcon1024dyn_avx2_poly_div_autoadj_fft T
fft.o falcon1024dyn_avx2_poly_div_fft T
fft.o falcon1024dyn_avx2_poly_invnorm2_fft T
fft.o falcon1024dyn_avx2_poly_merge_fft T
fft.o falcon1024dyn_avx2_poly_mul_autoadj_fft T
fft.o falcon1024dyn_avx2_poly_mul_fft T
fft.o falcon1024dyn_avx2_poly_muladj_fft T
fft.o falcon1024dyn_avx2_poly_mulconst T
fft.o falcon1024dyn_avx2_poly_mulselfadj_fft T
fft.o falcon1024dyn_avx2_poly_neg T
fft.o falcon1024dyn_avx2_poly_split_fft T
fft.o falcon1024dyn_avx2_poly_sub T
fpr.o falcon1024dyn_avx2_fpr_gm_tab R
fpr.o falcon1024dyn_avx2_fpr_p2_tab R
keygen.o falcon1024dyn_avx2_keygen T
rng.o falcon1024dyn_avx2_get_seed T
rng.o falcon1024dyn_avx2_prng_get_bytes T
rng.o falcon1024dyn_avx2_prng_init T
rng.o falcon1024dyn_avx2_prng_refill T
shake.o falcon1024dyn_avx2_i_shake256_extract T
shake.o falcon1024dyn_avx2_i_shake256_flip T
shake.o falcon1024dyn_avx2_i_shake256_init T
shake.o falcon1024dyn_avx2_i_shake256_inject T
sign.o falcon1024dyn_avx2_expand_privkey T
sign.o falcon1024dyn_avx2_gaussian0_sampler T
sign.o falcon1024dyn_avx2_sampler T
sign.o falcon1024dyn_avx2_sign_dyn T
sign.o falcon1024dyn_avx2_sign_tree T
vrfy.o falcon1024dyn_avx2_complete_private T
vrfy.o falcon1024dyn_avx2_compute_public T
vrfy.o falcon1024dyn_avx2_count_nttzero T
vrfy.o falcon1024dyn_avx2_is_invertible T
vrfy.o falcon1024dyn_avx2_to_ntt_monty T
vrfy.o falcon1024dyn_avx2_verify_raw T
vrfy.o falcon1024dyn_avx2_verify_recover T
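
Each line of the listing above has three fields: object file, symbol name, and a one-letter symbol type. The letters look like nm-style type codes (T for a global code symbol, R for global read-only data), though that reading is an assumption. A minimal sketch for tallying such a listing, using a few sample lines copied from above and a hypothetical helper parse_symbols:

```python
from collections import Counter

# Sample lines copied from the listing: "<object file> <symbol> <type>".
listing = """\
codec.o falcon1024dyn_avx2_comp_decode T
codec.o falcon1024dyn_avx2_max_sig_bits R
fft.o falcon1024dyn_avx2_FFT T
fpr.o falcon1024dyn_avx2_fpr_gm_tab R
"""


def parse_symbols(text):
    """Split each well-formed line into (object_file, symbol, type)."""
    entries = []
    for line in text.splitlines():
        parts = line.split()
        if len(parts) != 3:
            continue  # skip blank or malformed lines
        entries.append(tuple(parts))
    return entries


entries = parse_symbols(listing)
per_object = Counter(obj for obj, _, _ in entries)   # symbols per object file
per_type = Counter(kind for _, _, kind in entries)   # symbols per type letter
print(per_object)
print(per_type)
```

Feeding the full listing through the same function gives a quick count of exported symbols per object file, which can help decide where to start if the symbols are to be renamed or made static.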

Number of similar (implementation,compiler) pairs: 9, namely:
Implementation  Compiler
T:avx2  clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:avx2  clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:avx2  clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:avx2  clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:avx2  clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:avx2  gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:avx2  gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:avx2  gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:avx2  gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)

Namespace violations


codec.o falcon1024dyn_fpu_comp_decode T
codec.o falcon1024dyn_fpu_comp_encode T
codec.o falcon1024dyn_fpu_max_FG_bits R
codec.o falcon1024dyn_fpu_max_fg_bits R
codec.o falcon1024dyn_fpu_max_sig_bits R
codec.o falcon1024dyn_fpu_modq_decode T
codec.o falcon1024dyn_fpu_modq_encode T
codec.o falcon1024dyn_fpu_trim_i16_decode T
codec.o falcon1024dyn_fpu_trim_i16_encode T
codec.o falcon1024dyn_fpu_trim_i8_decode T
codec.o falcon1024dyn_fpu_trim_i8_encode T
common.o falcon1024dyn_fpu_hash_to_point_ct T
common.o falcon1024dyn_fpu_hash_to_point_vartime T
common.o falcon1024dyn_fpu_is_short T
common.o falcon1024dyn_fpu_is_short_half T
fft.o falcon1024dyn_fpu_FFT T
fft.o falcon1024dyn_fpu_iFFT T
fft.o falcon1024dyn_fpu_poly_LDL_fft T
fft.o falcon1024dyn_fpu_poly_LDLmv_fft T
fft.o falcon1024dyn_fpu_poly_add T
fft.o falcon1024dyn_fpu_poly_add_muladj_fft T
fft.o falcon1024dyn_fpu_poly_adj_fft T
fft.o falcon1024dyn_fpu_poly_div_autoadj_fft T
fft.o falcon1024dyn_fpu_poly_div_fft T
fft.o falcon1024dyn_fpu_poly_invnorm2_fft T
fft.o falcon1024dyn_fpu_poly_merge_fft T
fft.o falcon1024dyn_fpu_poly_mul_autoadj_fft T
fft.o falcon1024dyn_fpu_poly_mul_fft T
fft.o falcon1024dyn_fpu_poly_muladj_fft T
fft.o falcon1024dyn_fpu_poly_mulconst T
fft.o falcon1024dyn_fpu_poly_mulselfadj_fft T
fft.o falcon1024dyn_fpu_poly_neg T
fft.o falcon1024dyn_fpu_poly_split_fft T
fft.o falcon1024dyn_fpu_poly_sub T
fpr.o falcon1024dyn_fpu_fpr_gm_tab R
fpr.o falcon1024dyn_fpu_fpr_p2_tab R
keygen.o falcon1024dyn_fpu_keygen T
rng.o falcon1024dyn_fpu_get_seed T
rng.o falcon1024dyn_fpu_prng_get_bytes T
rng.o falcon1024dyn_fpu_prng_init T
rng.o falcon1024dyn_fpu_prng_refill T
shake.o falcon1024dyn_fpu_i_shake256_extract T
shake.o falcon1024dyn_fpu_i_shake256_flip T
shake.o falcon1024dyn_fpu_i_shake256_init T
shake.o falcon1024dyn_fpu_i_shake256_inject T
sign.o falcon1024dyn_fpu_expand_privkey T
sign.o falcon1024dyn_fpu_gaussian0_sampler T
sign.o falcon1024dyn_fpu_sampler T
sign.o falcon1024dyn_fpu_sign_dyn T
sign.o falcon1024dyn_fpu_sign_tree T
vrfy.o falcon1024dyn_fpu_complete_private T
vrfy.o falcon1024dyn_fpu_compute_public T
vrfy.o falcon1024dyn_fpu_count_nttzero T
vrfy.o falcon1024dyn_fpu_is_invertible T
vrfy.o falcon1024dyn_fpu_to_ntt_monty T
vrfy.o falcon1024dyn_fpu_verify_raw T
vrfy.o falcon1024dyn_fpu_verify_recover T

Number of similar (implementation,compiler) pairs: 9, namely:
Implementation  Compiler
T:fpu  clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:fpu  clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:fpu  clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:fpu  clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:fpu  clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:fpu  gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:fpu  gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:fpu  gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:fpu  gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)

Namespace violations


codec.o falcon1024dyn_ref_comp_decode T
codec.o falcon1024dyn_ref_comp_encode T
codec.o falcon1024dyn_ref_max_FG_bits R
codec.o falcon1024dyn_ref_max_fg_bits R
codec.o falcon1024dyn_ref_max_sig_bits R
codec.o falcon1024dyn_ref_modq_decode T
codec.o falcon1024dyn_ref_modq_encode T
codec.o falcon1024dyn_ref_trim_i16_decode T
codec.o falcon1024dyn_ref_trim_i16_encode T
codec.o falcon1024dyn_ref_trim_i8_decode T
codec.o falcon1024dyn_ref_trim_i8_encode T
common.o falcon1024dyn_ref_hash_to_point_ct T
common.o falcon1024dyn_ref_hash_to_point_vartime T
common.o falcon1024dyn_ref_is_short T
common.o falcon1024dyn_ref_is_short_half T
fft.o falcon1024dyn_ref_FFT T
fft.o falcon1024dyn_ref_iFFT T
fft.o falcon1024dyn_ref_poly_LDL_fft T
fft.o falcon1024dyn_ref_poly_LDLmv_fft T
fft.o falcon1024dyn_ref_poly_add T
fft.o falcon1024dyn_ref_poly_add_muladj_fft T
fft.o falcon1024dyn_ref_poly_adj_fft T
fft.o falcon1024dyn_ref_poly_div_autoadj_fft T
fft.o falcon1024dyn_ref_poly_div_fft T
fft.o falcon1024dyn_ref_poly_invnorm2_fft T
fft.o falcon1024dyn_ref_poly_merge_fft T
fft.o falcon1024dyn_ref_poly_mul_autoadj_fft T
fft.o falcon1024dyn_ref_poly_mul_fft T
fft.o falcon1024dyn_ref_poly_muladj_fft T
fft.o falcon1024dyn_ref_poly_mulconst T
fft.o falcon1024dyn_ref_poly_mulselfadj_fft T
fft.o falcon1024dyn_ref_poly_neg T
fft.o falcon1024dyn_ref_poly_split_fft T
fft.o falcon1024dyn_ref_poly_sub T
fpr.o falcon1024dyn_ref_fpr_add T
fpr.o falcon1024dyn_ref_fpr_div T
fpr.o falcon1024dyn_ref_fpr_expm_p63 T
fpr.o falcon1024dyn_ref_fpr_gm_tab R
fpr.o falcon1024dyn_ref_fpr_mul T
fpr.o falcon1024dyn_ref_fpr_p2_tab R
fpr.o falcon1024dyn_ref_fpr_scaled T
fpr.o falcon1024dyn_ref_fpr_sqrt T
keygen.o falcon1024dyn_ref_keygen T
rng.o falcon1024dyn_ref_get_seed T
rng.o falcon1024dyn_ref_prng_get_bytes T
rng.o falcon1024dyn_ref_prng_init T
rng.o falcon1024dyn_ref_prng_refill T
shake.o falcon1024dyn_ref_i_shake256_extract T
shake.o falcon1024dyn_ref_i_shake256_flip T
shake.o falcon1024dyn_ref_i_shake256_init T
shake.o falcon1024dyn_ref_i_shake256_inject T
sign.o falcon1024dyn_ref_expand_privkey T
sign.o falcon1024dyn_ref_gaussian0_sampler T
sign.o falcon1024dyn_ref_sampler T
sign.o falcon1024dyn_ref_sign_dyn T
sign.o falcon1024dyn_ref_sign_tree T
vrfy.o falcon1024dyn_ref_complete_private T
vrfy.o falcon1024dyn_ref_compute_public T
vrfy.o falcon1024dyn_ref_count_nttzero T
vrfy.o falcon1024dyn_ref_is_invertible T
vrfy.o falcon1024dyn_ref_to_ntt_monty T
vrfy.o falcon1024dyn_ref_verify_raw T
vrfy.o falcon1024dyn_ref_verify_recover T

Number of similar (implementation,compiler) pairs: 9, namely:
Implementation  Compiler
T:ref  clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:ref  clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:ref  clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:ref  clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:ref  clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:ref  gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:ref  gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:ref  gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:ref  gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)