Implementation notes: amd64, hertz, crypto_sign/dilithium3

Computer: hertz
Microarchitecture: amd64; Zen 4 (a60f12)
Architecture: amd64
CPU ID: AuthenticAMD-00a60f12-178bfbff
SUPERCOP version: 20240425
Operation: crypto_sign
Primitive: dilithium3
Time    | Object size | Test size       | Implementation | Compiler                                                                                | Benchmark date | SUPERCOP version
275956  | 126864 64 0 | 160729 916 1832 | avx2           | clang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20240429       | 20240425
276738  | 122268 64 0 | 156265 916 1832 | avx2           | clang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20240429       | 20240425
286371  | 71109 64 0  | 98987 908 1832  | avx2           | clang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20240429       | 20240425
295253  | 93718 64 0  | 123660 892 1896 | avx2           | gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE        | 20240429       | 20240425
301796  | 65904 64 0  | 93820 892 1832  | avx2           | gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE        | 20240429       | 20240425
305122  | 59936 64 0  | 86268 884 1800  | avx2           | gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE        | 20240429       | 20240425
1080029 | 77843 0 0   | 111641 844 1832 | ref            | clang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20240429       | 20240425
1138386 | 73210 0 0   | 106889 844 1832 | ref            | clang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20240429       | 20240425
1219144 | 17927 0 0   | 46075 836 1832  | ref            | clang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20240429       | 20240425
1225556 | 45504 0 0   | 75284 820 1896  | ref            | gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE        | 20240429       | 20240425
1245145 | 21770 0 0   | 49548 820 1832  | ref            | gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE        | 20240429       | 20240425
1380628 | 18640 0 0   | 44916 812 1800  | ref            | gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE        | 20240429       | 20240425

Compiler output

Implementation: avx2
Security model: constbranchindex
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE
poly.c: poly.c: In function 'crypto_sign_dilithium3_avx2_constbranchindex_poly_uniform_eta_4x':
poly.c: <command-line>: warning: 'crypto_sign_dilithium3_avx2_constbranchindex_rej_eta_avx' reading 840 bytes from a region of size 704 [-Wstringop-overread]
poly.c: config.h:3:32: note: in expansion of macro 'CRYPTO_NAMESPACE'
poly.c: 3 | #define DILITHIUM_NAMESPACE(s) CRYPTO_NAMESPACE(s)
poly.c: | ^~~~~~~~~~~~~~~~
poly.c: rejsample.h:24:21: note: in expansion of macro 'DILITHIUM_NAMESPACE'
poly.c: 24 | #define rej_eta_avx DILITHIUM_NAMESPACE(rej_eta_avx)
poly.c: | ^~~~~~~~~~~~~~~~~~~
poly.c: poly.c:596:10: note: in expansion of macro 'rej_eta_avx'
poly.c: 596 | ctr2 = rej_eta_avx(a2->coeffs, buf[2].coeffs);
poly.c: | ^~~~~~~~~~~
poly.c: <command-line>: note: referencing argument 2 of type 'const uint8_t[840]' {aka 'const unsigned char[840]'}
poly.c: config.h:3:32: note: in expansion of macro 'CRYPTO_NAMESPACE'
poly.c: 3 | #define DILITHIUM_NAMESPACE(s) CRYPTO_NAMESPACE(s)
poly.c: | ^~~~~~~~~~~~~~~~
poly.c: rejsample.h:24:21: note: in expansion of macro 'DILITHIUM_NAMESPACE'
poly.c: 24 | #define rej_eta_avx DILITHIUM_NAMESPACE(rej_eta_avx)
poly.c: | ^~~~~~~~~~~~~~~~~~~
poly.c: poly.c:596:10: note: in expansion of macro 'rej_eta_avx'
poly.c: 596 | ctr2 = rej_eta_avx(a2->coeffs, buf[2].coeffs);
poly.c: | ^~~~~~~~~~~
poly.c: <command-line>: note: in a call to function 'crypto_sign_dilithium3_avx2_constbranchindex_rej_eta_avx'
poly.c: config.h:3:32: note: in expansion of macro 'CRYPTO_NAMESPACE'
poly.c: 3 | #define DILITHIUM_NAMESPACE(s) CRYPTO_NAMESPACE(s)
poly.c: | ^~~~~~~~~~~~~~~~
poly.c: ...
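
Note on the warning above: -Wstringop-overread fires because the callee's second parameter is declared as an 840-byte array while GCC sees the argument as a 704-byte region. The following is a minimal sketch of that pattern, not the Dilithium code. The names count_accepted, demo and SAMPLE_BUFLEN are hypothetical, and the acceptance rule is only a stand-in. This report alone does not establish whether the real callee ever reads past the smaller region.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Bound declared in the parameter type, matching the 840 in the warning. */
#define SAMPLE_BUFLEN 840

/* Hypothetical stand-in for a rejection sampler: counts bytes whose low
   nibble is below 9. The array bound in the parameter tells GCC that the
   callee may read all SAMPLE_BUFLEN bytes. */
static unsigned count_accepted(const uint8_t buf[SAMPLE_BUFLEN])
{
    unsigned n = 0;
    for (size_t i = 0; i < SAMPLE_BUFLEN; i++)
        n += (buf[i] & 15) < 9;
    return n;
}

/* Caller whose buffer matches the declared bound, so no overread can occur.
   Passing a 704-byte array here instead is the pattern GCC reports. */
static unsigned demo(void)
{
    uint8_t buf[SAMPLE_BUFLEN];
    memset(buf, 0x03, sizeof buf); /* every low nibble is 3, so all bytes count */
    return count_accepted(buf);
}
```

In this sketch the caller's buffer is sized to the declared bound. The other usual option is to declare the parameter with the size the callee actually reads.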

Number of similar (compiler,implementation) pairs: 3, namely:
Compiler                                                                      | Implementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE | avx2
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE | avx2
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE | avx2