Implementation notes: amd64, hertz, crypto_encrypt/lotus192

Computer: hertz
Microarchitecture: amd64; Zen 4 (a60f12)
Architecture: amd64
CPU ID: AuthenticAMD-00a60f12-178bfbff
SUPERCOP version: 20240716
Operation: crypto_encrypt
Primitive: lotus192
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
44889047425 0 876552 956 1928T:optgcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
61462332598 0 861640 956 1864T:refgcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
1484637153221 0 8186704 956 1832T:optclang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
1548505154631 0 8188224 956 1832T:optclang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
280287214803 0 841944 956 1864T:optgcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
284402214028 0 841481 948 1832T:optclang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
504894610789 0 837840 956 1832T:refgcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
53182879672 0 837041 948 1832T:refclang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
96365598838 0 834496 948 1832T:refgcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
992904011409 0 837144 948 1832T:optgcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716

Test failure


error 111
crypto_encrypt_open returns nonzero

Number of similar (implementation,compiler) pairs: 2, namely:
ImplementationCompiler
T:refclang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))
T:refclang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))

Compiler output


lwe-arithmetics_avx2.c: lwe-arithmetics_avx2.c:211:5: warning: array index 4 is past the end of the array (that has type '__m256i[4]') [-Warray-bounds]
lwe-arithmetics_avx2.c:   211 |     c[4] = _mm256_loadu_si256((__m256i*)(C + 64));
lwe-arithmetics_avx2.c:       |     ^ ~
lwe-arithmetics_avx2.c: lwe-arithmetics_avx2.c:204:3: note: array 'c' declared here
lwe-arithmetics_avx2.c:   204 |   __m256i a, b[4], c[4];
lwe-arithmetics_avx2.c:       |   ^
lwe-arithmetics_avx2.c: lwe-arithmetics_avx2.c:212:5: warning: array index 5 is past the end of the array (that has type '__m256i[4]') [-Warray-bounds]
lwe-arithmetics_avx2.c:   212 |     c[5] = _mm256_loadu_si256((__m256i*)(C + 80));
lwe-arithmetics_avx2.c:       |     ^ ~
lwe-arithmetics_avx2.c: lwe-arithmetics_avx2.c:204:3: note: array 'c' declared here
lwe-arithmetics_avx2.c:   204 |   __m256i a, b[4], c[4];
lwe-arithmetics_avx2.c:       |   ^
lwe-arithmetics_avx2.c: lwe-arithmetics_avx2.c:220:7: warning: array index 4 is past the end of the array (that has type '__m256i[4]') [-Warray-bounds]
lwe-arithmetics_avx2.c:   220 |       b[4] = _mm256_loadu_si256((__m256i*)(p + 64));
lwe-arithmetics_avx2.c:       |       ^ ~
lwe-arithmetics_avx2.c: lwe-arithmetics_avx2.c:204:3: note: array 'b' declared here
lwe-arithmetics_avx2.c:   204 |   __m256i a, b[4], c[4];
lwe-arithmetics_avx2.c:       |   ^
lwe-arithmetics_avx2.c: lwe-arithmetics_avx2.c:221:7: warning: array index 5 is past the end of the array (that has type '__m256i[4]') [-Warray-bounds]
lwe-arithmetics_avx2.c:   221 |       b[5] = _mm256_loadu_si256((__m256i*)(p + 80));
lwe-arithmetics_avx2.c:       |       ^ ~
lwe-arithmetics_avx2.c: lwe-arithmetics_avx2.c:204:3: note: array 'b' declared here
lwe-arithmetics_avx2.c:   204 |   __m256i a, b[4], c[4];
lwe-arithmetics_avx2.c:       |   ^
lwe-arithmetics_avx2.c: lwe-arithmetics_avx2.c:226:33: warning: array index 4 is past the end of the array (that has type '__m256i[4]') [-Warray-bounds]
lwe-arithmetics_avx2.c: ...

Number of similar (implementation,compiler) pairs: 3, namely:
ImplementationCompiler
T:avx2clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))
T:avx2clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))
T:avx2clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))

Compiler output


lwe-arithmetics_avx2.c: lwe-arithmetics_avx2.c: In function 'submat_submul':
lwe-arithmetics_avx2.c: lwe-arithmetics_avx2.c:211:6: warning: array subscript 4 is above array bounds of '__m256i[4]' [-Warray-bounds=]
lwe-arithmetics_avx2.c:   211 |     c[4] = _mm256_loadu_si256((__m256i*)(C + 64));
lwe-arithmetics_avx2.c:       |     ~^~~
lwe-arithmetics_avx2.c: lwe-arithmetics_avx2.c:204:20: note: while referencing 'c'
lwe-arithmetics_avx2.c:   204 |   __m256i a, b[4], c[4];
lwe-arithmetics_avx2.c:       |                    ^
lwe-arithmetics_avx2.c: lwe-arithmetics_avx2.c:212:6: warning: array subscript 5 is above array bounds of '__m256i[4]' [-Warray-bounds=]
lwe-arithmetics_avx2.c:   212 |     c[5] = _mm256_loadu_si256((__m256i*)(C + 80));
lwe-arithmetics_avx2.c:       |     ~^~~
lwe-arithmetics_avx2.c: lwe-arithmetics_avx2.c:204:20: note: while referencing 'c'
lwe-arithmetics_avx2.c:   204 |   __m256i a, b[4], c[4];
lwe-arithmetics_avx2.c:       |                    ^
lwe-arithmetics_avx2.c: lwe-arithmetics_avx2.c:240:5: warning: array subscript 4 is above array bounds of '__m256i[4]' [-Warray-bounds=]
lwe-arithmetics_avx2.c:   240 |     _mm256_storeu_si256((__m256i*)(C + 64), c[4]);
lwe-arithmetics_avx2.c:       |     ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
lwe-arithmetics_avx2.c: lwe-arithmetics_avx2.c:204:20: note: while referencing 'c'
lwe-arithmetics_avx2.c:   204 |   __m256i a, b[4], c[4];
lwe-arithmetics_avx2.c:       |                    ^
lwe-arithmetics_avx2.c: lwe-arithmetics_avx2.c:241:5: warning: array subscript 5 is above array bounds of '__m256i[4]' [-Warray-bounds=]
lwe-arithmetics_avx2.c:   241 |     _mm256_storeu_si256((__m256i*)(C + 80), c[5]);
lwe-arithmetics_avx2.c:       |     ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
lwe-arithmetics_avx2.c: lwe-arithmetics_avx2.c:204:20: note: while referencing 'c'
lwe-arithmetics_avx2.c:   204 |   __m256i a, b[4], c[4];
lwe-arithmetics_avx2.c:       |                    ^
lwe-arithmetics_avx2.c: ...

Number of similar (implementation,compiler) pairs: 3, namely:
ImplementationCompiler
T:avx2gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.2.0)
T:avx2gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.2.0)
T:avx2gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.2.0)

Namespace violations


cpa-pke_opt.o lotus_cpa_pke_dec_packed T
cpa-pke_opt.o lotus_cpa_pke_enc_packed T
cpa-pke_opt.o lotus_cpa_pke_keypair T
crypto.o crypto_symenc_decrypt T
crypto.o crypto_symenc_encrypt T
crypto.o crypto_symenc_keysetup T
crypto.o crypto_symenc_keystream T
crypto.o crypto_symenc_keystream_13block T
crypto.o crypto_symenc_keystream_32block T
encrypt.o util_cmp_const T
lwe-arithmetics_opt.o add_sigma T
lwe-arithmetics_opt.o distribute_2x2_nl T
lwe-arithmetics_opt.o distribute_2x2_nn T
lwe-arithmetics_opt.o merge_2x2_nl T
lwe-arithmetics_opt.o reconstruct T
lwe-arithmetics_opt.o redc T
lwe-arithmetics_opt.o submat_add_nl T
lwe-arithmetics_opt.o submat_add_nn T
lwe-arithmetics_opt.o submat_negmul T
lwe-arithmetics_opt.o submat_negsubmul T
lwe-arithmetics_opt.o submat_sub_nl T
lwe-arithmetics_opt.o submat_sub_nn T
lwe-arithmetics_opt.o submat_submul T
lwe-arithmetics_opt.o submul T
pack.o pack_128dg T
pack.o pack_128elems T
pack.o pack_64elems T
pack.o pack_ct T
pack.o pack_pk T
pack.o pack_sk T
pack.o unpack_128dg T
pack.o unpack_128elems T
pack.o unpack_64elems T
pack.o unpack_ct T
pack.o unpack_pk T
pack.o unpack_sk T
sampler.o _LOTUS_KYDG_SAMPLER_L1_pMat R
sampler.o _LOTUS_KYDG_SAMPLER_L1_weight R
sampler.o _LOTUS_KYDG_SAMPLER_LUT R
sampler.o csprng_sample_bit T
sampler.o csprng_sample_byte T
sampler.o extend_sign_with_random_bit T
sampler.o sample_discrete_gaussian T
sampler.o sample_uniform T
sampler.o sample_unit_discrete_gaussian T
sampler.o sampler_init T
sampler.o sampler_set_seed T
sampler.o scan_bit_and_output T

Number of similar (implementation,compiler) pairs: 6, namely:
ImplementationCompiler
T:optclang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))
T:optclang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))
T:optclang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))
T:optgcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.2.0)
T:optgcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.2.0)
T:optgcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.2.0)

Namespace violations


cpa-pke.o lotus_cpa_pke_dec T
cpa-pke.o lotus_cpa_pke_enc T
cpa-pke.o lotus_cpa_pke_keypair T
crypto.o crypto_symenc_decrypt T
crypto.o crypto_symenc_encrypt T
crypto.o crypto_symenc_keysetup T
crypto.o crypto_symenc_keystream T
encrypt.o util_cmp_const T
lwe-arithmetics.o add_sigma T
lwe-arithmetics.o addmul T
lwe-arithmetics.o addmul_concat T
lwe-arithmetics.o reconstruct T
lwe-arithmetics.o redc T
lwe-arithmetics.o submul T
pack.o pack_128dg T
pack.o pack_128elems T
pack.o pack_64elems T
pack.o pack_ct T
pack.o pack_pk T
pack.o pack_sk T
pack.o unpack_128dg T
pack.o unpack_128elems T
pack.o unpack_64elems T
pack.o unpack_ct T
pack.o unpack_pk T
pack.o unpack_sk T
sampler.o _LOTUS_KYDG_SAMPLER_L1_pMat R
sampler.o _LOTUS_KYDG_SAMPLER_L1_weight R
sampler.o _LOTUS_KYDG_SAMPLER_LUT R
sampler.o csprng_sample_bit T
sampler.o csprng_sample_byte T
sampler.o extend_sign_with_random_bit T
sampler.o sample_discrete_gaussian T
sampler.o sample_uniform T
sampler.o sample_unit_discrete_gaussian T
sampler.o sampler_init T
sampler.o sampler_set_seed T
sampler.o scan_bit_and_output T

Number of similar (implementation,compiler) pairs: 4, namely:
ImplementationCompiler
T:refclang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))
T:refgcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.2.0)
T:refgcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.2.0)
T:refgcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.2.0)