Implementation notes: amd64, colossus5, crypto_kem/bikel3

Computer: colossus5
Architecture: amd64
CPU ID: AuthenticAMD-00800f12-178bfbff
SUPERCOP version: 20210125
Operation: crypto_kem
Primitive: bikel3
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
1172080042164 64 464062 856 1604T:avx2gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021031020210125
1640334027991 64 447790 856 1604T:avx2gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021031020210125
1662294015510 64 434062 848 1572T:avx2gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021031020210125
1690840027582 64 446934 856 1604T:avx2gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021031020210125
2433086035687 56 457534 848 1604T:aes-ni-and-pclmulgcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021031020210125
2721670014147 56 432646 840 1572T:aes-ni-and-pclmulgcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021031020210125
2821204026057 56 445742 848 1604T:aes-ni-and-pclmulgcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021031020210125
2857738025788 56 445078 848 1604T:aes-ni-and-pclmulgcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021031020210125
5286384011696 48 430753 904 1572T:portablegcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021031020210125
5347960013819 56 432246 840 1572T:aes-ni-onlygcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021031020210125
5543970024259 48 444561 912 1604T:portablegcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021031020210125
5570420027321 56 446998 848 1604T:aes-ni-onlygcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021031020210125
5735654027512 56 446782 848 1604T:aes-ni-onlygcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021031020210125
5761638024763 48 444697 912 1604T:portablegcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021031020210125
6629312048972 48 471409 912 1604T:portablegcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021031020210125
6642862052799 56 474638 848 1604T:aes-ni-onlygcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021031020210125

Compiler output

Implementation: T:avx512
Security model: timingleaks
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE
decode.c: In file included from decode.c:39:0:
decode.c: gf2x.h: In function 'gf2x_mod_add':
decode.c: gf2x.h:22:8: warning: AVX512F vector return without AVX512F enabled changes the ABI [-Wpsabi]
decode.c: va = LOAD(&a_qwords[i]);
decode.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:45:0,
decode.c: from x86_64_intrinsic.h:20,
decode.c: from defs.h:103,
decode.c: from bike_defs.h:10,
decode.c: from types.h:13,
decode.c: from decode.h:10,
decode.c: from decode.c:37:
decode.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx512fintrin.h:5831:1: error: inlining failed in call to always_inline '_mm512_storeu_si512': target specific option mismatch
decode.c: _mm512_storeu_si512 (void *__P, __m512i __A)
decode.c: ^~~~~~~~~~~~~~~~~~~
decode.c: In file included from defs.h:103:0,
decode.c: from bike_defs.h:10,
decode.c: from types.h:13,
decode.c: from decode.h:10,
decode.c: from decode.c:37:
decode.c: x86_64_intrinsic.h:41:27: note: called from here
decode.c: # define STORE(mem, reg) _mm512_storeu_si512((mem), (reg))
decode.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
decode.c: gf2x.h:25:5: note: in expansion of macro 'STORE'
decode.c: STORE(&c_qwords[i], va ^ vb);
decode.c: ^~~~~
decode.c: ...

Number of similar (compiler,implementation) pairs: 8, namely:
CompilerImplementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx512
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx512
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx512
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx512
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx512-vpclmul
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx512-vpclmul
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx512-vpclmul
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx512-vpclmul

Namespace violations

Implementation: T:aes-ni-and-pclmul
Security model: timingleaks
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE
aes.o aes256_enc T
aes.o aes256_key_expansion T
aes_ctr_prf.o aes_ctr_prf T
aes_ctr_prf.o init_aes_ctr_prf_state T
decode.o compute_syndrome T
decode.o decode T
error.o bike_errno B
gf2x_inv.o gf2x_mod_inv T
gf2x_ksqr_portable.o k_squaring T
gf2x_mul.o gf2x_mod_mul T
gf2x_mul.o gf2x_mod_sqr_in_place T
gf2x_mul_pclmul.o gf2x_mul_base T
gf2x_mul_pclmul.o gf2x_sqr T
rotate_portable.o rotate_right T
sampling.o generate_error_vector T
sampling.o generate_indices_mod_z T
sampling.o generate_sparse_rep T
sampling.o get_seeds T
sampling.o sample_uniform_r_bits T
sampling.o sample_uniform_r_bits_with_fixed_prf_context T
sampling_portable.o secure_set_bits T
sha.o sha T
utilities.o r_bits_vector_weight T

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:aes-ni-and-pclmul
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:aes-ni-and-pclmul
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE T:aes-ni-and-pclmul
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE T:aes-ni-and-pclmul

Namespace violations

Implementation: T:aes-ni-only
Security model: timingleaks
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE
aes.o aes256_enc T
aes.o aes256_key_expansion T
aes_ctr_prf.o aes_ctr_prf T
aes_ctr_prf.o init_aes_ctr_prf_state T
decode.o compute_syndrome T
decode.o decode T
error.o bike_errno B
gf2x_inv.o gf2x_mod_inv T
gf2x_ksqr_portable.o k_squaring T
gf2x_mul.o gf2x_mod_mul T
gf2x_mul.o gf2x_mod_sqr_in_place T
gf2x_mul_portable.o gf2x_mul_base T
gf2x_mul_portable.o gf2x_sqr T
rotate_portable.o rotate_right T
sampling.o generate_error_vector T
sampling.o generate_indices_mod_z T
sampling.o generate_sparse_rep T
sampling.o get_seeds T
sampling.o sample_uniform_r_bits T
sampling.o sample_uniform_r_bits_with_fixed_prf_context T
sampling_portable.o secure_set_bits T
sha.o sha T
utilities.o r_bits_vector_weight T

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:aes-ni-only
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:aes-ni-only
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE T:aes-ni-only
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE T:aes-ni-only

Namespace violations

Implementation: T:avx2
Security model: timingleaks
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE
aes.o aes256_enc T
aes.o aes256_key_expansion T
aes_ctr_prf.o aes_ctr_prf T
aes_ctr_prf.o init_aes_ctr_prf_state T
decode.o compute_syndrome T
decode.o decode T
error.o bike_errno B
gf2x_inv.o gf2x_mod_inv T
gf2x_ksqr_avx2.o generate_map T
gf2x_ksqr_avx2.o k_squaring T
gf2x_mul.o gf2x_mod_mul T
gf2x_mul.o gf2x_mod_sqr_in_place T
gf2x_mul_pclmul.o gf2x_mul_base T
gf2x_mul_pclmul.o gf2x_sqr T
rotate_avx2.o rotate_right T
sampling.o generate_error_vector T
sampling.o generate_indices_mod_z T
sampling.o generate_sparse_rep T
sampling.o get_seeds T
sampling.o sample_uniform_r_bits T
sampling.o sample_uniform_r_bits_with_fixed_prf_context T
sampling_avx2.o secure_set_bits T
sha.o sha T
utilities.o r_bits_vector_weight T

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx2
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx2
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx2
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx2

Namespace violations

Implementation: T:portable
Security model: timingleaks
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE
aes_ctr_prf.o aes_ctr_prf T
aes_ctr_prf.o init_aes_ctr_prf_state T
decode.o compute_syndrome T
decode.o decode T
error.o bike_errno B
gf2x_inv.o gf2x_mod_inv T
gf2x_ksqr_portable.o k_squaring T
gf2x_mul.o gf2x_mod_mul T
gf2x_mul.o gf2x_mod_sqr_in_place T
gf2x_mul_portable.o gf2x_mul_base T
gf2x_mul_portable.o gf2x_sqr T
rotate_portable.o rotate_right T
sampling.o generate_error_vector T
sampling.o generate_indices_mod_z T
sampling.o generate_sparse_rep T
sampling.o get_seeds T
sampling.o sample_uniform_r_bits T
sampling.o sample_uniform_r_bits_with_fixed_prf_context T
sampling_portable.o secure_set_bits T
utilities.o r_bits_vector_weight T

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:portable
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:portable
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE T:portable
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE T:portable