Implementation notes: amd64, gcc68, crypto_kem/mceliece8192128f

Computer: gcc68
Architecture: amd64
CPU ID: AuthenticAMD-00800f82-178bfbff
SUPERCOP version: 20191221
Operation: crypto_kem
Primitive: mceliece8192128f
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
548384134009 0 0172532 808 1608avxgcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020010120191221
573504184850 0 0213404 808 1608ssegcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020010120191221
60924881309 0 0108820 808 1608ssegcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020010120191221
64553678795 0 0105988 808 1608ssegcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020010120191221
71148872993 0 099260 800 1576ssegcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020010120191221
1465792207573 0 0238212 808 1608vecgcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020010120191221
174838470414 0 099081 784 1576vecclang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020010120191221
1823200130300 0 0162195 792 1576vecclang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020010120191221
184720096751 0 0128275 792 1576vecclang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020010120191221
185299296751 0 0128275 792 1576vecclang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020010120191221
1911040117348 0 0149171 792 1576vecclang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020010120191221
213961681386 0 0111292 808 1608vecgcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020010120191221
240211281747 0 0111372 808 1608vecgcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020010120191221
242528065529 0 094228 800 1576vecgcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020010120191221
27621065637947 0 069939 792 1576refclang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020010120191221
27639254430273 0 062035 792 1576refclang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020010120191221
27639372830273 0 062035 792 1576refclang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020010120191221
27829334442963 0 074955 792 1576refclang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020010120191221
37072083214431 0 043217 784 1576refclang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020010120191221
37961027212349 0 041004 800 1576refgcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020010120191221
37988019216187 0 046092 808 1608refgcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020010120191221
41035904016001 0 045604 808 1608refgcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020010120191221
437069664112300 0 0143470 816 1608refgcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020010120191221

Compiler output

Implementation: avx
Security model: unknown
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
bm.c: bm.c:133:30: error: argument to '__builtin_ia32_vec_ext_v4di' must be a constant integer
bm.c: out[ (4*j + k)*16 + i ] = (vec256_extract(buf[i], j) >> (k*16)) & GFMASK;
bm.c: ^~~~~~~~~~~~~~~~~~~~~~~~~
bm.c: ./vec256.h:31:41: note: expanded from macro 'vec256_extract'
bm.c: #define vec256_extract(a,i) ((uint64_t) _mm256_extract_epi64((vec256) (a),(i)))
bm.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
bm.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/avxintrin.h:2034:14: note: expanded from macro '_mm256_extract_epi64'
bm.c: (long long)__builtin_ia32_vec_ext_v4di((__v4di)(__m256i)(X), (int)(N))
bm.c: ^ ~~~~~~~~
bm.c: 1 error generated.

Number of similar (compiler,implementation) pairs: 5, namely:
CompilerImplementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE avx
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE avx
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE avx
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE avx
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE avx

Compiler output

Implementation: avx
Security model: unknown
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE
bm.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/8/include/immintrin.h:41,
bm.c: from vec256.h:11,
bm.c: from bm.h:10,
bm.c: from bm.c:6:
bm.c: /usr/lib/gcc/x86_64-linux-gnu/8/include/avxintrin.h: In function 'bm':
bm.c: /usr/lib/gcc/x86_64-linux-gnu/8/include/avxintrin.h:524:20: error: the last argument must be a 1-bit immediate
bm.c: return (__m128i) __builtin_ia32_vextractf128_si256 ((__v8si)__X, __N);
bm.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
bm.c: In file included from vec128.h:12,
bm.c: from bm.h:9,
bm.c: from bm.c:6:
bm.c: /usr/lib/gcc/x86_64-linux-gnu/8/include/smmintrin.h:456:10: error: selector must be an integer constant in the range 0..1
bm.c: return __builtin_ia32_vec_ext_v2di ((__v2di)__X, __N);
bm.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Number of similar (compiler,implementation) pairs: 3, namely:
CompilerImplementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE avx
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE avx
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE avx

Compiler output

Implementation: sse
Security model: unknown
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
bm.c: bm.c:129:30: error: argument to '__builtin_ia32_vec_ext_v2di' must be a constant integer
bm.c: out[ (4*j + k)*16 + i ] = (vec128_extract(buf[i], j) >> (k*16)) & GFMASK;
bm.c: ^~~~~~~~~~~~~~~~~~~~~~~~~
bm.c: ./vec128.h:26:42: note: expanded from macro 'vec128_extract'
bm.c: #define vec128_extract(a, i) ((uint64_t) _mm_extract_epi64((vec128) (a), (i)))
bm.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
bm.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/smmintrin.h:1097:14: note: expanded from macro '_mm_extract_epi64'
bm.c: (long long)__builtin_ia32_vec_ext_v2di((__v2di)(__m128i)(X), (int)(N))
bm.c: ^ ~~~~~~~~
bm.c: 1 error generated.

Number of similar (compiler,implementation) pairs: 5, namely:
CompilerImplementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE sse
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE sse
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE sse
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE sse
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE sse

Namespace violations

Implementation: avx
Security model: unknown
Compiler: gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE
benes.o benes T
benes.o load_bits T
bm.o bm T
consts.o MASK0_0 R
consts.o MASK0_1 R
consts.o MASK1_0 R
consts.o MASK1_1 R
consts.o MASK2_0 R
consts.o MASK2_1 R
consts.o MASK3_0 R
consts.o MASK3_1 R
consts.o MASK4_0 R
consts.o MASK4_1 R
consts.o MASK5_0 R
consts.o MASK5_1 R
controlbits.o controlbits T
controlbits.o sort_63b T
decrypt.o decrypt T
encrypt.o encrypt T
fft.o fft T
fft_tr.o fft_tr T
gf.o GF_mul T
gf.o gf_frac T
gf.o gf_inv T
gf.o gf_iszero T
gf.o gf_mul T
pk_gen.o pk_gen T
sk_gen.o genpoly_gen T
sk_gen.o perm_check T
syndrome_asm.o _syndrome_asm T
syndrome_asm.o syndrome_asm T
transpose_64x128_sp_asm.o _transpose_64x128_sp_asm T
transpose_64x128_sp_asm.o transpose_64x128_sp_asm T
transpose_64x256_sp_asm.o _transpose_64x256_sp_asm T
transpose_64x256_sp_asm.o transpose_64x256_sp_asm T
update_asm.o _update_asm T
update_asm.o update_asm T
vec128_mul_asm.o _vec128_mul_asm T
vec128_mul_asm.o vec128_mul_asm T
vec256.o vec256_inv T
vec256.o vec256_sq T
vec256_ama_asm.o _vec256_ama_asm T
vec256_ama_asm.o vec256_ama_asm T
vec256_maa_asm.o _vec256_maa_asm T
vec256_maa_asm.o vec256_maa_asm T
vec256_mul_asm.o _vec256_mul_asm T
vec256_mul_asm.o vec256_mul_asm T
vec_reduce_asm.o _vec_reduce_asm T
vec_reduce_asm.o vec_reduce_asm T

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE avx

Namespace violations

Implementation: ref
Security model: unknown
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
benes.o apply_benes T
benes.o support_gen T
bm.o bm T
controlbits.o controlbits T
controlbits.o sort_63b T
decrypt.o decrypt T
encrypt.o encrypt T
encrypt.o syndrome T
gf.o GF_mul T
gf.o gf_add T
gf.o gf_frac T
gf.o gf_inv T
gf.o gf_iszero T
gf.o gf_mul T
pk_gen.o pk_gen T
root.o eval T
root.o root T
sk_gen.o genpoly_gen T
sk_gen.o perm_check T
synd.o synd T
transpose.o transpose_64x64 T
util.o bitrev T
util.o load2 T
util.o load4 T
util.o load8 T
util.o store2 T
util.o store8 T

Number of similar (compiler,implementation) pairs: 9, namely:
CompilerImplementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE ref
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE ref
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE ref
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE ref
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE ref
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE ref
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE ref
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE ref
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE ref

Namespace violations

Implementation: sse
Security model: unknown
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE
benes.o benes T
benes.o load_bits T
bm.o bm T
consts.o MASK0_0 R
consts.o MASK0_1 R
consts.o MASK1_0 R
consts.o MASK1_1 R
consts.o MASK2_0 R
consts.o MASK2_1 R
consts.o MASK3_0 R
consts.o MASK3_1 R
consts.o MASK4_0 R
consts.o MASK4_1 R
consts.o MASK5_0 R
consts.o MASK5_1 R
controlbits.o controlbits T
controlbits.o sort_63b T
decrypt.o decrypt T
encrypt.o encrypt T
fft.o fft T
fft_tr.o fft_tr T
gf.o GF_mul T
gf.o gf_frac T
gf.o gf_inv T
gf.o gf_iszero T
gf.o gf_mul T
pk_gen.o pk_gen T
sk_gen.o genpoly_gen T
sk_gen.o perm_check T
syndrome_asm.o _syndrome_asm T
syndrome_asm.o syndrome_asm T
transpose_64x128_sp_asm.o _transpose_64x128_sp_asm T
transpose_64x128_sp_asm.o transpose_64x128_sp_asm T
update_asm.o _update_asm T
update_asm.o update_asm T
vec128.o vec128_inv T
vec128.o vec128_sq T
vec128_mul_asm.o _vec128_mul_asm T
vec128_mul_asm.o vec128_mul_asm T
vec_reduce_asm.o _vec_reduce_asm T
vec_reduce_asm.o vec_reduce_asm T

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE sse
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE sse
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE sse
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE sse

Namespace violations

Implementation: vec
Security model: unknown
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
benes.o benes T
bm.o bm T
controlbits.o controlbits T
controlbits.o sort_63b T
decrypt.o decrypt T
encrypt.o encrypt T
encrypt.o syndrome T
fft.o fft T
fft_tr.o fft_tr T
gf.o GF_mul T
gf.o gf_frac T
gf.o gf_inv T
gf.o gf_iszero T
gf.o gf_mul T
pk_gen.o pk_gen T
sk_gen.o genpoly_gen T
sk_gen.o perm_check T
vec.o vec_inv T
vec.o vec_mul T
vec.o vec_sq T

Number of similar (compiler,implementation) pairs: 9, namely:
CompilerImplementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE vec
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE vec
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE vec
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE vec
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE vec
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE vec
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE vec
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE vec
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE vec