Implementation notes: amd64, ghf51, crypto_sign/rainbow6a

Computer: ghf51
Architecture: amd64
CPU ID: AuthenticAMD-00810f81-178bfbff
SUPERCOP version: 20200702
Operation: crypto_sign
Primitive: rainbow6a
Time     | Object size      | Test size          | Implementation | Compiler | Benchmark date | SUPERCOP version
2605174  | 44499 8 1048660  | 59759 896 1050240  | avx2     | clang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20200716 | 20200702
2854618  | 34279 8 1048660  | 48865 888 1050240  | avx2     | clang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20200716 | 20200702
2954406  | 65026 8 1048676  | 70896 904 1050304  | avx2     | gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20200716 | 20200702
3124108  | 44499 8 1048660  | 59759 896 1050240  | avx2     | clang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20200716 | 20200702
3206528  | 44348 8 1048676  | 56032 904 1050304  | avx2     | gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20200716 | 20200702
3217422  | 48057 8 1048660  | 61239 896 1050240  | avx2     | clang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20200716 | 20200702
3302000  | 42944 8 1048676  | 54152 904 1050304  | avx2     | gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20200716 | 20200702
3661112  | 26437 8 1048676  | 42081 904 1050272  | avx2     | gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20200716 | 20200702
11006554 | 52608 0 1048644  | 74689 888 1050304  | portable | gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20200716 | 20200702
11584976 | 48626 0 1048628  | 70279 888 1050240  | portable | clang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20200716 | 20200702
11719838 | 55434 0 1048628  | 77159 888 1050240  | portable | clang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20200716 | 20200702
11780106 | 38399 0 1048628  | 60207 888 1050240  | portable | clang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20200716 | 20200702
12066184 | 38399 0 1048628  | 60207 888 1050240  | portable | clang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20200716 | 20200702
12149592 | 27962 0 1048628  | 47177 880 1050240  | portable | clang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20200716 | 20200702
12380238 | 45009 0 1048644  | 65392 896 1050304  | portable | gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20200716 | 20200702
12775906 | 46595 0 1048644  | 67073 888 1050304  | portable | gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20200716 | 20200702
21081138 | 80535 0 1048628  | 100143 888 1050240 | ref      | clang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20200716 | 20200702
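
To put the Time column in perspective, the following minimal sketch compares the fastest entry for each implementation (2605174 for avx2, 11006554 for portable, 21081138 for ref, all read from the rows above). The dictionary and the `speedup` helper are illustrative names, not part of SUPERCOP, and the fastest rows were built with different compilers, so the ratios are only a rough guide.

```python
# Fastest Time value per implementation, copied from the table above.
best_time = {
    "avx2": 2605174,      # clang -march=native -O
    "portable": 11006554, # gcc -march=native -mtune=native -O3
    "ref": 21081138,      # clang -march=native -O2 (only ref row listed)
}


def speedup(baseline: str, candidate: str) -> float:
    """Return how many times faster `candidate` is than `baseline`."""
    return best_time[baseline] / best_time[candidate]


for impl in ("portable", "ref"):
    print(f"avx2 vs {impl}: {speedup(impl, 'avx2'):.2f}x")
```

On these numbers avx2 comes out roughly 4.2 times faster than portable and roughly 8.1 times faster than ref.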

Test failure

Implementation: portable
Security model: unknown
Compiler: gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE
error 142
Alarm clock

Number of similar (compiler,implementation) pairs: 9, namely:
Compiler | Implementations
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE portable
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE ref
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE ref
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE ref
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE ref
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE ref
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE ref
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE ref
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE ref
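
The pair "error 142" and "Alarm clock" in the test failure above is consistent with a process killed by SIGALRM: shells conventionally report death by signal as 128 plus the signal number, SIGALRM is signal 14 on Linux, and "Alarm clock" is the standard description of that signal. This suggests the test ran past a time limit rather than producing a wrong answer. The sketch below decodes such a status, assuming "error 142" is a shell-style exit status; `describe_exit_status` is a hypothetical helper, not a SUPERCOP function.

```python
import signal


def describe_exit_status(status: int) -> str:
    """Decode a shell-style exit status (128 + N means killed by signal N)."""
    if status > 128:
        signum = status - 128
        try:
            sig = signal.Signals(signum)
        except ValueError:
            return f"terminated by unknown signal {signum}"
        # strsignal gives the platform's description, e.g. "Alarm clock".
        return f"terminated by {sig.name}: {signal.strsignal(signum)}"
    return f"exited with status {status}"


print(describe_exit_status(142))
```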

Compiler output

Implementation: avx2
Security model: unknown
Compiler: clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
mpkc_avx2.c: In file included from mpkc_avx2.c:4:
mpkc_avx2.c: In file included from ./blas.h:21:
mpkc_avx2.c: ./blas_avx2.h:137:19: error: always_inline function '_mm256_load_si256' requires target feature 'avx', but would be inlined into function 'gf16mat_prod_avx2' that is compiled without support for 'avx'
mpkc_avx2.c: __m256i mask_f = _mm256_load_si256( (__m256i*)__mask_low);
mpkc_avx2.c: ^
mpkc_avx2.c: ./blas_avx2.h:142:40: error: always_inline function '_mm256_setzero_si256' requires target feature 'avx', but would be inlined into function 'gf16mat_prod_avx2' that is compiled without support for 'avx'
mpkc_avx2.c: for(unsigned i=0;i<n_ymm;i++) r0[i] = _mm256_setzero_si256();
mpkc_avx2.c: ^
mpkc_avx2.c: ./blas_avx2.h:143:40: error: always_inline function '_mm256_setzero_si256' requires target feature 'avx', but would be inlined into function 'gf16mat_prod_avx2' that is compiled without support for 'avx'
mpkc_avx2.c: for(unsigned i=0;i<n_ymm;i++) r1[i] = _mm256_setzero_si256();
mpkc_avx2.c: ^
mpkc_avx2.c: ./blas_avx2.h:148:32: error: always_inline function '_mm256_load_si256' requires target feature 'avx', but would be inlined into function 'gf16mat_prod_avx2' that is compiled without support for 'avx'
mpkc_avx2.c: __m256i lx = tbl32_gf16_log( _mm256_load_si256((__m256i*)(x+i*32)) );
mpkc_avx2.c: ^
mpkc_avx2.c: ./blas_avx2.h:149:3: error: always_inline function '_mm256_store_si256' requires target feature 'avx', but would be inlined into function 'gf16mat_prod_avx2' that is compiled without support for 'avx'
mpkc_avx2.c: _mm256_store_si256((__m256i*)(x+i*32),lx);
mpkc_avx2.c: ^
mpkc_avx2.c: ./blas_avx2.h:154:16: error: always_inline function '_mm256_broadcastb_epi8' requires target feature 'avx2', but would be inlined into function 'gf16mat_prod_avx2' that is compiled without support for 'avx2'
mpkc_avx2.c: __m256i ml = _mm256_broadcastb_epi8( _mm_load_si128((__m128i*)x) );
mpkc_avx2.c: ^
mpkc_avx2.c: ./blas_avx2.h:157:18: error: always_inline function '_mm256_loadu_si256' requires target feature 'avx', but would be inlined into function 'gf16mat_prod_avx2' that is compiled without support for 'avx'
mpkc_avx2.c: __m256i inp = _mm256_loadu_si256( (__m256i*)(matA+j*32) );
mpkc_avx2.c: ^
mpkc_avx2.c: ./blas_avx2.h:159:33: error: always_inline function '_mm256_srli_epi16' requires target feature 'avx2', but would be inlined into function 'gf16mat_prod_avx2' that is compiled without support for 'avx2'
mpkc_avx2.c: r1[j] ^= tbl32_gf16_mul_log( _mm256_srli_epi16(inp,4)&mask_f , ml , mask_f );
mpkc_avx2.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler | Implementations
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE avx2
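
Every diagnostic in the compiler output above has the same shape: an intrinsic that requires the 'avx' or 'avx2' target feature is being inlined into a function compiled without it. This is the only avx2 build that fails, and it is also the only avx2 build using -mcpu=native instead of -march=native, so a plausible (unconfirmed) reading is that this flag did not enable AVX2 for this target. The sketch below groups the intrinsics by the feature they need; the regular expression and the `required_features` helper are illustrative, and the sample lines are copied from the log.

```python
import re
from collections import defaultdict

# Matches the clang wording seen in the log above.
DIAG = re.compile(
    r"always_inline function '(\w+)' requires target feature '(\w+)'"
)


def required_features(log_lines):
    """Map each missing target feature to the intrinsics that need it."""
    features = defaultdict(set)
    for line in log_lines:
        match = DIAG.search(line)
        if match:
            intrinsic, feature = match.groups()
            features[feature].add(intrinsic)
    return {feature: sorted(names) for feature, names in features.items()}


sample = [
    "mpkc_avx2.c: ./blas_avx2.h:137:19: error: always_inline function "
    "'_mm256_load_si256' requires target feature 'avx', but would be inlined "
    "into function 'gf16mat_prod_avx2' that is compiled without support for 'avx'",
    "mpkc_avx2.c: ./blas_avx2.h:154:16: error: always_inline function "
    "'_mm256_broadcastb_epi8' requires target feature 'avx2', but would be inlined "
    "into function 'gf16mat_prod_avx2' that is compiled without support for 'avx2'",
    "mpkc_avx2.c: ./blas_avx2.h:159:33: error: always_inline function "
    "'_mm256_srli_epi16' requires target feature 'avx2', but would be inlined "
    "into function 'gf16mat_prod_avx2' that is compiled without support for 'avx2'",
]

print(required_features(sample))
```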

Namespace violations

Implementation: avx2
Security model: unknown
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
blas.o __zero_32 B
gf16.o __gf16_exp R
gf16.o __gf16_inv R
gf16.o __gf16_log R
gf16.o __gf16_mul D
gf16.o __gf16_mulx2 R
gf16.o __gf16_squ R
gf16.o __gf16_squ_sl4 R
gf16.o __gf16_squ_x8 R
gf16.o __gf256_mul R
gf16.o __mask_0x55 R
gf16.o __mask_16 R
gf16.o __mask_low R
hash_utils.o sha2_chain T
hash_utils.o sha2_chain_byte T
hash_utils.o sha2_chain_file T
hash_utils.o sha2_chain_msg T
hash_utils.o sha2_file T
mpkc.o _mpkc_pub_map_gf16 T
mpkc.o _mpkc_pub_map_gf16_n_m T
mpkc.o mpkc_interpolate_gf16 T
mpkc_avx2.o mpkc_pub_map_gf16_avx2 T
mpkc_avx2.o mpkc_pub_map_gf16_n_m_avx2 T
prng_utils.o prng_bytes T
prng_utils.o prng_dump T
prng_utils.o prng_dump_set T
prng_utils.o prng_expand_buffer T
prng_utils.o userrand_bytes T
prng_utils.o userrand_dump_generated T
prng_utils.o userrand_source_file T
rainbow_16.o rainbow_central_map T
rainbow_16.o rainbow_genkey T
rainbow_16.o rainbow_genkey_debug T
rainbow_16.o rainbow_ivs_central_map T
rainbow_16.o rainbow_pubmap_seckey T
rainbow_16.o rainbow_secmap T
rainbow_16.o rainbow_sign T
rainbow_16.o rainbow_verify T
rainbow_16323232_core_avx2.o rainbow_ivs_central_map_16323232_avx2 T
rainbow_16323232_core_avx2.o rainbow_sign_16323232_avx2 T

Number of similar (compiler,implementation) pairs: 8, namely:
Compiler | Implementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE avx2
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE avx2
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE avx2
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE avx2
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE avx2
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE avx2
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE avx2
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE avx2
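
Each line of the namespace-violation listing above appears to follow an nm-like layout: object file, global symbol, and a one-letter symbol type. In nm's convention T is a symbol in the text (code) section, R is read-only data, D is initialized data and B is uninitialized data (BSS). The sketch below parses a few lines copied from the listing and tallies them by type; the three-field layout is an assumption read off the listing, and `parse_violation` and `count_by_type` are hypothetical helper names.

```python
from collections import Counter

# Meaning of the nm type letters that occur in the listing.
NM_TYPES = {
    "T": "text (code)",
    "R": "read-only data",
    "D": "initialized data",
    "B": "uninitialized data (BSS)",
}


def parse_violation(line: str):
    """Split 'object.o symbol TYPE' into its three fields."""
    obj, symbol, kind = line.split()
    return obj, symbol, kind


def count_by_type(lines):
    """Count listed symbols per nm type letter, skipping blank lines."""
    return Counter(parse_violation(line)[2] for line in lines if line.strip())


sample = [
    "blas.o __zero_32 B",
    "gf16.o __gf16_exp R",
    "gf16.o __gf16_mul D",
    "hash_utils.o sha2_chain T",
    "rainbow_16.o rainbow_sign T",
]

for kind, count in sorted(count_by_type(sample).items()):
    print(f"{kind} ({NM_TYPES[kind]}): {count}")
```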

Namespace violations

Implementation: portable
Security model: unknown
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
hash_utils.o sha2_chain T
hash_utils.o sha2_chain_byte T
hash_utils.o sha2_chain_file T
hash_utils.o sha2_chain_msg T
hash_utils.o sha2_file T
prng_utils.o prng_bytes T
prng_utils.o prng_dump T
prng_utils.o prng_dump_set T
prng_utils.o prng_expand_buffer T
prng_utils.o userrand_bytes T
prng_utils.o userrand_dump_generated T
prng_utils.o userrand_source_file T
rainbow_16.o rainbow_central_map T
rainbow_16.o rainbow_genkey T
rainbow_16.o rainbow_genkey_debug T
rainbow_16.o rainbow_ivs_central_map T
rainbow_16.o rainbow_pubmap_seckey T
rainbow_16.o rainbow_secmap T
rainbow_16.o rainbow_sign T
rainbow_16.o rainbow_verify T

Number of similar (compiler,implementation) pairs: 9, namely:
Compiler | Implementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE portable
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE portable
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE portable
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE portable
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE portable
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE portable
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE portable
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE portable
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE ref