Implementation notes: amd64, firefly, crypto_sign/rainbow4a

Computer: firefly
Architecture: amd64
CPU ID: AuthenticAMD-00800f12-178bfbff
SUPERCOP version: 20201130
Operation: crypto_sign
Primitive: rainbow4a
    Time  Object size       Test size           Implementation  Compiler                                                                             Benchmark date  SUPERCOP version
 1097054  49481 8 1048644   63739 904 1050200   T:avx2          clang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE  20201213        20201130
 1529155  60214 8 1048676   68377 920 1050296   T:avx2          gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE         20201213        20201130
 1564437  48697 8 1048644   62067 904 1050200   T:avx2          clang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE  20201213        20201130
 1571567  48697 8 1048644   62067 904 1050200   T:avx2          clang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE   20201213        20201130
 1587391  36430 8 1048644   50793 896 1050200   T:avx2          clang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE  20201213        20201130
 2414977  44203 8 1048676   55817 920 1050296   T:avx2          gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE         20201213        20201130
 2443589  42582 8 1048676   53841 920 1050296   T:avx2          gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE          20201213        20201130
 2563028  26051 8 1048676   41782 920 1050264   T:avx2          gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE         20201213        20201130
 3983186  40960 0 1048644   61649 912 1050296   T:portable      gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE         20201213        20201130
 4227170  39906 0 1048644   60353 912 1050296   T:portable      gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE          20201213        20201130
 4645540  34014 0 1048612   55379 896 1050200   T:portable      clang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE   20201213        20201130
 4704512  20517 0 1048612   39937 888 1050200   T:portable      clang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE  20201213        20201130
 5085507  31023 0 1048612   52003 896 1050200   T:portable      clang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE  20201213        20201130
 5268058  32381 0 1048612   54251 896 1050200   T:portable      clang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE  20201213        20201130
 6304208  31023 0 1048612   52003 896 1050200   T:portable      clang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE   20201213        20201130
 8305162  9142 0 1048644    28822 912 1050264   T:portable      gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE         20201213        20201130
16600986  131858 0 1048644  154393 912 1050296  T:ref           gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE         20201213        20201130
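
As a rough guide to what the timings imply, the sketch below computes how much faster two of the listed builds are than the reference build. It assumes the Time values read 1097054 (T:avx2, clang -O3), 3983186 (fastest T:portable, gcc -O2) and 16600986 (T:ref, gcc -O3), and that they are cycle counts; the dictionary keys and the helper name `speedup_over` are made up for illustration and are not part of the SUPERCOP output.

```python
# Time values as read from the table above (assumed to be cycle counts).
timings = {
    "T:avx2 / clang -O3": 1097054,
    "T:portable / gcc -O2": 3983186,
    "T:ref / gcc -O3": 16600986,
}


def speedup_over(baseline: str, table: dict) -> dict:
    """Return baseline_time / time for every entry (higher means faster)."""
    base = table[baseline]
    return {name: base / t for name, t in table.items()}


if __name__ == "__main__":
    for name, factor in speedup_over("T:ref / gcc -O3", timings).items():
        print(f"{name:22s} {factor:5.1f}x")
```

Under these assumptions the avx2 build comes out roughly 15 times faster than ref, and the fastest portable build roughly 4 times faster.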

Test failure

Implementation: T:portable
Security model: timingleaks
Compiler: gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE
error 111

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler  Implementations
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:portable

Test failure

Implementation: T:ref
Security model: timingleaks
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
error 142
Alarm clock

Number of similar (compiler,implementation) pairs: 8, namely:
Compiler  Implementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:ref
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:ref
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:ref
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:ref
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:ref
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:ref
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE T:ref
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE T:ref
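
The "error 142" and "Alarm clock" pair reported for T:ref is consistent with the common shell convention that a process killed by signal N is reported with status 128 + N: 142 - 128 = 14, which is SIGALRM on Linux, and "Alarm clock" is the usual description of that signal. That would suggest the test ran into a time limit, although the log itself does not say so. The sketch below decodes such a status; the function name `describe_status` is hypothetical, and the 128 + N rule is an assumption about how the status was produced.

```python
import signal


def describe_status(status: int) -> str:
    """Interpret a shell-style exit status.

    Values above 128 are treated as 'killed by signal (status - 128)',
    which is a shell convention and an assumption here.
    """
    if status > 128:
        try:
            sig = signal.Signals(status - 128)
        except ValueError:
            return f"exit status {status} (no matching signal)"
        return f"killed by {sig.name} (signal {sig.value})"
    return f"exit status {status}"


if __name__ == "__main__":
    for status in (111, 142):
        print(status, "->", describe_status(status))
```

With this reading, 111 (the T:portable failure) is an ordinary non-zero exit status rather than a signal.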

Compiler output

Implementation: T:avx2
Security model: timingleaks
Compiler: clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
mpkc_avx2.c: In file included from mpkc_avx2.c:4:
mpkc_avx2.c: In file included from ./blas.h:21:
mpkc_avx2.c: ./blas_avx2.h:137:19: error: always_inline function '_mm256_load_si256' requires target feature 'sse4.2', but would be inlined into function 'gf16mat_prod_avx2' that is compiled without support for 'sse4.2'
mpkc_avx2.c: __m256i mask_f = _mm256_load_si256( (__m256i*)__mask_low);
mpkc_avx2.c: ^
mpkc_avx2.c: ./blas_avx2.h:142:40: error: always_inline function '_mm256_setzero_si256' requires target feature 'sse4.2', but would be inlined into function 'gf16mat_prod_avx2' that is compiled without support for 'sse4.2'
mpkc_avx2.c: for(unsigned i=0;i<n_ymm;i++) r0[i] = _mm256_setzero_si256();
mpkc_avx2.c: ^
mpkc_avx2.c: ./blas_avx2.h:143:40: error: always_inline function '_mm256_setzero_si256' requires target feature 'sse4.2', but would be inlined into function 'gf16mat_prod_avx2' that is compiled without support for 'sse4.2'
mpkc_avx2.c: for(unsigned i=0;i<n_ymm;i++) r1[i] = _mm256_setzero_si256();
mpkc_avx2.c: ^
mpkc_avx2.c: ./blas_avx2.h:148:32: error: always_inline function '_mm256_load_si256' requires target feature 'sse4.2', but would be inlined into function 'gf16mat_prod_avx2' that is compiled without support for 'sse4.2'
mpkc_avx2.c: __m256i lx = tbl32_gf16_log( _mm256_load_si256((__m256i*)(x+i*32)) );
mpkc_avx2.c: ^
mpkc_avx2.c: ./blas_avx2.h:149:3: error: always_inline function '_mm256_store_si256' requires target feature 'sse4.2', but would be inlined into function 'gf16mat_prod_avx2' that is compiled without support for 'sse4.2'
mpkc_avx2.c: _mm256_store_si256((__m256i*)(x+i*32),lx);
mpkc_avx2.c: ^
mpkc_avx2.c: ./blas_avx2.h:154:16: error: always_inline function '_mm256_broadcastb_epi8' requires target feature 'sse4.2', but would be inlined into function 'gf16mat_prod_avx2' that is compiled without support for 'sse4.2'
mpkc_avx2.c: __m256i ml = _mm256_broadcastb_epi8( _mm_load_si128((__m128i*)x) );
mpkc_avx2.c: ^
mpkc_avx2.c: ./blas_avx2.h:157:18: error: always_inline function '_mm256_loadu_si256' requires target feature 'sse4.2', but would be inlined into function 'gf16mat_prod_avx2' that is compiled without support for 'sse4.2'
mpkc_avx2.c: __m256i inp = _mm256_loadu_si256( (__m256i*)(matA+j*32) );
mpkc_avx2.c: ^
mpkc_avx2.c: ./blas_avx2.h:159:33: error: always_inline function '_mm256_srli_epi16' requires target feature 'sse4.2', but would be inlined into function 'gf16mat_prod_avx2' that is compiled without support for 'sse4.2'
mpkc_avx2.c: r1[j] ^= tbl32_gf16_mul_log( _mm256_srli_epi16(inp,4)&mask_f , ml , mask_f );
mpkc_avx2.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler  Implementations
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:avx2
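
Since the compiler output above is truncated ("..."), it can help to tally which intrinsics the reported errors name. The following is a minimal sketch that pulls the header, line, column, intrinsic, enclosing function and target feature out of diagnostics of this shape. The regular expression and the name `parse_diagnostics` are assumptions written for this log, not a SUPERCOP or clang tool, and the sample is limited to lines copied from the output above.

```python
import re
from collections import Counter

# Matches clang's "always_inline function ... requires target feature" errors.
DIAG = re.compile(
    r"(?P<file>\S+):(?P<line>\d+):(?P<col>\d+): error: "
    r"always_inline function '(?P<intrinsic>\w+)' "
    r"requires target feature '(?P<feature>[^']+)', "
    r"but would be inlined into function '(?P<caller>\w+)'"
)


def parse_diagnostics(lines):
    """Return one dict per matching diagnostic line; other lines are skipped."""
    found = []
    for text in lines:
        m = DIAG.search(text)
        if m:
            entry = m.groupdict()
            entry["line"] = int(entry["line"])
            entry["col"] = int(entry["col"])
            found.append(entry)
    return found


sample = [
    "mpkc_avx2.c: ./blas_avx2.h:137:19: error: always_inline function '_mm256_load_si256' requires target feature 'sse4.2', but would be inlined into function 'gf16mat_prod_avx2' that is compiled without support for 'sse4.2'",
    "mpkc_avx2.c: __m256i mask_f = _mm256_load_si256( (__m256i*)__mask_low);",
    "mpkc_avx2.c: ./blas_avx2.h:142:40: error: always_inline function '_mm256_setzero_si256' requires target feature 'sse4.2', but would be inlined into function 'gf16mat_prod_avx2' that is compiled without support for 'sse4.2'",
    "mpkc_avx2.c: ./blas_avx2.h:143:40: error: always_inline function '_mm256_setzero_si256' requires target feature 'sse4.2', but would be inlined into function 'gf16mat_prod_avx2' that is compiled without support for 'sse4.2'",
]

if __name__ == "__main__":
    parsed = parse_diagnostics(sample)
    print(Counter(d["intrinsic"] for d in parsed))
```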

Namespace violations

Implementation: T:avx2
Security model: timingleaks
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
blas.o __zero_32 B
gf16.o __gf16_exp R
gf16.o __gf16_inv R
gf16.o __gf16_log R
gf16.o __gf16_mul D
gf16.o __gf16_mulx2 R
gf16.o __gf16_squ R
gf16.o __gf16_squ_sl4 R
gf16.o __gf16_squ_x8 R
gf16.o __gf256_mul R
gf16.o __mask_0x55 R
gf16.o __mask_16 R
gf16.o __mask_low R
hash_utils.o sha2_chain T
hash_utils.o sha2_chain_byte T
hash_utils.o sha2_chain_file T
hash_utils.o sha2_chain_msg T
hash_utils.o sha2_file T
mpkc.o _mpkc_pub_map_gf16 T
mpkc.o _mpkc_pub_map_gf16_n_m T
mpkc.o mpkc_interpolate_gf16 T
mpkc_avx2.o mpkc_pub_map_gf16_avx2 T
mpkc_avx2.o mpkc_pub_map_gf16_n_m_avx2 T
prng_utils.o prng_bytes T
prng_utils.o prng_dump T
prng_utils.o prng_dump_set T
prng_utils.o prng_expand_buffer T
prng_utils.o userrand_bytes T
prng_utils.o userrand_dump_generated T
prng_utils.o userrand_source_file T
rainbow_16.o rainbow_central_map T
rainbow_16.o rainbow_genkey T
rainbow_16.o rainbow_genkey_debug T
rainbow_16.o rainbow_ivs_central_map T
rainbow_16.o rainbow_pubmap_seckey T
rainbow_16.o rainbow_secmap T
rainbow_16.o rainbow_sign T
rainbow_16.o rainbow_verify T
rainbow_16323232_core_avx2.o rainbow_ivs_central_map_16323232_avx2 T
rainbow_16323232_core_avx2.o rainbow_sign_16323232_avx2 T

Number of similar (compiler,implementation) pairs: 8, namely:
Compiler  Implementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:avx2
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:avx2
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:avx2
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:avx2
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx2
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx2
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx2
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx2
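
Each line of the namespace-violation listing above has three fields: object file, symbol name, and a one-letter type. The sketch below groups the symbols by that letter, assuming the letters follow the `nm` convention (T for code in the text section, R for read-only data, D for initialized data, B for zero-initialized data). That mapping and the helper name `group_by_type` are assumptions, and only a few lines from the listing are used as sample input.

```python
from collections import defaultdict

# Assumed meaning of the type letters (nm convention for global symbols).
NM_TYPES = {
    "T": "text (code)",
    "R": "read-only data",
    "D": "initialized data",
    "B": "BSS (zero-initialized data)",
}


def group_by_type(lines):
    """Map each type letter to a list of (object file, symbol) pairs."""
    groups = defaultdict(list)
    for line in lines:
        parts = line.split()
        if len(parts) != 3:
            continue  # skip anything that is not 'object symbol type'
        obj, symbol, kind = parts
        groups[kind].append((obj, symbol))
    return dict(groups)


sample = [
    "blas.o __zero_32 B",
    "gf16.o __gf16_exp R",
    "gf16.o __gf16_mul D",
    "rainbow_16.o rainbow_sign T",
    "rainbow_16.o rainbow_verify T",
]

if __name__ == "__main__":
    for kind, symbols in group_by_type(sample).items():
        label = NM_TYPES.get(kind, "unknown")
        print(f"{kind} ({label}): {[name for _, name in symbols]}")
```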

Namespace violations

Implementation: T:portable
Security model: timingleaks
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
hash_utils.o sha2_chain T
hash_utils.o sha2_chain_byte T
hash_utils.o sha2_chain_file T
hash_utils.o sha2_chain_msg T
hash_utils.o sha2_file T
prng_utils.o prng_bytes T
prng_utils.o prng_dump T
prng_utils.o prng_dump_set T
prng_utils.o prng_expand_buffer T
prng_utils.o userrand_bytes T
prng_utils.o userrand_dump_generated T
prng_utils.o userrand_source_file T
rainbow_16.o rainbow_central_map T
rainbow_16.o rainbow_genkey T
rainbow_16.o rainbow_genkey_debug T
rainbow_16.o rainbow_ivs_central_map T
rainbow_16.o rainbow_pubmap_seckey T
rainbow_16.o rainbow_secmap T
rainbow_16.o rainbow_sign T
rainbow_16.o rainbow_verify T

Number of similar (compiler,implementation) pairs: 9, namely:
Compiler  Implementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:portable
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:portable
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:portable
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:portable
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:portable
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:portable
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE T:portable
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE T:portable
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:ref