Implementation notes: amd64, h8atom, crypto_sign/rainbow4a

Computer: h8atom
Architecture: amd64
CPU ID: GenuineIntel-00030661-bfebfbff
SUPERCOP version: 20200618
Operation: crypto_sign
Primitive: rainbow4a
Time      Object size      Test size           Implementation  Compiler                                                                             Benchmark date  SUPERCOP version
9927400   56571 0 1048628  76389 880 1050240   portable        clang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE   20200424        20200409
10135853  56244 0 1048628  76181 880 1050240   portable        clang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE  20200424        20200409
11471488  26257 0 1048628  43195 872 1050240   portable        clang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE  20200424        20200409
11742423  45883 0 1048628  64477 880 1050240   portable        clang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE  20200424        20200409
11746546  45883 0 1048628  64477 880 1050240   portable        clang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE   20200424        20200409
15515913  83517 0 1048644  104194 888 1050304  portable        gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE         20200424        20200409
19094439  42007 0 1048644  60426 888 1050304   portable        gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE         20200424        20200409
20985237  41272 0 1048644  59697 896 1050304   portable        gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE          20200424        20200409

Test failure

Implementation: portable
Security model: unknown
Compiler: gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE
error 142
Alarm clock

Number of similar (compiler,implementation) pairs: 10, namely:
Compiler  Implementations
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE portable
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE ref
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE ref
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE ref
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE ref
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE ref
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE ref
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE ref
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE ref
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE ref
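
Note on the failure above: "error 142" together with "Alarm clock" is consistent with the shell convention of reporting a process killed by signal N as exit status 128 + N, with N = SIGALRM (14 on Linux). That would mean the test program hit a time limit rather than producing a wrong result, though the log itself does not say so explicitly. A minimal sketch of that decoding, assuming the 128 + N convention applies here; the helper name signal_from_status is hypothetical and not part of SUPERCOP:

```c
#include <signal.h>

/* Hypothetical helper: recover the signal number from a shell-style
 * exit status of the form 128 + signo. Returns 0 when the status does
 * not look like a signal-induced exit. */
static int signal_from_status(int status)
{
    if (status > 128 && status <= 128 + 64)
        return status - 128;
    return 0;
}
```

With this helper, signal_from_status(142) yields 14, which equals SIGALRM on Linux; glibc describes that signal as "Alarm clock", matching the text in the log.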

Compiler output

Implementation: avx2
Security model: unknown
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
mpkc.c: In file included from mpkc.c:4:
mpkc.c: In file included from ./blas.h:21:
mpkc.c: ./blas_avx2.h:59:18: error: always_inline function '_mm256_load_si256' requires target feature 'xsave', but would be inlined into function 'gf16v_madd_avx2' that is compiled without support for 'xsave'
mpkc.c: __m256i m_tab = _mm256_load_si256( (__m256i*) (__gf16_mul + 32*b) );
mpkc.c: ^
mpkc.c: ./blas_avx2.h:60:15: error: '__builtin_ia32_permti256' needs target feature avx2
mpkc.c: __m256i ml = _mm256_permute2x128_si256( m_tab , m_tab , 0 );
mpkc.c: ^
mpkc.c: /usr/lib/llvm-6.0/lib/clang/6.0.0/include/avx2intrin.h:975:12: note: expanded from macro '_mm256_permute2x128_si256'
mpkc.c: (__m256i)__builtin_ia32_permti256((__m256i)(V1), (__m256i)(V2), (M)); })
mpkc.c: ^
mpkc.c: In file included from mpkc.c:4:
mpkc.c: In file included from ./blas.h:21:
mpkc.c: ./blas_avx2.h:61:15: error: '__builtin_ia32_permti256' needs target feature avx2
mpkc.c: __m256i mh = _mm256_permute2x128_si256( m_tab , m_tab , 0x11 );
mpkc.c: ^
mpkc.c: /usr/lib/llvm-6.0/lib/clang/6.0.0/include/avx2intrin.h:975:12: note: expanded from macro '_mm256_permute2x128_si256'
mpkc.c: (__m256i)__builtin_ia32_permti256((__m256i)(V1), (__m256i)(V2), (M)); })
mpkc.c: ^
mpkc.c: In file included from mpkc.c:4:
mpkc.c: In file included from ./blas.h:21:
mpkc.c: ./blas_avx2.h:62:17: error: always_inline function '_mm256_load_si256' requires target feature 'xsave', but would be inlined into function 'gf16v_madd_avx2' that is compiled without support for 'xsave'
mpkc.c: __m256i mask = _mm256_load_si256( (__m256i*) __mask_low );
mpkc.c: ^
mpkc.c: ./blas_avx2.h:70:10: error: always_inline function '_mm256_loadu_si256' requires target feature 'xsave', but would be inlined into function 'gf16v_madd_avx2' that is compiled without support for 'xsave'
mpkc.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
Compiler  Implementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE avx2
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE avx2
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE avx2
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE avx2

Compiler output

Implementation: avx2
Security model: unknown
Compiler: clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
mpkc.c: In file included from mpkc.c:4:
mpkc.c: In file included from ./blas.h:21:
mpkc.c: ./blas_avx2.h:59:18: error: always_inline function '_mm256_load_si256' requires target feature 'sse4.2', but would be inlined into function 'gf16v_madd_avx2' that is compiled without support for 'sse4.2'
mpkc.c: __m256i m_tab = _mm256_load_si256( (__m256i*) (__gf16_mul + 32*b) );
mpkc.c: ^
mpkc.c: ./blas_avx2.h:60:15: error: '__builtin_ia32_permti256' needs target feature avx2
mpkc.c: __m256i ml = _mm256_permute2x128_si256( m_tab , m_tab , 0 );
mpkc.c: ^
mpkc.c: /usr/lib/llvm-6.0/lib/clang/6.0.0/include/avx2intrin.h:975:12: note: expanded from macro '_mm256_permute2x128_si256'
mpkc.c: (__m256i)__builtin_ia32_permti256((__m256i)(V1), (__m256i)(V2), (M)); })
mpkc.c: ^
mpkc.c: In file included from mpkc.c:4:
mpkc.c: In file included from ./blas.h:21:
mpkc.c: ./blas_avx2.h:61:15: error: '__builtin_ia32_permti256' needs target feature avx2
mpkc.c: __m256i mh = _mm256_permute2x128_si256( m_tab , m_tab , 0x11 );
mpkc.c: ^
mpkc.c: /usr/lib/llvm-6.0/lib/clang/6.0.0/include/avx2intrin.h:975:12: note: expanded from macro '_mm256_permute2x128_si256'
mpkc.c: (__m256i)__builtin_ia32_permti256((__m256i)(V1), (__m256i)(V2), (M)); })
mpkc.c: ^
mpkc.c: In file included from mpkc.c:4:
mpkc.c: In file included from ./blas.h:21:
mpkc.c: ./blas_avx2.h:62:17: error: always_inline function '_mm256_load_si256' requires target feature 'sse4.2', but would be inlined into function 'gf16v_madd_avx2' that is compiled without support for 'sse4.2'
mpkc.c: __m256i mask = _mm256_load_si256( (__m256i*) __mask_low );
mpkc.c: ^
mpkc.c: ./blas_avx2.h:70:10: error: always_inline function '_mm256_loadu_si256' requires target feature 'sse4.2', but would be inlined into function 'gf16v_madd_avx2' that is compiled without support for 'sse4.2'
mpkc.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler  Implementations
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE avx2

Compiler output

Implementation: avx2
Security model: unknown
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE
blas.c: In file included from blas_avx2.h:11:0,
blas.c: from blas.h:21,
blas.c: from blas.c:1:
blas.c: gf16_avx2.h: In function 'tbl32_gf4_x2':
blas.c: gf16_avx2.h:25:1: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
blas.c: {
blas.c: ^
mpkc.c: In file included from blas_avx2.h:11:0,
mpkc.c: from blas.h:21,
mpkc.c: from mpkc.c:4:
mpkc.c: gf16_avx2.h: In function 'tbl32_gf4_x2':
mpkc.c: gf16_avx2.h:25:1: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
mpkc.c: {
mpkc.c: ^
mpkc.c: In file included from blas_avx2.h:11:0,
mpkc.c: from blas.h:21,
mpkc.c: from mpkc.c:4:
mpkc.c: gf16_avx2.h: In function 'tbl32_gf16_log':
mpkc.c: gf16_avx2.h:70:23: note: The ABI for passing parameters with 32-byte alignment has changed in GCC 4.6
mpkc.c: static inline __m256i tbl32_gf16_log( __m256i a )
mpkc.c: ^~~~~~~~~~~~~~
mpkc.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:43:0,
mpkc.c: from blas_avx2.h:6,
mpkc.c: from blas.h:21,
mpkc.c: from mpkc.c:4:
mpkc.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx2intrin.h:588:1: error: inlining failed in call to always_inline '_mm256_shuffle_epi8': target specific option mismatch
mpkc.c: _mm256_shuffle_epi8 (__m256i __X, __m256i __Y)
mpkc.c: ^~~~~~~~~~~~~~~~~~~
mpkc.c: In file included from blas_avx2.h:11:0,
mpkc.c: from blas.h:21,
mpkc.c: from mpkc.c:4:
mpkc.c: gf16_avx2.h:73:9: note: called from here
mpkc.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
Compiler  Implementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE avx2
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE avx2
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE avx2
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE avx2
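
Note on the avx2 failures above: both compilers reject AVX2 intrinsics because the translation unit is compiled without AVX2 enabled (clang: "needs target feature avx2"; gcc: "target specific option mismatch"). This suggests that -march=native does not turn on AVX2 for this CPU. The sketch below shows one common way to keep such code buildable on any target: guard the intrinsic path with the __AVX2__ macro, which gcc and clang predefine only when AVX2 is enabled, and fall back to plain C otherwise. The function xor_bytes is a hypothetical stand-in and is not taken from the rainbow4a sources.

```c
#include <stddef.h>
#include <stdint.h>

#if defined(__AVX2__)
#include <immintrin.h>
#endif

/* Hypothetical helper: XOR n bytes of src into dst.
 * Uses 256-bit AVX2 operations only when the compiler was told that
 * AVX2 is available; otherwise only the scalar loop is compiled. */
static void xor_bytes(uint8_t *dst, const uint8_t *src, size_t n)
{
    size_t i = 0;
#if defined(__AVX2__)
    /* Process 32 bytes per iteration with unaligned loads and stores. */
    for (; i + 32 <= n; i += 32) {
        __m256i a = _mm256_loadu_si256((const __m256i *)(dst + i));
        __m256i b = _mm256_loadu_si256((const __m256i *)(src + i));
        _mm256_storeu_si256((__m256i *)(dst + i), _mm256_xor_si256(a, b));
    }
#endif
    /* Scalar tail, and the whole job when AVX2 is not enabled. */
    for (; i < n; i++)
        dst[i] ^= src[i];
}
```

Both paths compute the same result, so the function behaves identically whether or not the AVX2 branch is compiled in. In SUPERCOP itself a failed build of one implementation is simply recorded, as above, and the remaining implementations are still benchmarked.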

Namespace violations

Implementation: portable
Security model: unknown
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
hash_utils.o sha2_chain T
hash_utils.o sha2_chain_byte T
hash_utils.o sha2_chain_file T
hash_utils.o sha2_chain_msg T
hash_utils.o sha2_file T
prng_utils.o prng_bytes T
prng_utils.o prng_dump T
prng_utils.o prng_dump_set T
prng_utils.o prng_expand_buffer T
prng_utils.o userrand_bytes T
prng_utils.o userrand_dump_generated T
prng_utils.o userrand_source_file T
rainbow_16.o rainbow_central_map T
rainbow_16.o rainbow_genkey T
rainbow_16.o rainbow_genkey_debug T
rainbow_16.o rainbow_ivs_central_map T
rainbow_16.o rainbow_pubmap_seckey T
rainbow_16.o rainbow_secmap T
rainbow_16.o rainbow_sign T
rainbow_16.o rainbow_verify T

Number of similar (compiler,implementation) pairs: 8, namely:
Compiler  Implementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE portable
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE portable
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE portable
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE portable
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE portable
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE portable
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE portable
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE portable
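
Note on the namespace violations above: each listed symbol has type T, meaning it is a global function exported from the object file under an unprefixed name such as rainbow_sign or prng_bytes. Two usual remedies are to make file-local helpers static, so that they export nothing, and to rename the remaining globals with a primitive-specific prefix through the preprocessor. The sketch below illustrates both under stated assumptions: the macro RAINBOW_NAMESPACE, the prefix crypto_sign_rainbow4a_portable_, the helper fold_bytes, and the signature and body given to rainbow_sign are all hypothetical, and the prefix SUPERCOP actually expects may differ.

```c
/* Hypothetical prefix macro; the exact prefix SUPERCOP expects is an
 * assumption here. */
#define RAINBOW_NAMESPACE(name) crypto_sign_rainbow4a_portable_##name

/* From here on, every use of rainbow_sign refers to the prefixed symbol. */
#define rainbow_sign RAINBOW_NAMESPACE(rainbow_sign)

/* File-local helper: static linkage, so it exports no global symbol. */
static unsigned char fold_bytes(const unsigned char *m, unsigned long long len)
{
    unsigned char acc = 0;
    for (unsigned long long i = 0; i < len; i++)
        acc ^= m[i];
    return acc;
}

/* Stand-in body with a made-up signature; the object file exports this
 * as crypto_sign_rainbow4a_portable_rainbow_sign. */
int rainbow_sign(unsigned char *out, const unsigned char *m,
                 unsigned long long len)
{
    out[0] = fold_bytes(m, len);
    return 0;
}
```

Callers keep writing rainbow_sign in source code, while a symbol listing of the object file would show only the prefixed name.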