Implementation notes: amd64, hunsnivy, crypto_sign/rainbow3c

Computer: hunsnivy
Microarchitecture: amd64; Ivy Bridge+AES (306a9)
Architecture: amd64
CPU ID: GenuineIntel-000306a9-bfebfbff
SUPERCOP version: 20240625
Operation: crypto_sign
Primitive: rainbow3c
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
1348395586505 0 1048628103890 908 1050360T:portableclang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024070520240625
1422826976004 0 104864495580 892 1050424T:portablegcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024070520240625
1453057031413 0 104864449892 892 1050424T:portablegcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024070520240625
1496528357193 0 104862877284 916 1050360T:portableclang_-mcpu=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024070520240625
1517723357457 0 104862879164 916 1050360T:portableclang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024070520240625
1518076655033 0 104862875036 916 1050360T:portableclang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024070520240625
1562309231581 0 104864449780 892 1050424T:portablegcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024070520240625
22921808147887 0 1048628167012 916 1050360T:refclang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024070520240625

Test failure


error 142
Alarm clock

Number of similar (implementation,compiler) pairs: 10, namely:
ImplementationCompiler
T:portableclang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:portablegcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:refclang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:refclang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:refclang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:refclang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:refgcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:refgcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:refgcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:refgcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)

Compiler output


rainbow.c: In file included from rainbow.c:4:
rainbow.c: In file included from ./rainbow.h:7:
rainbow.c: In file included from ./blas.h:21:
rainbow.c: ./blas_avx2.h:208:16: error: '__builtin_ia32_permti256' needs target feature avx2
rainbow.c:                 __m256i ml = _mm256_permute2x128_si256(mt,mt,0x00 );
rainbow.c:                              ^
rainbow.c: /usr/lib/llvm-11/lib/clang/11.0.1/include/avx2intrin.h:821:12: note: expanded from macro '_mm256_permute2x128_si256'
rainbow.c:   (__m256i)__builtin_ia32_permti256((__m256i)(V1), (__m256i)(V2), (int)(M))
rainbow.c:            ^
rainbow.c: In file included from rainbow.c:4:
rainbow.c: In file included from ./rainbow.h:7:
rainbow.c: In file included from ./blas.h:21:
rainbow.c: ./blas_avx2.h:209:16: error: '__builtin_ia32_permti256' needs target feature avx2
rainbow.c:                 __m256i mh = _mm256_permute2x128_si256(mt,mt,0x11 );
rainbow.c:                              ^
rainbow.c: /usr/lib/llvm-11/lib/clang/11.0.1/include/avx2intrin.h:821:12: note: expanded from macro '_mm256_permute2x128_si256'
rainbow.c:   (__m256i)__builtin_ia32_permti256((__m256i)(V1), (__m256i)(V2), (int)(M))
rainbow.c:            ^
rainbow.c: In file included from rainbow.c:4:
rainbow.c: In file included from ./rainbow.h:7:
rainbow.c: In file included from ./blas.h:21:
rainbow.c: ./blas_avx2.h:212:12: error: always_inline function '_mm256_shuffle_epi8' requires target feature 'avx2', but would be inlined into function 'gf256mat_prod_multab_avx2' that is compiled without support for 'avx2'
rainbow.c:                         r[j] ^= _mm256_shuffle_epi8( ml , inp&mask_f );
rainbow.c:                                 ^
rainbow.c: ./blas_avx2.h:213:38: error: always_inline function '_mm256_srli_epi16' requires target feature 'avx2', but would be inlined into function 'gf256mat_prod_multab_avx2' that is compiled without support for 'avx2'
rainbow.c: ...

Number of similar (implementation,compiler) pairs: 4, namely:
ImplementationCompiler
T:avx2clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:avx2clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:avx2clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:avx2clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)

Compiler output


rainbow.c: In file included from rainbow.c:4:
rainbow.c: In file included from ./rainbow.h:7:
rainbow.c: In file included from ./blas.h:21:
rainbow.c: In file included from ./blas_avx2.h:13:
rainbow.c: ./blas_sse.h:661:24: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function '_gf256mat_gauss_elim_sse' that is compiled without support for 'ssse3'
rainbow.c:                         __m128i madd_mask = _mm_shuffle_epi8( is_madd , _mm_set1_epi8(i_r) );
rainbow.c:                                             ^
rainbow.c: In file included from rainbow.c:4:
rainbow.c: In file included from ./rainbow.h:7:
rainbow.c: In file included from ./blas.h:21:
rainbow.c: In file included from ./blas_avx2.h:11:
rainbow.c: In file included from ./gf16_avx2.h:8:
rainbow.c: ./gf16_sse.h:78:9: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'tbl_gf16_squ_x8' that is compiled without support for 'ssse3'
rainbow.c:         return _mm_shuffle_epi8(tab_l,a);
rainbow.c:                ^
rainbow.c: ./gf16_sse.h:118:15: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'tbl_gf16_mul' that is compiled without support for 'ssse3'
rainbow.c:         __m128i la = _mm_shuffle_epi8(log_16,a);
rainbow.c:                      ^
rainbow.c: ./gf16_sse.h:119:15: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'tbl_gf16_mul' that is compiled without support for 'ssse3'
rainbow.c:         __m128i lb = _mm_shuffle_epi8(log_16,b);
rainbow.c:                      ^
rainbow.c: ./gf16_sse.h:122:15: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'tbl_gf16_mul' that is compiled without support for 'ssse3'
rainbow.c:         __m128i r0 = _mm_shuffle_epi8(exp_16, _mm_sub_epi8(la_lb, mask_f&_mm_cmpgt_epi8(la_lb,mask_f) ) );
rainbow.c:                      ^
rainbow.c: ./gf16_sse.h:85:9: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'tbl_gf16_inv' that is compiled without support for 'ssse3'
rainbow.c: ...

Number of similar (implementation,compiler) pairs: 1, namely:
ImplementationCompiler
T:avx2clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)

Compiler output


rainbow.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/10/include/immintrin.h:53,
rainbow.c:                  from blas_avx2.h:6,
rainbow.c:                  from blas.h:21,
rainbow.c:                  from rainbow.h:7,
rainbow.c:                  from rainbow.c:4:
rainbow.c: gf16_avx2.h: In function 'tbl32_gf16_log':
rainbow.c: /usr/lib/gcc/x86_64-linux-gnu/10/include/avx2intrin.h:588:1: error: inlining failed in call to 'always_inline' '_mm256_shuffle_epi8': target specific option mismatch
rainbow.c:   588 | _mm256_shuffle_epi8 (__m256i __X, __m256i __Y)
rainbow.c:       | ^~~~~~~~~~~~~~~~~~~
rainbow.c: In file included from blas_avx2.h:11,
rainbow.c:                  from blas.h:21,
rainbow.c:                  from rainbow.h:7,
rainbow.c:                  from rainbow.c:4:
rainbow.c: gf16_avx2.h:73:9: note: called from here
rainbow.c:    73 |  return _mm256_shuffle_epi8(tab_l,a);
rainbow.c:       |         ^~~~~~~~~~~~~~~~~~~~~~~~~~~~
rainbow.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/10/include/immintrin.h:53,
rainbow.c:                  from blas_avx2.h:6,
rainbow.c:                  from blas.h:21,
rainbow.c:                  from rainbow.h:7,
rainbow.c:                  from rainbow.c:4:
rainbow.c: /usr/lib/gcc/x86_64-linux-gnu/10/include/avx2intrin.h:588:1: error: inlining failed in call to 'always_inline' '_mm256_shuffle_epi8': target specific option mismatch
rainbow.c:   588 | _mm256_shuffle_epi8 (__m256i __X, __m256i __Y)
rainbow.c:       | ^~~~~~~~~~~~~~~~~~~
rainbow.c: In file included from blas_avx2.h:11,
rainbow.c: ...

Number of similar (implementation,compiler) pairs: 4, namely:
ImplementationCompiler
T:avx2gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:avx2gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:avx2gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:avx2gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)