Implementation notes: amd64, jasper, crypto_sign/rainbow1c

Computer: jasper
Microarchitecture: amd64; Tremont (906c0)
Architecture: amd64
CPU ID: GenuineIntel-000906c0-20-bfebfbff
SUPERCOP version: 20231107
Operation: crypto_sign
Primitive: rainbow1c
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
398908957116 0 104862874842 908 1050360T:portableclang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2023060820230530
442045726971 0 104864445796 892 1050424T:portablegcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2023060820230530
452022067382 0 104862889212 916 1050360T:portableclang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2023060820230530
453209161598 0 104862881172 916 1050360T:portableclang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2023060820230530
454439866448 0 104862886932 916 1050360T:portableclang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2023060820230530
478622327117 0 104864445540 892 1050424T:portablegcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2023060820230530
90582629239 0 104864426741 892 1050424T:portablegcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2023060820230530
995926612180 0 104862830730 908 1050360T:portableclang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2023060820230530
10040636156141 0 1048644176196 892 1050424T:refgcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2023060820230530
1801762181269 0 1048628102068 916 1050360T:refclang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2023060820230530
1826968471309 0 104862889836 916 1050360T:refclang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2023060820230530
1871512183983 0 1048628103596 916 1050360T:refclang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2023060820230530
2727706923639 0 104864442516 892 1050424T:refgcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2023060820230530
2916460422451 0 104864440876 892 1050424T:refgcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2023060820230530
2927611647377 0 104862865018 908 1050360T:refclang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2023060820230530

Test failure

Implementation: T:portable
Security model: timingleaks
Compiler: gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE
error 111

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:portable

Test failure

Implementation: T:ref
Security model: timingleaks
Compiler: clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
error 142
Alarm clock

Number of similar (compiler,implementation) pairs: 2, namely:
CompilerImplementations
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:ref
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE T:ref

Compiler output

Implementation: T:avx2
Security model: timingleaks
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
rainbow.c: In file included from rainbow.c:4:
rainbow.c: In file included from ./rainbow.h:7:
rainbow.c: In file included from ./blas.h:21:
rainbow.c: ./blas_avx2.h:200:19: error: always_inline function '_mm256_load_si256' requires target feature 'avx', but would be inlined into function 'gf256mat_prod_multab_avx2' that is compiled without support for 'avx'
rainbow.c: __m256i mask_f = _mm256_load_si256((__m256i const *) __mask_low);
rainbow.c: ^
rainbow.c: ./blas_avx2.h:200:19: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
rainbow.c: ./blas_avx2.h:204:39: error: always_inline function '_mm256_setzero_si256' requires target feature 'avx', but would be inlined into function 'gf256mat_prod_multab_avx2' that is compiled without support for 'avx'
rainbow.c: for(unsigned i=0;i<n_ymm;i++) r[i] = _mm256_setzero_si256();
rainbow.c: ^
rainbow.c: ./blas_avx2.h:204:39: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
rainbow.c: ./blas_avx2.h:207:16: error: always_inline function '_mm256_load_si256' requires target feature 'avx', but would be inlined into function 'gf256mat_prod_multab_avx2' that is compiled without support for 'avx'
rainbow.c: __m256i mt = _mm256_load_si256( (__m256i*)( multab + i*32) );
rainbow.c: ^
rainbow.c: ./blas_avx2.h:207:16: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
rainbow.c: ./blas_avx2.h:208:16: error: '__builtin_ia32_permti256' needs target feature avx2
rainbow.c: __m256i ml = _mm256_permute2x128_si256(mt,mt,0x00 );
rainbow.c: ^
rainbow.c: /usr/lib/llvm-11/lib/clang/11.0.1/include/avx2intrin.h:821:12: note: expanded from macro '_mm256_permute2x128_si256'
rainbow.c: (__m256i)__builtin_ia32_permti256((__m256i)(V1), (__m256i)(V2), (int)(M))
rainbow.c: ^
rainbow.c: In file included from rainbow.c:4:
rainbow.c: In file included from ./rainbow.h:7:
rainbow.c: In file included from ./blas.h:21:
rainbow.c: ./blas_avx2.h:209:16: error: '__builtin_ia32_permti256' needs target feature avx2
rainbow.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:avx2
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:avx2
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:avx2
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:avx2

Compiler output

Implementation: T:avx2
Security model: timingleaks
Compiler: clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
rainbow.c: In file included from rainbow.c:4:
rainbow.c: In file included from ./rainbow.h:7:
rainbow.c: In file included from ./blas.h:21:
rainbow.c: In file included from ./blas_avx2.h:13:
rainbow.c: ./blas_sse.h:661:24: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function '_gf256mat_gauss_elim_sse' that is compiled without support for 'ssse3'
rainbow.c: __m128i madd_mask = _mm_shuffle_epi8( is_madd , _mm_set1_epi8(i_r) );
rainbow.c: ^
rainbow.c: In file included from rainbow.c:4:
rainbow.c: In file included from ./rainbow.h:7:
rainbow.c: In file included from ./blas.h:21:
rainbow.c: In file included from ./blas_avx2.h:11:
rainbow.c: In file included from ./gf16_avx2.h:8:
rainbow.c: ./gf16_sse.h:78:9: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'tbl_gf16_squ_x8' that is compiled without support for 'ssse3'
rainbow.c: return _mm_shuffle_epi8(tab_l,a);
rainbow.c: ^
rainbow.c: ./gf16_sse.h:118:15: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'tbl_gf16_mul' that is compiled without support for 'ssse3'
rainbow.c: __m128i la = _mm_shuffle_epi8(log_16,a);
rainbow.c: ^
rainbow.c: ./gf16_sse.h:119:15: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'tbl_gf16_mul' that is compiled without support for 'ssse3'
rainbow.c: __m128i lb = _mm_shuffle_epi8(log_16,b);
rainbow.c: ^
rainbow.c: ./gf16_sse.h:122:15: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'tbl_gf16_mul' that is compiled without support for 'ssse3'
rainbow.c: __m128i r0 = _mm_shuffle_epi8(exp_16, _mm_sub_epi8(la_lb, mask_f&_mm_cmpgt_epi8(la_lb,mask_f) ) );
rainbow.c: ^
rainbow.c: ./gf16_sse.h:85:9: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'tbl_gf16_inv' that is compiled without support for 'ssse3'
rainbow.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:avx2

Compiler output

Implementation: T:avx2
Security model: timingleaks
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE
blas.c: In file included from blas_avx2.h:11,
blas.c: from blas.h:21,
blas.c: from blas.c:1:
blas.c: gf16_avx2.h: In function 'tbl32_gf4_x2':
blas.c: gf16_avx2.h:25:1: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
blas.c: 25 | {
blas.c: | ^
rainbow.c: In file included from blas_avx2.h:11,
rainbow.c: from blas.h:21,
rainbow.c: from rainbow.h:7,
rainbow.c: from rainbow.c:4:
rainbow.c: gf16_avx2.h: In function 'tbl32_gf4_x2':
rainbow.c: gf16_avx2.h:25:1: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
rainbow.c: 25 | {
rainbow.c: | ^
rainbow.c: gf16_avx2.h: In function 'tbl32_gf16_log':
rainbow.c: gf16_avx2.h:70:23: note: the ABI for passing parameters with 32-byte alignment has changed in GCC 4.6
rainbow.c: 70 | static inline __m256i tbl32_gf16_log( __m256i a )
rainbow.c: | ^~~~~~~~~~~~~~
rainbow.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/10/include/immintrin.h:53,
rainbow.c: from blas_avx2.h:6,
rainbow.c: from blas.h:21,
rainbow.c: from rainbow.h:7,
rainbow.c: from rainbow.c:4:
rainbow.c: /usr/lib/gcc/x86_64-linux-gnu/10/include/avx2intrin.h:588:1: error: inlining failed in call to 'always_inline' '_mm256_shuffle_epi8': target specific option mismatch
rainbow.c: 588 | _mm256_shuffle_epi8 (__m256i __X, __m256i __Y)
rainbow.c: | ^~~~~~~~~~~~~~~~~~~
rainbow.c: In file included from blas_avx2.h:11,
rainbow.c: from blas.h:21,
rainbow.c: from rainbow.h:7,
rainbow.c: from rainbow.c:4:
rainbow.c: gf16_avx2.h:73:9: note: called from here
rainbow.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx2
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx2
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx2
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx2