Implementation notes: amd64, zen3, crypto_sign/rainbow4a

Computer: zen3
Architecture: amd64
CPU ID: AuthenticAMD-00a20f10-178bfbff
SUPERCOP version: 20211108
Operation: crypto_sign
Primitive: rainbow4a
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
83564158145 8 104867667092 948 1050456T:avx2gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2022020720211108
86223040115 8 104867655540 948 1050456T:avx2gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2022020720211108
87385035731 8 104866051970 964 1050360T:avx2clang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2022020720211108
88066039908 8 104866055620 972 1050360T:avx2clang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2022020720211108
919156144285 8 1048660141316 972 1050392T:avx2clang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2022020720211108
929758160471 8 1048660153060 972 1050392T:avx2clang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2022020720211108
110283125528 8 104867641680 940 1050424T:avx2gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2022020720211108
122414839185 8 104867653540 948 1050456T:avx2gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2022020720211108
1749238151896 0 1048628180364 964 1050392T:portableclang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2022020720211108
236286892716 0 1048644116012 940 1050456T:portablegcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2022020720211108
2432691146828 0 1048628175692 964 1050392T:portableclang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2022020720211108
274010053861 0 104862877852 964 1050360T:portableclang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2022020720211108
283463432892 0 104862853402 956 1050360T:portableclang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2022020720211108
294425928482 0 104862849748 964 1050360T:portableclang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2022020720211108
317427021268 0 104864442596 940 1050456T:portablegcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2022020720211108
355087721814 0 104864442572 940 1050456T:portablegcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2022020720211108
58080619956 0 104864429784 932 1050424T:portablegcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2022020720211108
11027504114851 0 1048644138092 940 1050456T:refgcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2022020720211108
1392309283087 0 1048628105972 964 1050360T:refclang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2022020720211108
14932007108034 0 1048628134804 964 1050392T:refclang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2022020720211108
15223973103560 0 1048628130252 964 1050392T:refclang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2022020720211108
3092814422266 0 104864443588 940 1050456T:refgcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2022020720211108
3168310321493 0 104864442284 940 1050456T:refgcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2022020720211108
3201384633159 0 104862854404 964 1050360T:refclang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2022020720211108
3536003830921 0 104862851954 956 1050360T:refclang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2022020720211108

Test failure

Implementation: T:ref
Security model: timingleaks
Compiler: gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE
error 142

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE T:ref

Compiler output

Implementation: T:avx2
Security model: timingleaks
Compiler: clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
mpkc_avx2.c: In file included from mpkc_avx2.c:4:
mpkc_avx2.c: In file included from ./blas.h:21:
mpkc_avx2.c: ./blas_avx2.h:137:19: error: always_inline function '_mm256_load_si256' requires target feature 'avx', but would be inlined into function 'gf16mat_prod_avx2' that is compiled without support for 'avx'
mpkc_avx2.c: __m256i mask_f = _mm256_load_si256( (__m256i*)__mask_low);
mpkc_avx2.c: ^
mpkc_avx2.c: ./blas_avx2.h:137:19: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
mpkc_avx2.c: ./blas_avx2.h:142:40: error: always_inline function '_mm256_setzero_si256' requires target feature 'avx', but would be inlined into function 'gf16mat_prod_avx2' that is compiled without support for 'avx'
mpkc_avx2.c: for(unsigned i=0;i<n_ymm;i++) r0[i] = _mm256_setzero_si256();
mpkc_avx2.c: ^
mpkc_avx2.c: ./blas_avx2.h:142:40: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
mpkc_avx2.c: ./blas_avx2.h:143:40: error: always_inline function '_mm256_setzero_si256' requires target feature 'avx', but would be inlined into function 'gf16mat_prod_avx2' that is compiled without support for 'avx'
mpkc_avx2.c: for(unsigned i=0;i<n_ymm;i++) r1[i] = _mm256_setzero_si256();
mpkc_avx2.c: ^
mpkc_avx2.c: ./blas_avx2.h:143:40: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
mpkc_avx2.c: ./blas_avx2.h:148:32: error: always_inline function '_mm256_load_si256' requires target feature 'avx', but would be inlined into function 'gf16mat_prod_avx2' that is compiled without support for 'avx'
mpkc_avx2.c: __m256i lx = tbl32_gf16_log( _mm256_load_si256((__m256i*)(x+i*32)) );
mpkc_avx2.c: ^
mpkc_avx2.c: ./blas_avx2.h:148:32: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
mpkc_avx2.c: ./blas_avx2.h:148:16: warning: AVX vector argument of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI [-Wpsabi]
mpkc_avx2.c: __m256i lx = tbl32_gf16_log( _mm256_load_si256((__m256i*)(x+i*32)) );
mpkc_avx2.c: ^
mpkc_avx2.c: ./blas_avx2.h:149:3: error: always_inline function '_mm256_store_si256' requires target feature 'avx', but would be inlined into function 'gf16mat_prod_avx2' that is compiled without support for 'avx'
mpkc_avx2.c: _mm256_store_si256((__m256i*)(x+i*32),lx);
mpkc_avx2.c: ^
mpkc_avx2.c: ./blas_avx2.h:149:3: error: AVX vector argument of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
mpkc_avx2.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:avx2