Implementation notes: amd64, whosthere, crypto_sign/rainbow4a

Computer: whosthere
Microarchitecture: amd64; KabyLake (806e9)
Architecture: amd64
CPU ID: GenuineIntel-000806e9-bfebfbff
SUPERCOP version: 20221122
Operation: crypto_sign
Primitive: rainbow4a
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
140018823812 8 104867640197 900 1050424T:avx2gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2022101620221005
146109357344 8 104867665540 900 1050424T:avx2gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2022101620221005
159651138794 8 104867654380 900 1050424T:avx2gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2022101620221005
161650037675 8 104867652092 900 1050424T:avx2gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2022101620221005
166873658043 8 104866070356 924 1050392T:avx2clang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2022101620221005
168665456145 8 104866069036 924 1050392T:avx2clang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2022101620221005
172011934541 8 104866049866 916 1050360T:avx2clang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2022101620221005
299683360270 0 104862883636 916 1050392T:portableclang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2022101620221005
340123652875 0 104862875548 916 1050360T:portableclang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2022101620221005
345155826357 0 104862846330 908 1050360T:portableclang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2022101620221005
347335620841 0 104864441700 892 1050424T:portablegcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2022101620221005
352830147658 0 104862870932 916 1050392T:portableclang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2022101620221005
407957121500 0 104864441948 892 1050424T:portablegcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2022101620221005
462609630106 8 104866046066 916 1050360T:avx2clang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2022101620221005
14785377140836 0 1048644163652 892 1050424T:refgcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2022101620221005
154309739069 0 104864428629 892 1050424T:portablegcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2022101620221005
2315436112024 0 104862832538 908 1050360T:portableclang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2022101620221005

Test failure

Implementation: T:portable
Security model: timingleaks
Compiler: gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE
error 111

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:portable

Test failure

Implementation: T:ref
Security model: timingleaks
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
error 142
Alarm clock

Number of similar (compiler,implementation) pairs: 8, namely:
CompilerImplementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:ref
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:ref
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:ref
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:ref
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:ref
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:ref
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE T:ref
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE T:ref

Compiler output

Implementation: T:avx2
Security model: timingleaks
Compiler: clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
mpkc.c: In file included from mpkc.c:4:
mpkc.c: In file included from ./blas.h:21:
mpkc.c: ./blas_avx2.h:59:18: error: always_inline function '_mm256_load_si256' requires target feature 'avx', but would be inlined into function 'gf16v_madd_avx2' that is compiled without support for 'avx'
mpkc.c: __m256i m_tab = _mm256_load_si256( (__m256i*) (__gf16_mul + 32*b) );
mpkc.c: ^
mpkc.c: ./blas_avx2.h:59:18: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
mpkc.c: ./blas_avx2.h:60:15: error: '__builtin_ia32_permti256' needs target feature avx2
mpkc.c: __m256i ml = _mm256_permute2x128_si256( m_tab , m_tab , 0 );
mpkc.c: ^
mpkc.c: /usr/lib/llvm-11/lib/clang/11.0.1/include/avx2intrin.h:821:12: note: expanded from macro '_mm256_permute2x128_si256'
mpkc.c: (__m256i)__builtin_ia32_permti256((__m256i)(V1), (__m256i)(V2), (int)(M))
mpkc.c: ^
mpkc.c: In file included from mpkc.c:4:
mpkc.c: In file included from ./blas.h:21:
mpkc.c: ./blas_avx2.h:61:15: error: '__builtin_ia32_permti256' needs target feature avx2
mpkc.c: __m256i mh = _mm256_permute2x128_si256( m_tab , m_tab , 0x11 );
mpkc.c: ^
mpkc.c: /usr/lib/llvm-11/lib/clang/11.0.1/include/avx2intrin.h:821:12: note: expanded from macro '_mm256_permute2x128_si256'
mpkc.c: (__m256i)__builtin_ia32_permti256((__m256i)(V1), (__m256i)(V2), (int)(M))
mpkc.c: ^
mpkc.c: In file included from mpkc.c:4:
mpkc.c: In file included from ./blas.h:21:
mpkc.c: ./blas_avx2.h:62:17: error: always_inline function '_mm256_load_si256' requires target feature 'avx', but would be inlined into function 'gf16v_madd_avx2' that is compiled without support for 'avx'
mpkc.c: __m256i mask = _mm256_load_si256( (__m256i*) __mask_low );
mpkc.c: ^
mpkc.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:avx2