Test results for amd64, h9ivy, crypto_sign/luov863256pc
[Page version: 20260503 03:45:06]
Measurements for amd64, h9ivy, crypto_sign
Test results for amd64, h9ivy, crypto_sign
Test results for crypto_sign/luov863256pc
Computer: h9ivy
Microarchitecture: amd64; Ivy Bridge+AES (306a9)
Architecture: amd64
CPU ID: GenuineIntel-000306a9-bfebfbff
SUPERCOP version: 20260330
Operation: crypto_sign
Primitive: luov863256pc
Compiler output
LUOV.c: LUOV.c:38:19: error: '__builtin_ia32_permdi256' needs target feature avx2
LUOV.c: 38 | __m256i rrrr = _mm256_permute4x64_epi64(_mm256_loadu_si256((__m256i *)&Q1[col++]),0);
LUOV.c: | ^
LUOV.c: /usr/lib/llvm-19/lib/clang/19/include/avx2intrin.h:3394:13: note: expanded from macro '_mm256_permute4x64_epi64'
LUOV.c: 3394 | ((__m256i)__builtin_ia32_permdi256((__v4di)(__m256i)(V), (int)(M)))
LUOV.c: | ^
LUOV.c: LUOV.c:44:10: error: always_inline function '_mm256_slli_epi64' requires target feature 'avx2', but would be inlined into function 'calculateQ2' that is compiled without support for 'avx2'
LUOV.c: 44 | TJ = _mm256_slli_epi64(TJ,4);
LUOV.c: | ^
LUOV.c: LUOV.c:47:10: error: always_inline function '_mm256_slli_epi64' requires target feature 'avx2', but would be inlined into function 'calculateQ2' that is compiled without support for 'avx2'
LUOV.c: 47 | TJ = _mm256_slli_epi64(TJ,4);
LUOV.c: | ^
LUOV.c: LUOV.c:60:19: error: '__builtin_ia32_permdi256' needs target feature avx2
LUOV.c: 60 | __m256i rrrr = _mm256_permute4x64_epi64(_mm256_loadu_si256((__m256i *)&TempMat[j][i]),0);
LUOV.c: | ^
LUOV.c: /usr/lib/llvm-19/lib/clang/19/include/avx2intrin.h:3394:13: note: expanded from macro '_mm256_permute4x64_epi64'
LUOV.c: 3394 | ((__m256i)__builtin_ia32_permdi256((__v4di)(__m256i)(V), (int)(M)))
LUOV.c: | ^
LUOV.c: LUOV.c:66:10: error: always_inline function '_mm256_slli_epi64' requires target feature 'avx2', but would be inlined into function 'calculateQ2' that is compiled without support for 'avx2'
LUOV.c: 66 | TJ = _mm256_slli_epi64(TJ,4);
LUOV.c: | ^
LUOV.c: LUOV.c:69:10: error: always_inline function '_mm256_slli_epi64' requires target feature 'avx2', but would be inlined into function 'calculateQ2' that is compiled without support for 'avx2'
LUOV.c: 69 | TJ = _mm256_slli_epi64(TJ,4);
LUOV.c: | ^
LUOV.c: 6 errors generated.
Number of similar (implementation,compiler) pairs: 4, namely:
| Implementation | Compiler |
| T:avx2 | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) |
| T:avx2 | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) |
| T:avx2 | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) |
| T:avx2 | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) |
Compiler output
LUOV.c: LUOV.c:38:19: error: '__builtin_ia32_permdi256' needs target feature avx2
LUOV.c: 38 | __m256i rrrr = _mm256_permute4x64_epi64(_mm256_loadu_si256((__m256i *)&Q1[col++]),0);
LUOV.c: | ^
LUOV.c: /usr/lib/llvm-19/lib/clang/19/include/avx2intrin.h:3394:13: note: expanded from macro '_mm256_permute4x64_epi64'
LUOV.c: 3394 | ((__m256i)__builtin_ia32_permdi256((__v4di)(__m256i)(V), (int)(M)))
LUOV.c: | ^
LUOV.c: LUOV.c:38:44: error: always_inline function '_mm256_loadu_si256' requires target feature 'avx', but would be inlined into function 'calculateQ2' that is compiled without support for 'avx'
LUOV.c: 38 | __m256i rrrr = _mm256_permute4x64_epi64(_mm256_loadu_si256((__m256i *)&Q1[col++]),0);
LUOV.c: | ^
LUOV.c: LUOV.c:38:44: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
LUOV.c: LUOV.c:43:69: error: always_inline function '_mm256_setzero_pd' requires target feature 'avx', but would be inlined into function 'calculateQ2' that is compiled without support for 'avx'
LUOV.c: 43 | *((__m256i *)&TempMat[i][k*8+4]) ^= (__m256i) _mm256_blendv_pd(_mm256_setzero_pd(),(__m256d) rrrr,(__m256d)TJ);
LUOV.c: | ^
LUOV.c: LUOV.c:43:69: error: AVX vector return of type '__m256d' (vector of 4 'double' values) without 'avx' enabled changes the ABI
LUOV.c: LUOV.c:43:52: error: always_inline function '_mm256_blendv_pd' requires target feature 'avx', but would be inlined into function 'calculateQ2' that is compiled without support for 'avx'
LUOV.c: 43 | *((__m256i *)&TempMat[i][k*8+4]) ^= (__m256i) _mm256_blendv_pd(_mm256_setzero_pd(),(__m256d) rrrr,(__m256d)TJ);
LUOV.c: | ^
LUOV.c: LUOV.c:43:52: error: AVX vector argument of type '__m256d' (vector of 4 'double' values) without 'avx' enabled changes the ABI
LUOV.c: LUOV.c:44:10: error: always_inline function '_mm256_slli_epi64' requires target feature 'avx2', but would be inlined into function 'calculateQ2' that is compiled without support for 'avx2'
LUOV.c: 44 | TJ = _mm256_slli_epi64(TJ,4);
LUOV.c: | ^
LUOV.c: LUOV.c:44:10: error: AVX vector argument of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
LUOV.c: LUOV.c:46:66: error: always_inline function '_mm256_setzero_pd' requires target feature 'avx', but would be inlined into function 'calculateQ2' that is compiled without support for 'avx'
LUOV.c: 46 | *((__m256i *)&TempMat[i][k*8]) ^= (__m256i) _mm256_blendv_pd(_mm256_setzero_pd(),(__m256d) rrrr,(__m256d)TJ);
LUOV.c: | ^
LUOV.c: ...
Number of similar (implementation,compiler) pairs: 1, namely:
| Implementation | Compiler |
| T:avx2 | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_19.1.7_(3+b1)) |
Compiler output
LUOV.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/14/include/immintrin.h:53,
LUOV.c: from LUOV.h:8,
LUOV.c: from LUOV.c:1:
LUOV.c: /usr/lib/gcc/x86_64-linux-gnu/14/include/avx2intrin.h: In function 'mulByX':
LUOV.c: /usr/lib/gcc/x86_64-linux-gnu/14/include/avx2intrin.h:913:1: error: inlining failed in call to 'always_inline' '_mm256_xor_si256': target specific option mismatch
LUOV.c: 913 | _mm256_xor_si256 (__m256i __A, __m256i __B)
LUOV.c: | ^~~~~~~~~~~~~~~~
LUOV.c: In file included from LinearAlgebra.h:9,
LUOV.c: from LUOV.h:14:
LUOV.c: AVX_Operations.h:466:14: note: called from here
LUOV.c: 466 | *A = _mm256_xor_si256(Temp, _mm256_shuffle_epi8(Table[1],_mm256_and_si256(*Mask,_mm256_srli_epi16(*A,4))));
LUOV.c: | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
LUOV.c: /usr/lib/gcc/x86_64-linux-gnu/14/include/avx2intrin.h:588:1: error: inlining failed in call to 'always_inline' '_mm256_shuffle_epi8': target specific option mismatch
LUOV.c: 588 | _mm256_shuffle_epi8 (__m256i __X, __m256i __Y)
LUOV.c: | ^~~~~~~~~~~~~~~~~~~
LUOV.c: AVX_Operations.h:466:14: note: called from here
LUOV.c: 466 | *A = _mm256_xor_si256(Temp, _mm256_shuffle_epi8(Table[1],_mm256_and_si256(*Mask,_mm256_srli_epi16(*A,4))));
LUOV.c: | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
LUOV.c: /usr/lib/gcc/x86_64-linux-gnu/14/include/avx2intrin.h:179:1: error: inlining failed in call to 'always_inline' '_mm256_and_si256': target specific option mismatch
LUOV.c: 179 | _mm256_and_si256 (__m256i __A, __m256i __B)
LUOV.c: | ^~~~~~~~~~~~~~~~
LUOV.c: AVX_Operations.h:466:14: note: called from here
LUOV.c: 466 | *A = _mm256_xor_si256(Temp, _mm256_shuffle_epi8(Table[1],_mm256_and_si256(*Mask,_mm256_srli_epi16(*A,4))));
LUOV.c: | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
LUOV.c: /usr/lib/gcc/x86_64-linux-gnu/14/include/avx2intrin.h:759:1: error: inlining failed in call to 'always_inline' '_mm256_srli_epi16': target specific option mismatch
LUOV.c: ...
Number of similar (implementation,compiler) pairs: 4, namely:
| Implementation | Compiler |
| T:avx2 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) |
| T:avx2 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) |
| T:avx2 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) |
| T:avx2 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) |