Test results for amd64, speed2supercop, crypto_sign/luov890351
[Page version: 20241023 21:37:24]
Measurements for amd64, speed2supercop, crypto_sign
Test results for amd64, speed2supercop, crypto_sign
Test results for crypto_sign/luov890351
Computer: speed2supercop
Microarchitecture: amd64; Haswell+AES (306c3)
Architecture: amd64
CPU ID: GenuineIntel-000306c3-1fc9cbf5
SUPERCOP version: 20241022
Operation: crypto_sign
Primitive: luov890351
Time | Object size | Test size | Implementation | Compiler | Benchmark date | SUPERCOP version |
22511956 | 59624 0 0 | 90067 832 1600 | T:avx2 | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
22540524 | 65776 0 0 | 96707 832 1600 | T:avx2 | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
22550176 | 36894 32768 0 | 67528 33560 1632 | T:avx2 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
22586396 | 54119 0 0 | 80947 832 1568 | T:avx2 | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
22646252 | 51030 0 0 | 79674 816 1632 | T:avx2 | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
22655896 | 21942 32768 0 | 50568 33560 1632 | T:avx2 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
22872400 | 20915 32768 0 | 49280 33560 1632 | T:avx2 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
23302732 | 14948 32768 0 | 42272 33552 1600 | T:avx2 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
42520584 | 557414 0 0 | 56146 832 1600 | T:portable | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
42662736 | 555946 0 0 | 54226 832 1600 | T:portable | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
43288924 | 558787 0 0 | 55591 784 1632 | T:portable | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
43927288 | 556845 0 0 | 54194 832 1568 | T:portable | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
44322024 | 551948 0 0 | 48474 832 1568 | T:portable | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
46172880 | 550023 0 0 | 47945 816 1632 | T:portable | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
46253736 | 553999 0 0 | 50919 784 1632 | T:portable | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
50881664 | 552517 0 0 | 49359 784 1632 | T:portable | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
54225312 | 547949 0 0 | 44839 776 1600 | T:portable | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
448562572 | 415689 36 0 | 174330 832 1568 | T:ref | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
459621040 | 416146 36 0 | 176138 832 1600 | T:ref | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
460105284 | 415224 36 0 | 174858 832 1600 | T:ref | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
482133988 | 411044 36 0 | 170407 784 1632 | T:ref | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
488127964 | 412287 36 0 | 171623 784 1632 | T:ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
493975524 | 416275 36 0 | 174871 784 1632 | T:ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
497615600 | 409144 36 0 | 169649 816 1632 | T:ref | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
516253316 | 408332 36 0 | 167887 776 1600 | T:ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
530727048 | 411144 36 0 | 170170 832 1568 | T:ref | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
Compiler output
keccakrng.c: keccakrng.c:71:24: warning: unused function 'rotl' [-Wunused-function]
keccakrng.c: static inline uint64_t rotl(const uint64_t x, int k) {
keccakrng.c: ^
keccakrng.c: 1 warning generated.
Number of similar (implementation,compiler) pairs: 4, namely:
Implementation | Compiler |
T:avx2 | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_16.0.6_(27+b1)) |
T:avx2 | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_16.0.6_(27+b1)) |
T:avx2 | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_16.0.6_(27+b1)) |
T:avx2 | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_16.0.6_(27+b1)) |
Compiler output
LUOV.c: LUOV.c:110:17: error: '__builtin_ia32_permti256' needs target feature avx2
LUOV.c: __m256i rr = _mm256_permute2x128_si256(_mm256_loadu_si256((__m256i *)&r),_mm256_setzero_si256(),0);
LUOV.c: ^
LUOV.c: /usr/lib/llvm-16/lib/clang/16/include/avx2intrin.h:821:13: note: expanded from macro '_mm256_permute2x128_si256'
LUOV.c: ((__m256i)__builtin_ia32_permti256((__m256i)(V1), (__m256i)(V2), (int)(M)))
LUOV.c: ^
LUOV.c: LUOV.c:110:43: error: always_inline function '_mm256_loadu_si256' requires target feature 'avx', but would be inlined into function 'calculateQ2' that is compiled without support for 'avx'
LUOV.c: __m256i rr = _mm256_permute2x128_si256(_mm256_loadu_si256((__m256i *)&r),_mm256_setzero_si256(),0);
LUOV.c: ^
LUOV.c: LUOV.c:110:43: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
LUOV.c: LUOV.c:110:77: error: always_inline function '_mm256_setzero_si256' requires target feature 'avx', but would be inlined into function 'calculateQ2' that is compiled without support for 'avx'
LUOV.c: __m256i rr = _mm256_permute2x128_si256(_mm256_loadu_si256((__m256i *)&r),_mm256_setzero_si256(),0);
LUOV.c: ^
LUOV.c: LUOV.c:110:77: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
LUOV.c: LUOV.c:115:20: error: always_inline function '_mm256_set1_epi8' requires target feature 'avx', but would be inlined into function 'calculateQ2' that is compiled without support for 'avx'
LUOV.c: __m256i tttt = _mm256_set1_epi8(t[k/8]);
LUOV.c: ^
LUOV.c: LUOV.c:115:20: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
LUOV.c: LUOV.c:117:54: error: always_inline function '_mm256_setzero_si256' requires target feature 'avx', but would be inlined into function 'calculateQ2' that is compiled without support for 'avx'
LUOV.c: __m256i t1t2 = _mm256_cmpeq_epi8(tttt & masks[0],_mm256_setzero_si256());
LUOV.c: ^
LUOV.c: LUOV.c:117:54: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
LUOV.c: LUOV.c:117:20: error: always_inline function '_mm256_cmpeq_epi8' requires target feature 'avx2', but would be inlined into function 'calculateQ2' that is compiled without support for 'avx2'
LUOV.c: __m256i t1t2 = _mm256_cmpeq_epi8(tttt & masks[0],_mm256_setzero_si256());
LUOV.c: ^
LUOV.c: ...
Number of similar (implementation,compiler) pairs: 1, namely:
Implementation | Compiler |
T:avx2 | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_16.0.6_(27+b1)) |
Compiler output
LUOV.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/13/include/immintrin.h:43,
LUOV.c: from LUOV.h:7,
LUOV.c: from LUOV.c:1:
LUOV.c: In function '_mm256_loadu_si256',
LUOV.c: inlined from 'calculateQ2' at LUOV.c:142:17:
LUOV.c: /usr/lib/gcc/x86_64-linux-gnu/13/include/avxintrin.h:929:10: warning: array subscript '__m256i_u[0]' is partly outside array bounds of '__m128i[1]' [-Warray-bounds=]
LUOV.c: 929 | return *__P;
LUOV.c: | ^~~~
LUOV.c: LUOV.c: In function 'calculateQ2':
LUOV.c: LUOV.c:141:38: note: object 'r' of size 16
LUOV.c: 141 | bitcontainer r = TempMat[j][i];
LUOV.c: | ^
LUOV.c: In function '_mm256_loadu_si256',
LUOV.c: inlined from 'calculateQ2' at LUOV.c:110:17:
LUOV.c: /usr/lib/gcc/x86_64-linux-gnu/13/include/avxintrin.h:929:10: warning: array subscript '__m256i_u[0]' is partly outside array bounds of '__m128i[1]' [-Warray-bounds=]
LUOV.c: 929 | return *__P;
LUOV.c: | ^~~~
LUOV.c: LUOV.c: In function 'calculateQ2':
LUOV.c: LUOV.c:109:38: note: object 'r' of size 16
LUOV.c: 109 | bitcontainer r = Q1[col++];
LUOV.c: | ^
LUOV.c: In function '_mm256_loadu_si256',
LUOV.c: inlined from 'TransformQ1' at LUOV.c:280:17:
LUOV.c: /usr/lib/gcc/x86_64-linux-gnu/13/include/avxintrin.h:929:10: warning: array subscript '__m256i_u[0]' is partly outside array bounds of '__m128i[1]' [-Warray-bounds=]
LUOV.c: 929 | return *__P;
LUOV.c: ...
Number of similar (implementation,compiler) pairs: 3, namely:
Implementation | Compiler |
T:avx2 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) |
T:avx2 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) |
T:avx2 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) |
Compiler output
F64Field.c: F64Field.c: In function 'f64addInPlace':
F64Field.c: F64Field.c:43:11: warning: dereferencing type-punned pointer will break strict-aliasing rules [-Wstrict-aliasing]
F64Field.c: 43 | *((uint64_t *) a->coef) ^= *((uint64_t *) b->coef);
F64Field.c: | ~^~~~~~~~~~~~~~~~~~~~~
F64Field.c: F64Field.c:43:38: warning: dereferencing type-punned pointer will break strict-aliasing rules [-Wstrict-aliasing]
F64Field.c: 43 | *((uint64_t *) a->coef) ^= *((uint64_t *) b->coef);
F64Field.c: | ~^~~~~~~~~~~~~~~~~~~~~
F80Field.c: F80Field.c: In function 'f80addInPlace':
F80Field.c: F80Field.c:55:11: warning: dereferencing type-punned pointer will break strict-aliasing rules [-Wstrict-aliasing]
F80Field.c: 55 | *((uint64_t *) a->coef) ^= *((uint64_t *) b->coef);
F80Field.c: | ~^~~~~~~~~~~~~~~~~~~~~
F80Field.c: F80Field.c:55:38: warning: dereferencing type-punned pointer will break strict-aliasing rules [-Wstrict-aliasing]
F80Field.c: 55 | *((uint64_t *) a->coef) ^= *((uint64_t *) b->coef);
F80Field.c: | ~^~~~~~~~~~~~~~~~~~~~~
Number of similar (implementation,compiler) pairs: 3, namely:
Implementation | Compiler |
T:portable | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) |
T:portable | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) |
T:portable | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) |