Test results for amd64, speed2supercop, crypto_sign/luov8117404
[Page version: 20241023 21:37:24]
Measurements for amd64, speed2supercop, crypto_sign
Test results for amd64, speed2supercop, crypto_sign
Test results for crypto_sign/luov8117404
Computer: speed2supercop
Microarchitecture: amd64; Haswell+AES (306c3)
Architecture: amd64
CPU ID: GenuineIntel-000306c3-1fc9cbf5
SUPERCOP version: 20241022
Operation: crypto_sign
Primitive: luov8117404
Time | Object size | Test size | Implementation | Compiler | Benchmark date | SUPERCOP version |
38646100 | 59770 0 0 | 90163 832 1600 | T:avx2 | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
38752080 | 36131 32768 0 | 66720 33560 1632 | T:avx2 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
38777192 | 66514 0 0 | 97187 832 1600 | T:avx2 | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
38846472 | 51310 0 0 | 79994 816 1632 | T:avx2 | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
38848896 | 54201 0 0 | 81075 832 1568 | T:avx2 | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
38913640 | 21499 32768 0 | 50160 33560 1632 | T:avx2 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
39232300 | 20493 32768 0 | 48864 33560 1632 | T:avx2 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
39924616 | 15059 32768 0 | 42400 33552 1600 | T:avx2 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
72677708 | 558054 0 0 | 56786 832 1600 | T:portable | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
72749996 | 556218 0 0 | 54482 832 1600 | T:portable | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
73758452 | 558944 0 0 | 55751 784 1632 | T:portable | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
75870296 | 551964 0 0 | 48474 832 1568 | T:portable | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
76187608 | 557357 0 0 | 54706 832 1568 | T:portable | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
78889592 | 550049 0 0 | 48009 816 1632 | T:portable | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
79099520 | 553637 0 0 | 50631 784 1632 | T:portable | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
86056948 | 552237 0 0 | 49135 784 1632 | T:portable | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
91671636 | 547949 0 0 | 44839 776 1600 | T:portable | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
820168196 | 415097 36 0 | 173754 832 1568 | T:ref | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
835317576 | 415224 36 0 | 174858 832 1600 | T:ref | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
836024216 | 415570 36 0 | 175562 832 1600 | T:ref | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
886697480 | 415872 36 0 | 174423 784 1632 | T:ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
889792104 | 411909 36 0 | 171239 784 1632 | T:ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
902675892 | 410653 36 0 | 170023 784 1632 | T:ref | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
906432304 | 409155 36 0 | 169649 816 1632 | T:ref | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
938271136 | 408332 36 0 | 167887 776 1600 | T:ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
963278420 | 411144 36 0 | 170170 832 1568 | T:ref | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20240708 | 20240625 |
Compiler output
keccakrng.c: keccakrng.c:71:24: warning: unused function 'rotl' [-Wunused-function]
keccakrng.c: static inline uint64_t rotl(const uint64_t x, int k) {
keccakrng.c: ^
keccakrng.c: 1 warning generated.
Number of similar (implementation,compiler) pairs: 4, namely:
Implementation | Compiler |
T:avx2 | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_16.0.6_(27+b1)) |
T:avx2 | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_16.0.6_(27+b1)) |
T:avx2 | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_16.0.6_(27+b1)) |
T:avx2 | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_16.0.6_(27+b1)) |
Compiler output
LUOV.c: LUOV.c:110:17: error: '__builtin_ia32_permti256' needs target feature avx2
LUOV.c: __m256i rr = _mm256_permute2x128_si256(_mm256_loadu_si256((__m256i *)&r),_mm256_setzero_si256(),0);
LUOV.c: ^
LUOV.c: /usr/lib/llvm-16/lib/clang/16/include/avx2intrin.h:821:13: note: expanded from macro '_mm256_permute2x128_si256'
LUOV.c: ((__m256i)__builtin_ia32_permti256((__m256i)(V1), (__m256i)(V2), (int)(M)))
LUOV.c: ^
LUOV.c: LUOV.c:110:43: error: always_inline function '_mm256_loadu_si256' requires target feature 'avx', but would be inlined into function 'calculateQ2' that is compiled without support for 'avx'
LUOV.c: __m256i rr = _mm256_permute2x128_si256(_mm256_loadu_si256((__m256i *)&r),_mm256_setzero_si256(),0);
LUOV.c: ^
LUOV.c: LUOV.c:110:43: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
LUOV.c: LUOV.c:110:77: error: always_inline function '_mm256_setzero_si256' requires target feature 'avx', but would be inlined into function 'calculateQ2' that is compiled without support for 'avx'
LUOV.c: __m256i rr = _mm256_permute2x128_si256(_mm256_loadu_si256((__m256i *)&r),_mm256_setzero_si256(),0);
LUOV.c: ^
LUOV.c: LUOV.c:110:77: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
LUOV.c: LUOV.c:115:20: error: always_inline function '_mm256_set1_epi8' requires target feature 'avx', but would be inlined into function 'calculateQ2' that is compiled without support for 'avx'
LUOV.c: __m256i tttt = _mm256_set1_epi8(t[k/8]);
LUOV.c: ^
LUOV.c: LUOV.c:115:20: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
LUOV.c: LUOV.c:117:54: error: always_inline function '_mm256_setzero_si256' requires target feature 'avx', but would be inlined into function 'calculateQ2' that is compiled without support for 'avx'
LUOV.c: __m256i t1t2 = _mm256_cmpeq_epi8(tttt & masks[0],_mm256_setzero_si256());
LUOV.c: ^
LUOV.c: LUOV.c:117:54: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
LUOV.c: LUOV.c:117:20: error: always_inline function '_mm256_cmpeq_epi8' requires target feature 'avx2', but would be inlined into function 'calculateQ2' that is compiled without support for 'avx2'
LUOV.c: __m256i t1t2 = _mm256_cmpeq_epi8(tttt & masks[0],_mm256_setzero_si256());
LUOV.c: ^
LUOV.c: ...
Number of similar (implementation,compiler) pairs: 1, namely:
Implementation | Compiler |
T:avx2 | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_16.0.6_(27+b1)) |
Compiler output
LUOV.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/13/include/immintrin.h:43,
LUOV.c: from LUOV.h:7,
LUOV.c: from LUOV.c:1:
LUOV.c: In function '_mm256_loadu_si256',
LUOV.c: inlined from 'calculateQ2' at LUOV.c:142:17:
LUOV.c: /usr/lib/gcc/x86_64-linux-gnu/13/include/avxintrin.h:929:10: warning: array subscript '__m256i_u[0]' is partly outside array bounds of '__m128i[1]' [-Warray-bounds=]
LUOV.c: 929 | return *__P;
LUOV.c: | ^~~~
LUOV.c: LUOV.c: In function 'calculateQ2':
LUOV.c: LUOV.c:141:38: note: object 'r' of size 16
LUOV.c: 141 | bitcontainer r = TempMat[j][i];
LUOV.c: | ^
LUOV.c: In function '_mm256_loadu_si256',
LUOV.c: inlined from 'calculateQ2' at LUOV.c:110:17:
LUOV.c: /usr/lib/gcc/x86_64-linux-gnu/13/include/avxintrin.h:929:10: warning: array subscript '__m256i_u[0]' is partly outside array bounds of '__m128i[1]' [-Warray-bounds=]
LUOV.c: 929 | return *__P;
LUOV.c: | ^~~~
LUOV.c: LUOV.c: In function 'calculateQ2':
LUOV.c: LUOV.c:109:38: note: object 'r' of size 16
LUOV.c: 109 | bitcontainer r = Q1[col++];
LUOV.c: | ^
LUOV.c: In function '_mm256_loadu_si256',
LUOV.c: inlined from 'TransformQ1' at LUOV.c:280:17:
LUOV.c: /usr/lib/gcc/x86_64-linux-gnu/13/include/avxintrin.h:929:10: warning: array subscript '__m256i_u[0]' is partly outside array bounds of '__m128i[1]' [-Warray-bounds=]
LUOV.c: 929 | return *__P;
LUOV.c: ...
Number of similar (implementation,compiler) pairs: 3, namely:
Implementation | Compiler |
T:avx2 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) |
T:avx2 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) |
T:avx2 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) |
Compiler output
F64Field.c: F64Field.c: In function 'f64addInPlace':
F64Field.c: F64Field.c:43:11: warning: dereferencing type-punned pointer will break strict-aliasing rules [-Wstrict-aliasing]
F64Field.c: 43 | *((uint64_t *) a->coef) ^= *((uint64_t *) b->coef);
F64Field.c: | ~^~~~~~~~~~~~~~~~~~~~~
F64Field.c: F64Field.c:43:38: warning: dereferencing type-punned pointer will break strict-aliasing rules [-Wstrict-aliasing]
F64Field.c: 43 | *((uint64_t *) a->coef) ^= *((uint64_t *) b->coef);
F64Field.c: | ~^~~~~~~~~~~~~~~~~~~~~
F80Field.c: F80Field.c: In function 'f80addInPlace':
F80Field.c: F80Field.c:55:11: warning: dereferencing type-punned pointer will break strict-aliasing rules [-Wstrict-aliasing]
F80Field.c: 55 | *((uint64_t *) a->coef) ^= *((uint64_t *) b->coef);
F80Field.c: | ~^~~~~~~~~~~~~~~~~~~~~
F80Field.c: F80Field.c:55:38: warning: dereferencing type-punned pointer will break strict-aliasing rules [-Wstrict-aliasing]
F80Field.c: 55 | *((uint64_t *) a->coef) ^= *((uint64_t *) b->coef);
F80Field.c: | ~^~~~~~~~~~~~~~~~~~~~~
Number of similar (implementation,compiler) pairs: 3, namely:
Implementation | Compiler |
T:portable | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) |
T:portable | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) |
T:portable | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (14.2.0) |