Test results for amd64, hydra5, crypto_sort/uint64
[Page version: 20260409 06:44:40]
Measurements for amd64, hydra5, crypto_sort
Test results for amd64, hydra5, crypto_sort
Test results for crypto_sort/uint64
Computer: hydra5
Microarchitecture: amd64; K10 32nm (300f10)
Architecture: amd64
CPU ID: AuthenticAMD-00300f10-178bfbff
SUPERCOP version: 20260330
Operation: crypto_sort
Primitive: uint64
| Time | Object size | Test size | Implementation | Compiler | Benchmark date | SUPERCOP version |
| 65389 | 273 0 0 | 31069 1224 1576 | useint64 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20260302 | 20260217 |
| 65857 | 338 0 0 | 31722 1272 1512 | useint64 | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20260302 | 20260217 |
| 66024 | 338 0 0 | 32642 1272 1512 | useint64 | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20260302 | 20260217 |
| 66216 | 338 0 0 | 32642 1272 1512 | useint64 | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20260302 | 20260217 |
| 66495 | 626 0 0 | 33405 1224 1576 | useint64 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20260302 | 20260217 |
| 67267 | 167 0 0 | 29276 1264 1512 | useint64 | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20260302 | 20260217 |
| 67463 | 178 0 0 | 29810 1272 1512 | useint64 | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20260302 | 20260217 |
| 69718 | 192 0 0 | 30164 1216 1576 | useint64 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20260302 | 20260217 |
| 69994 | 174 0 0 | 28979 1200 1544 | useint64 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20260302 | 20260217 |
| 74881 | 1679 0 0 | 29659 1320 1608 | T:stdsort | g++ -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20260406 | 20260330 |
| 76166 | 1393 0 0 | 27395 1320 1608 | T:stdsort | g++ -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20260406 | 20260330 |
| 81935 | 1236 0 0 | 25938 1304 1608 | T:stdsort | g++ -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20260406 | 20260330 |
| 83703 | 988 0 0 | 24323 1296 1576 | T:stdsort | g++ -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20260406 | 20260330 |
Compiler output
sort.c: sort.c:16:20: error: always_inline function '_mm256_set1_epi64x' requires target feature 'avx', but would be inlined into function 'uint64_sort' that is compiled without support for 'avx'
sort.c: 16 | int64x4 vecxor = int64x4_broadcast(0x8000000000000000ULL);
sort.c: | ^
sort.c: sort.c:8:27: note: expanded from macro 'int64x4_broadcast'
sort.c: 8 | #define int64x4_broadcast _mm256_set1_epi64x
sort.c: | ^
sort.c: sort.c:16:20: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
sort.c: sort.c:8:27: note: expanded from macro 'int64x4_broadcast'
sort.c: 8 | #define int64x4_broadcast _mm256_set1_epi64x
sort.c: | ^
sort.c: sort.c:18:18: error: always_inline function '_mm256_loadu_si256' requires target feature 'avx', but would be inlined into function 'uint64_sort' that is compiled without support for 'avx'
sort.c: 18 | int64x4 x0 = int64x4_load(x+j);
sort.c: | ^
sort.c: sort.c:6:25: note: expanded from macro 'int64x4_load'
sort.c: 6 | #define int64x4_load(z) _mm256_loadu_si256((__m256i *) (z))
sort.c: | ^
sort.c: sort.c:18:18: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
sort.c: sort.c:6:25: note: expanded from macro 'int64x4_load'
sort.c: 6 | #define int64x4_load(z) _mm256_loadu_si256((__m256i *) (z))
sort.c: | ^
sort.c: sort.c:19:18: error: always_inline function '_mm256_loadu_si256' requires target feature 'avx', but would be inlined into function 'uint64_sort' that is compiled without support for 'avx'
sort.c: 19 | int64x4 x1 = int64x4_load(x+j+4);
sort.c: | ^
sort.c: sort.c:6:25: note: expanded from macro 'int64x4_load'
sort.c: 6 | #define int64x4_load(z) _mm256_loadu_si256((__m256i *) (z))
sort.c: ...
Number of similar (implementation,compiler) pairs: 5, namely:
| Implementation | Compiler |
| avx2useint64 | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) |
| avx2useint64 | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) |
| avx2useint64 | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) |
| avx2useint64 | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) |
| avx2useint64 | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) |
Compiler output
sort.c: sort.c: In function 'uint64_sort':
sort.c: sort.c:16:11: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
sort.c: 16 | int64x4 vecxor = int64x4_broadcast(0x8000000000000000ULL);
sort.c: | ^~~~~~
sort.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/13/include/immintrin.h:43,
sort.c: from sort.c:4:
sort.c: /usr/lib/gcc/x86_64-linux-gnu/13/include/avxintrin.h:1358:1: error: inlining failed in call to 'always_inline' '_mm256_set1_epi64x': target specific option mismatch
sort.c: 1358 | _mm256_set1_epi64x (long long __A)
sort.c: | ^~~~~~~~~~~~~~~~~~
sort.c: sort.c:8:27: note: called from here
sort.c: 8 | #define int64x4_broadcast _mm256_set1_epi64x
sort.c: | ^
sort.c: sort.c:16:20: note: in expansion of macro 'int64x4_broadcast'
sort.c: 16 | int64x4 vecxor = int64x4_broadcast(0x8000000000000000ULL);
sort.c: | ^~~~~~~~~~~~~~~~~
sort.c: /usr/lib/gcc/x86_64-linux-gnu/13/include/avxintrin.h:933:1: error: inlining failed in call to 'always_inline' '_mm256_storeu_si256': target specific option mismatch
sort.c: 933 | _mm256_storeu_si256 (__m256i_u *__P, __m256i __A)
sort.c: | ^~~~~~~~~~~~~~~~~~~
sort.c: sort.c:7:28: note: called from here
sort.c: 7 | #define int64x4_store(z,i) _mm256_storeu_si256((__m256i *) (z),(i))
sort.c: | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
sort.c: sort.c:23:5: note: in expansion of macro 'int64x4_store'
sort.c: 23 | int64x4_store(x+j+4,x1);
sort.c: | ^~~~~~~~~~~~~
sort.c: /usr/lib/gcc/x86_64-linux-gnu/13/include/avxintrin.h:933:1: error: inlining failed in call to 'always_inline' '_mm256_storeu_si256': target specific option mismatch
sort.c: ...
Number of similar (implementation,compiler) pairs: 4, namely:
| Implementation | Compiler |
| avx2useint64 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) |
| avx2useint64 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) |
| avx2useint64 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) |
| avx2useint64 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) |
Compiler output
try.cpp: In file included from try-anything.cpp:22:
try.cpp: .../supercop-data/hydra5/amd64/include/stq.h:32:13: warning: 'void stq_sort(std::vector<double>, long long int)' defined but not used [-Wunused-function]
try.cpp: 32 | static void stq_sort(stq_doublearray S,long long n)
try.cpp: | ^~~~~~~~
try.cpp: In file included from try-anything.cpp:22:
try.cpp: .../supercop-data/hydra5/amd64/include/stq.h:32:13: warning: 'void stq_sort(std::vector<double>, long long int)' defined but not used [-Wunused-function]
try.cpp: 32 | static void stq_sort(stq_doublearray S,long long n)
try.cpp: | ^~~~~~~~
measure.cpp: In file included from measure-anything.cpp:19:
measure.cpp: .../supercop-data/hydra5/amd64/include/stq.h:32:13: warning: 'void stq_sort(std::vector<double>, long long int)' defined but not used [-Wunused-function]
measure.cpp: 32 | static void stq_sort(stq_doublearray S,long long n)
measure.cpp: | ^~~~~~~~
Number of similar (implementation,compiler) pairs: 4, namely:
| Implementation | Compiler |
| T:stdsort | g++ -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) |
| T:stdsort | g++ -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) |
| T:stdsort | g++ -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) |
| T:stdsort | g++ -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) |
Passed TIMECOP
TIMECOP iterations: 10
Number of similar (implementation,compiler) pairs: 9, namely:
| Implementation | Compiler |
| useint64 | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) |
| useint64 | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) |
| useint64 | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) |
| useint64 | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) |
| useint64 | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) |
| useint64 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) |
| useint64 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) |
| useint64 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) |
| useint64 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) |