Test results for amd64, hydra5, crypto_sort/uint64

[Page version: 20260409 06:44:40]

Measurements for amd64, hydra5, crypto_sort Test results for amd64, hydra5, crypto_sort Test results for crypto_sort/uint64
Computer: hydra5
Microarchitecture: amd64; K10 32nm (300f10)
Architecture: amd64
CPU ID: AuthenticAMD-00300f10-178bfbff
SUPERCOP version: 20260330
Operation: crypto_sort
Primitive: uint64
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
65389273 0 031069 1224 1576useint64gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall2026030220260217
65857338 0 031722 1272 1512useint64clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall2026030220260217
66024338 0 032642 1272 1512useint64clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall2026030220260217
66216338 0 032642 1272 1512useint64clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall2026030220260217
66495626 0 033405 1224 1576useint64gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall2026030220260217
67267167 0 029276 1264 1512useint64clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall2026030220260217
67463178 0 029810 1272 1512useint64clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall2026030220260217
69718192 0 030164 1216 1576useint64gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall2026030220260217
69994174 0 028979 1200 1544useint64gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall2026030220260217
748811679 0 029659 1320 1608T:stdsortg++ -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall2026040620260330
761661393 0 027395 1320 1608T:stdsortg++ -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall2026040620260330
819351236 0 025938 1304 1608T:stdsortg++ -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall2026040620260330
83703988 0 024323 1296 1576T:stdsortg++ -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall2026040620260330

Compiler output


sort.c: sort.c:16:20: error: always_inline function '_mm256_set1_epi64x' requires target feature 'avx', but would be inlined into function 'uint64_sort' that is compiled without support for 'avx'
sort.c:    16 |   int64x4 vecxor = int64x4_broadcast(0x8000000000000000ULL);
sort.c:       |                    ^
sort.c: sort.c:8:27: note: expanded from macro 'int64x4_broadcast'
sort.c:     8 | #define int64x4_broadcast _mm256_set1_epi64x
sort.c:       |                           ^
sort.c: sort.c:16:20: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
sort.c: sort.c:8:27: note: expanded from macro 'int64x4_broadcast'
sort.c:     8 | #define int64x4_broadcast _mm256_set1_epi64x
sort.c:       |                           ^
sort.c: sort.c:18:18: error: always_inline function '_mm256_loadu_si256' requires target feature 'avx', but would be inlined into function 'uint64_sort' that is compiled without support for 'avx'
sort.c:    18 |     int64x4 x0 = int64x4_load(x+j);
sort.c:       |                  ^
sort.c: sort.c:6:25: note: expanded from macro 'int64x4_load'
sort.c:     6 | #define int64x4_load(z) _mm256_loadu_si256((__m256i *) (z))
sort.c:       |                         ^
sort.c: sort.c:18:18: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
sort.c: sort.c:6:25: note: expanded from macro 'int64x4_load'
sort.c:     6 | #define int64x4_load(z) _mm256_loadu_si256((__m256i *) (z))
sort.c:       |                         ^
sort.c: sort.c:19:18: error: always_inline function '_mm256_loadu_si256' requires target feature 'avx', but would be inlined into function 'uint64_sort' that is compiled without support for 'avx'
sort.c:    19 |     int64x4 x1 = int64x4_load(x+j+4);
sort.c:       |                  ^
sort.c: sort.c:6:25: note: expanded from macro 'int64x4_load'
sort.c:     6 | #define int64x4_load(z) _mm256_loadu_si256((__m256i *) (z))
sort.c: ...

Number of similar (implementation,compiler) pairs: 5, namely:
ImplementationCompiler
avx2useint64clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))
avx2useint64clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))
avx2useint64clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))
avx2useint64clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))
avx2useint64clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))

Compiler output


sort.c: sort.c: In function 'uint64_sort':
sort.c: sort.c:16:11: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
sort.c:    16 |   int64x4 vecxor = int64x4_broadcast(0x8000000000000000ULL);
sort.c:       |           ^~~~~~
sort.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/13/include/immintrin.h:43,
sort.c:                  from sort.c:4:
sort.c: /usr/lib/gcc/x86_64-linux-gnu/13/include/avxintrin.h:1358:1: error: inlining failed in call to 'always_inline' '_mm256_set1_epi64x': target specific option mismatch
sort.c:  1358 | _mm256_set1_epi64x (long long __A)
sort.c:       | ^~~~~~~~~~~~~~~~~~
sort.c: sort.c:8:27: note: called from here
sort.c:     8 | #define int64x4_broadcast _mm256_set1_epi64x
sort.c:       |                           ^
sort.c: sort.c:16:20: note: in expansion of macro 'int64x4_broadcast'
sort.c:    16 |   int64x4 vecxor = int64x4_broadcast(0x8000000000000000ULL);
sort.c:       |                    ^~~~~~~~~~~~~~~~~
sort.c: /usr/lib/gcc/x86_64-linux-gnu/13/include/avxintrin.h:933:1: error: inlining failed in call to 'always_inline' '_mm256_storeu_si256': target specific option mismatch
sort.c:   933 | _mm256_storeu_si256 (__m256i_u *__P, __m256i __A)
sort.c:       | ^~~~~~~~~~~~~~~~~~~
sort.c: sort.c:7:28: note: called from here
sort.c:     7 | #define int64x4_store(z,i) _mm256_storeu_si256((__m256i *) (z),(i))
sort.c:       |                            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
sort.c: sort.c:23:5: note: in expansion of macro 'int64x4_store'
sort.c:    23 |     int64x4_store(x+j+4,x1);
sort.c:       |     ^~~~~~~~~~~~~
sort.c: /usr/lib/gcc/x86_64-linux-gnu/13/include/avxintrin.h:933:1: error: inlining failed in call to 'always_inline' '_mm256_storeu_si256': target specific option mismatch
sort.c: ...

Number of similar (implementation,compiler) pairs: 4, namely:
ImplementationCompiler
avx2useint64gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)
avx2useint64gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)
avx2useint64gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)
avx2useint64gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)

Compiler output


try.cpp: In file included from try-anything.cpp:22:
try.cpp: .../supercop-data/hydra5/amd64/include/stq.h:32:13: warning: 'void stq_sort(std::vector<double>, long long int)' defined but not used [-Wunused-function]
try.cpp:    32 | static void stq_sort(stq_doublearray S,long long n)
try.cpp:       |             ^~~~~~~~
try.cpp: In file included from try-anything.cpp:22:
try.cpp: .../supercop-data/hydra5/amd64/include/stq.h:32:13: warning: 'void stq_sort(std::vector<double>, long long int)' defined but not used [-Wunused-function]
try.cpp:    32 | static void stq_sort(stq_doublearray S,long long n)
try.cpp:       |             ^~~~~~~~
measure.cpp: In file included from measure-anything.cpp:19:
measure.cpp: .../supercop-data/hydra5/amd64/include/stq.h:32:13: warning: 'void stq_sort(std::vector<double>, long long int)' defined but not used [-Wunused-function]
measure.cpp:    32 | static void stq_sort(stq_doublearray S,long long n)
measure.cpp:       |             ^~~~~~~~

Number of similar (implementation,compiler) pairs: 4, namely:
ImplementationCompiler
T:stdsortg++ -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)
T:stdsortg++ -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)
T:stdsortg++ -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)
T:stdsortg++ -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)

Passed TIMECOP


TIMECOP iterations: 10

Number of similar (implementation,compiler) pairs: 9, namely:
ImplementationCompiler
useint64clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))
useint64clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))
useint64clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))
useint64clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))
useint64clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))
useint64gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)
useint64gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)
useint64gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)
useint64gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)