Test results for amd64, hydra5, crypto_sort/int64

[Page version: 20260709 06:18:22]

Measurements for amd64, hydra5, crypto_sort Test results for amd64, hydra5, crypto_sort Test results for crypto_sort/int64

Computer: hydra5
Microarchitecture: amd64; K10 32nm (300f10)
Architecture: amd64
CPU ID: AuthenticAMD-00300f10-178bfbff
SUPERCOP version: 20260627
Operation: crypto_sort
Primitive: int64

Time	Object size	Test size	Implementation	Compiler	Benchmark date	SUPERCOP version
65298	8622 0 0	29116 1264 1512	`x863`	`clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20260302	20260217
65425	11623 0 0	35322 1272 1512	`x863`	`clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20260302	20260217
65530	11623 0 0	35322 1272 1512	`x863`	`clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20260302	20260217
65678	9991 0 0	32770 1272 1512	`x863`	`clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20260302	20260217
67194	9359 0 0	30386 1272 1512	`x863`	`clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20260302	20260217
67908	11060 0 0	33821 1224 1576	`x863`	`gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20260302	20260217
68064	10434 0 0	32364 1216 1576	`x863`	`gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20260302	20260217
69014	14604 0 0	39341 1224 1576	`x863`	`gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20260302	20260217
74094	1679 0 0	29659 1320 1608	`T:stdsort`	`g++ -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20260405	20260330
75492	1393 0 0	27395 1320 1608	`T:stdsort`	`g++ -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20260405	20260330
78629	842 0 0	24562 1272 1512	`portable4`	`clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20260302	20260217
78671	842 0 0	24562 1272 1512	`portable4`	`clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20260302	20260217
79458	623 0 0	21116 1264 1512	`portable4`	`clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20260302	20260217
80001	730 0 0	21778 1272 1512	`portable4`	`clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20260302	20260217
80223	762 0 0	23546 1272 1512	`portable4`	`clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20260302	20260217
80691	3887 0 0	24683 1200 1544	`x863`	`gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20260302	20260217
81889	1236 0 0	25938 1304 1608	`T:stdsort`	`g++ -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20260405	20260330
84279	988 0 0	24323 1296 1576	`T:stdsort`	`g++ -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20260405	20260330
99131	913 0 0	23701 1224 1576	`portable4`	`gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20260302	20260217
100394	1793 0 0	26565 1224 1576	`portable4`	`gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20260302	20260217
117434	1003 0 0	22948 1216 1576	`portable4`	`gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20260302	20260217
216331	732 0 0	21507 1200 1544	`portable4`	`gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20260302	20260217

Compiler output

sort.c: sort.c:123:8: error: always_inline function '_mm256_loadu_si256' requires target feature 'avx', but would be inlined into function 'int64_sort_8through16' that is compiled without support for 'avx'
sort.c:   123 |   x0 = int64x4_load(x+0); // 0 4 8 12
sort.c:       |        ^
sort.c: sort.c:17:25: note: expanded from macro 'int64x4_load'
sort.c:    17 | #define int64x4_load(z) _mm256_loadu_si256((__m256i *) (z))
sort.c:       |                         ^
sort.c: sort.c:123:8: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
sort.c: sort.c:17:25: note: expanded from macro 'int64x4_load'
sort.c:    17 | #define int64x4_load(z) _mm256_loadu_si256((__m256i *) (z))
sort.c:       |                         ^
sort.c: sort.c:124:8: error: always_inline function '_mm256_loadu_si256' requires target feature 'avx', but would be inlined into function 'int64_sort_8through16' that is compiled without support for 'avx'
sort.c:   124 |   x1 = int64x4_load(x+4); // 1 5 9 13
sort.c:       |        ^
sort.c: sort.c:17:25: note: expanded from macro 'int64x4_load'
sort.c:    17 | #define int64x4_load(z) _mm256_loadu_si256((__m256i *) (z))
sort.c:       |                         ^
sort.c: sort.c:124:8: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
sort.c: sort.c:17:25: note: expanded from macro 'int64x4_load'
sort.c:    17 | #define int64x4_load(z) _mm256_loadu_si256((__m256i *) (z))
sort.c:       |                         ^
sort.c: sort.c:125:11: error: always_inline function '_mm256_set1_epi64x' requires target feature 'avx', but would be inlined into function 'int64_sort_8through16' that is compiled without support for 'avx'
sort.c:   125 |   infty = int64x4_broadcast(int64_largest);
sort.c:       |           ^
sort.c: sort.c:41:27: note: expanded from macro 'int64x4_broadcast'
sort.c:    41 | #define int64x4_broadcast _mm256_set1_epi64x
sort.c: ...

Number of similar (implementation,compiler) pairs: 5, namely:

Implementation	Compiler
`2026avx2`	`clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`2026avx2`	`clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`2026avx2`	`clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`2026avx2`	`clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`2026avx2`	`clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`

Compiler output

sort.c: sort.c: In function 'int64_sort_8through16':
sort.c: sort.c:123:6: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
sort.c:   123 |   x0 = int64x4_load(x+0); // 0 4 8 12
sort.c:       |      ^
sort.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/13/include/immintrin.h:43,
sort.c:                  from sort.c:4:
sort.c: /usr/lib/gcc/x86_64-linux-gnu/13/include/avxintrin.h:933:1: error: inlining failed in call to 'always_inline' '_mm256_storeu_si256': target specific option mismatch
sort.c:   933 | _mm256_storeu_si256 (__m256i_u *__P, __m256i __A)
sort.c:       | ^~~~~~~~~~~~~~~~~~~
sort.c: sort.c:18:28: note: called from here
sort.c:    18 | #define int64x4_store(z,i) _mm256_storeu_si256((__m256i *) (z),(i))
sort.c:       |                            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
sort.c: sort.c:224:3: note: in expansion of macro 'int64x4_store'
sort.c:   224 |   int64x4_store(x+0,y2);
sort.c:       |   ^~~~~~~~~~~~~
sort.c: /usr/lib/gcc/x86_64-linux-gnu/13/include/avxintrin.h:933:1: error: inlining failed in call to 'always_inline' '_mm256_storeu_si256': target specific option mismatch
sort.c:   933 | _mm256_storeu_si256 (__m256i_u *__P, __m256i __A)
sort.c:       | ^~~~~~~~~~~~~~~~~~~
sort.c: sort.c:18:28: note: called from here
sort.c:    18 | #define int64x4_store(z,i) _mm256_storeu_si256((__m256i *) (z),(i))
sort.c:       |                            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
sort.c: sort.c:223:3: note: in expansion of macro 'int64x4_store'
sort.c:   223 |   int64x4_store(x+4,y1);
sort.c:       |   ^~~~~~~~~~~~~
sort.c: /usr/lib/gcc/x86_64-linux-gnu/13/include/avxintrin.h:933:1: error: inlining failed in call to 'always_inline' '_mm256_storeu_si256': target specific option mismatch
sort.c: ...

Number of similar (implementation,compiler) pairs: 4, namely:

Implementation	Compiler
`2026avx2`	`gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`
`2026avx2`	`gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`
`2026avx2`	`gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`
`2026avx2`	`gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`

Compiler output

sort.c: sort.c:130:93: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'int64_sort_8through16' that is compiled without support for 'ssse3'
sort.c:   130 |   x4 = int8x16_iftopthenelse(int64x2_smaller_mask(int64x2_set(8,9),int64x2_broadcast(pos4)),int8x16_varextract(int64x2_load(x+pos4-2),int8x16_sub(int8x16_set(16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31),int8x16_broadcast(8*(pos4&1)))),infty); // 4 12
sort.c:       |                                                                                             ^
sort.c: sort.c:39:28: note: expanded from macro 'int8x16_varextract'
sort.c:    39 | #define int8x16_varextract _mm_shuffle_epi8
sort.c:       |                            ^
sort.c: sort.c:130:30: error: always_inline function '_mm_cmpgt_epi64' requires target feature 'sse4.2', but would be inlined into function 'int64_sort_8through16' that is compiled without support for 'sse4.2'
sort.c:   130 |   x4 = int8x16_iftopthenelse(int64x2_smaller_mask(int64x2_set(8,9),int64x2_broadcast(pos4)),int8x16_varextract(int64x2_load(x+pos4-2),int8x16_sub(int8x16_set(16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31),int8x16_broadcast(8*(pos4&1)))),infty); // 4 12
sort.c:       |                              ^
sort.c: sort.c:20:35: note: expanded from macro 'int64x2_smaller_mask'
sort.c:    20 | #define int64x2_smaller_mask(a,b) _mm_cmpgt_epi64(b,a)
sort.c:       |                                   ^
sort.c: sort.c:130:8: error: always_inline function '_mm_blendv_epi8' requires target feature 'sse4.1', but would be inlined into function 'int64_sort_8through16' that is compiled without support for 'sse4.1'
sort.c:   130 |   x4 = int8x16_iftopthenelse(int64x2_smaller_mask(int64x2_set(8,9),int64x2_broadcast(pos4)),int8x16_varextract(int64x2_load(x+pos4-2),int8x16_sub(int8x16_set(16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31),int8x16_broadcast(8*(pos4&1)))),infty); // 4 12
sort.c:       |        ^
sort.c: sort.c:23:38: note: expanded from macro 'int8x16_iftopthenelse'
sort.c:    23 | #define int8x16_iftopthenelse(c,t,e) _mm_blendv_epi8(e,t,c)
sort.c:       |                                      ^
sort.c: sort.c:132:95: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'int64_sort_8through16' that is compiled without support for 'ssse3'
sort.c:   132 |   x5 = int8x16_iftopthenelse(int64x2_smaller_mask(int64x2_set(10,11),int64x2_broadcast(pos5)),int8x16_varextract(int64x2_load(x+pos5-2),int8x16_sub(int8x16_set(16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31),int8x16_broadcast(8*(pos5&1)))),infty); // 5 13
sort.c:       |                                                                                               ^
sort.c: sort.c:39:28: note: expanded from macro 'int8x16_varextract'
sort.c:    39 | #define int8x16_varextract _mm_shuffle_epi8
sort.c:       |                            ^
sort.c: sort.c:132:30: error: always_inline function '_mm_cmpgt_epi64' requires target feature 'sse4.2', but would be inlined into function 'int64_sort_8through16' that is compiled without support for 'sse4.2'
sort.c: ...

Number of similar (implementation,compiler) pairs: 5, namely:

Implementation	Compiler
`sse42`	`clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`sse42`	`clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`sse42`	`clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`sse42`	`clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`sse42`	`clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`

Compiler output

sort.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/13/include/immintrin.h:37,
sort.c:                  from sort.c:4:
sort.c: /usr/lib/gcc/x86_64-linux-gnu/13/include/tmmintrin.h: In function 'int64_sort_8through16':
sort.c: /usr/lib/gcc/x86_64-linux-gnu/13/include/tmmintrin.h:136:1: error: inlining failed in call to 'always_inline' '_mm_shuffle_epi8': target specific option mismatch
sort.c:   136 | _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
sort.c:       | ^~~~~~~~~~~~~~~~
sort.c: sort.c:18:28: note: called from here
sort.c:    18 | #define int64x2_store(z,i) _mm_storeu_si128((__m128i *) (z),(i))
sort.c:       |                            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
sort.c: sort.c:277:3: note: in expansion of macro 'int64x2_store'
sort.c:   277 |   int64x2_store(x+pos4-2,int8x16_varextract(x3,int8x16_add(int8x16_set(16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31),int8x16_broadcast(8*(pos4&1)))));
sort.c:       |   ^~~~~~~~~~~~~
sort.c: /usr/lib/gcc/x86_64-linux-gnu/13/include/tmmintrin.h:136:1: error: inlining failed in call to 'always_inline' '_mm_shuffle_epi8': target specific option mismatch
sort.c:   136 | _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
sort.c:       | ^~~~~~~~~~~~~~~~
sort.c: sort.c:18:28: note: called from here
sort.c:    18 | #define int64x2_store(z,i) _mm_storeu_si128((__m128i *) (z),(i))
sort.c:       |                            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
sort.c: sort.c:276:3: note: in expansion of macro 'int64x2_store'
sort.c:   276 |   int64x2_store(x+pos5-2,int8x16_varextract(y1,int8x16_add(int8x16_set(16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31),int8x16_broadcast(8*(pos5&1)))));
sort.c:       |   ^~~~~~~~~~~~~
sort.c: /usr/lib/gcc/x86_64-linux-gnu/13/include/tmmintrin.h:136:1: error: inlining failed in call to 'always_inline' '_mm_shuffle_epi8': target specific option mismatch
sort.c:   136 | _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
sort.c:       | ^~~~~~~~~~~~~~~~
sort.c: sort.c:18:28: note: called from here
sort.c: ...

Number of similar (implementation,compiler) pairs: 4, namely:

Implementation	Compiler
`sse42`	`gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`
`sse42`	`gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`
`sse42`	`gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`
`sse42`	`gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`

Compiler output

try.cpp: In file included from try-anything.cpp:22:
try.cpp: .../supercop-data/hydra5/amd64/include/stq.h:32:13: warning: 'void stq_sort(std::vector<double>, long long int)' defined but not used [-Wunused-function]
try.cpp:    32 | static void stq_sort(stq_doublearray S,long long n)
try.cpp:       |             ^~~~~~~~
try.cpp: In file included from try-anything.cpp:22:
try.cpp: .../supercop-data/hydra5/amd64/include/stq.h:32:13: warning: 'void stq_sort(std::vector<double>, long long int)' defined but not used [-Wunused-function]
try.cpp:    32 | static void stq_sort(stq_doublearray S,long long n)
try.cpp:       |             ^~~~~~~~
measure.cpp: In file included from measure-anything.cpp:19:
measure.cpp: .../supercop-data/hydra5/amd64/include/stq.h:32:13: warning: 'void stq_sort(std::vector<double>, long long int)' defined but not used [-Wunused-function]
measure.cpp:    32 | static void stq_sort(stq_doublearray S,long long n)
measure.cpp:       |             ^~~~~~~~

Number of similar (implementation,compiler) pairs: 4, namely:

Implementation	Compiler
`T:stdsort`	`g++ -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`
`T:stdsort`	`g++ -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`
`T:stdsort`	`g++ -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`
`T:stdsort`	`g++ -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`

Passed TIMECOP

TIMECOP iterations: 10

Number of similar (implementation,compiler) pairs: 18, namely:

Implementation	Compiler
`portable4`	`clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`portable4`	`clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`portable4`	`clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`portable4`	`clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`portable4`	`clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`portable4`	`gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`
`portable4`	`gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`
`portable4`	`gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`
`portable4`	`gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`
`x863`	`clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`x863`	`clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`x863`	`clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`x863`	`clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`x863`	`clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`x863`	`gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`
`x863`	`gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`
`x863`	`gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`
`x863`	`gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`