Test results for amd64, saber214, crypto_decode/857x1723
[Page version: 20241023 11:56:08]
Computer: saber214
Microarchitecture: amd64; Bulldozer (600f20)
Architecture: amd64
CPU ID: AuthenticAMD-00600f20-1789c3f5
SUPERCOP version: 20241022
Operation: crypto_decode
Primitive: 857x1723
Time (cycles) | Object size (text data bss, bytes) | Test size (text data bss, bytes) | Implementation | Compiler | Benchmark date | SUPERCOP version |
17227 | 2345 0 0 | 13172 816 728 | int16 | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20241011 | 20241011 |
17238 | 2999 0 0 | 16212 816 744 | int16 | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20241011 | 20241011 |
17865 | 2040 0 0 | 11206 808 728 | int16 | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20241011 | 20241011 |
19160 | 3035 0 0 | 14964 816 728 | int16 | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20241011 | 20241011 |
19177 | 2236 0 0 | 11796 816 728 | int16 | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20241011 | 20241011 |
19649 | 2124 0 0 | 11401 752 768 | int16 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20241011 | 20241011 |
19753 | 3181 0 0 | 15134 776 800 | int16 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20241011 | 20241011 |
19852 | 2214 0 0 | 12477 768 800 | int16 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20241011 | 20241011 |
19938 | 2202 0 0 | 12694 776 800 | int16 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20241011 | 20241011 |
31051 | 3119 0 0 | 13932 816 728 | portable | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20241011 | 20241011 |
31302 | 3870 0 0 | 17084 816 744 | portable | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20241011 | 20241011 |
31869 | 2757 0 0 | 11926 808 728 | portable | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20241011 | 20241011 |
32640 | 2857 0 0 | 12444 816 728 | portable | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20241011 | 20241011 |
32901 | 2953 0 0 | 13446 776 800 | portable | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20241011 | 20241011 |
32935 | 4432 0 0 | 16390 776 800 | portable | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20241011 | 20241011 |
34280 | 4072 0 0 | 15996 816 728 | portable | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20241011 | 20241011 |
65547 | 1830 0 0 | 13734 776 800 | ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20241011 | 20241011 |
66231 | 2284 0 0 | 14228 816 728 | ref | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20241011 | 20241011 |
66826 | 1811 0 0 | 12652 816 728 | ref | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20241011 | 20241011 |
66925 | 2299 0 0 | 15540 816 744 | ref | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20241011 | 20241011 |
69426 | 1618 0 0 | 12062 776 800 | ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20241011 | 20241011 |
70387 | 1805 0 0 | 12085 768 800 | portable | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20241011 | 20241011 |
71637 | 1162 0 0 | 10310 808 728 | ref | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20241011 | 20241011 |
73342 | 1169 0 0 | 10740 816 728 | ref | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20241011 | 20241011 |
73677 | 1732 0 0 | 11065 752 768 | portable | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20241011 | 20241011 |
83844 | 1673 0 0 | 11853 768 800 | ref | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20241011 | 20241011 |
89575 | 1530 0 0 | 10729 752 768 | ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20241011 | 20241011 |
Compiler output
decode.c: decode.c:245:10: error: always_inline function '_mm256_srli_epi16' requires target feature 'avx2', but would be inlined into function 'crypto_decode_857x1723_avx_constbranchindex' that is compiled without support for 'avx2'
decode.c: S1 = _mm256_srli_epi16(S0,8);
decode.c: ^
decode.c: decode.c:260:10: error: always_inline function '_mm256_unpacklo_epi16' requires target feature 'avx2', but would be inlined into function 'crypto_decode_857x1723_avx_constbranchindex' that is compiled without support for 'avx2'
decode.c: B0 = _mm256_unpacklo_epi16(A0,A1);
decode.c: ^
decode.c: decode.c:261:10: error: always_inline function '_mm256_unpackhi_epi16' requires target feature 'avx2', but would be inlined into function 'crypto_decode_857x1723_avx_constbranchindex' that is compiled without support for 'avx2'
decode.c: B1 = _mm256_unpackhi_epi16(A0,A1);
decode.c: ^
decode.c: decode.c:264:10: error: '__builtin_ia32_permti256' needs target feature avx2
decode.c: C0 = _mm256_permute2x128_si256(B0,B1,0x20);
decode.c: ^
decode.c: /usr/lib/llvm-14/lib/clang/14.0.0/include/avx2intrin.h:821:13: note: expanded from macro '_mm256_permute2x128_si256'
decode.c: ((__m256i)__builtin_ia32_permti256((__m256i)(V1), (__m256i)(V2), (int)(M)))
decode.c: ^
decode.c: decode.c:265:10: error: '__builtin_ia32_permti256' needs target feature avx2
decode.c: C1 = _mm256_permute2x128_si256(B0,B1,0x31);
decode.c: ^
decode.c: /usr/lib/llvm-14/lib/clang/14.0.0/include/avx2intrin.h:821:13: note: expanded from macro '_mm256_permute2x128_si256'
decode.c: ((__m256i)__builtin_ia32_permti256((__m256i)(V1), (__m256i)(V2), (int)(M)))
decode.c: ^
decode.c: decode.c:306:10: error: always_inline function '_mm256_unpacklo_epi16' requires target feature 'avx2', but would be inlined into function 'crypto_decode_857x1723_avx_constbranchindex' that is compiled without support for 'avx2'
decode.c: B0 = _mm256_unpacklo_epi16(A0,A1);
decode.c: ^
decode.c: decode.c:307:10: error: always_inline function '_mm256_unpackhi_epi16' requires target feature 'avx2', but would be inlined into function 'crypto_decode_857x1723_avx_constbranchindex' that is compiled without support for 'avx2'
decode.c: ...
Number of similar (implementation,compiler) pairs: 4, namely:
Implementation | Compiler |
avx | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
avx | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
avx | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
avx | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
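The errors above are consistent with the avx implementation calling AVX2 intrinsics (for example `_mm256_srli_epi16`) while `-march=native` on this Bulldozer machine enables AVX but not AVX2. A minimal sketch of a compile-time guard follows. The helper name `add_int16` is hypothetical and does not appear in decode.c; this is an illustration under that assumption, not the way SUPERCOP or the implementation handles the case.

```c
#include <stddef.h>
#include <stdint.h>

#if defined(__AVX2__)
#include <immintrin.h>
#endif

/* Hypothetical helper (not from decode.c): out[i] = x[i] + y[i], wrapping mod 2^16. */
void add_int16(int16_t *out, const int16_t *x, const int16_t *y, size_t n)
{
  size_t i = 0;
#if defined(__AVX2__)
  /* 16 int16 lanes per 256-bit vector; compiled only when AVX2 is enabled. */
  for (; i + 16 <= n; i += 16) {
    __m256i a = _mm256_loadu_si256((const __m256i *) (x + i));
    __m256i b = _mm256_loadu_si256((const __m256i *) (y + i));
    _mm256_storeu_si256((__m256i *) (out + i), _mm256_add_epi16(a, b));
  }
#endif
  /* Scalar path: handles the tail, or every element when AVX2 is not enabled. */
  for (; i < n; i++)
    out[i] = (int16_t) (uint16_t) ((uint16_t) x[i] + (uint16_t) y[i]);
}
```

Built with `-march=native` on a CPU without AVX2, the vector loop is excluded by the preprocessor, so the always_inline intrinsics are never referenced and only the scalar loop is compiled.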
Compiler output
decode.c: decode.c:243:15: error: always_inline function '_mm256_loadu_si256' requires target feature 'avx', but would be inlined into function 'crypto_decode_857x1723_avx_constbranchindex' that is compiled without support for 'avx'
decode.c: A2 = A0 = _mm256_loadu_si256((__m256i *) &R5[i]);
decode.c: ^
decode.c: decode.c:243:15: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
decode.c: decode.c:244:10: error: always_inline function '_mm256_loadu_si256' requires target feature 'avx', but would be inlined into function 'crypto_decode_857x1723_avx_constbranchindex' that is compiled without support for 'avx'
decode.c: S0 = _mm256_loadu_si256((__m256i *) (s+2*i));
decode.c: ^
decode.c: decode.c:244:10: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
decode.c: decode.c:245:10: error: always_inline function '_mm256_srli_epi16' requires target feature 'avx2', but would be inlined into function 'crypto_decode_857x1723_avx_constbranchindex' that is compiled without support for 'avx2'
decode.c: S1 = _mm256_srli_epi16(S0,8);
decode.c: ^
decode.c: decode.c:245:10: error: AVX vector argument of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
decode.c: decode.c:246:11: error: always_inline function '_mm256_set1_epi16' requires target feature 'avx', but would be inlined into function 'crypto_decode_857x1723_avx_constbranchindex' that is compiled without support for 'avx'
decode.c: S0 &= _mm256_set1_epi16(255);
decode.c: ^
decode.c: decode.c:246:11: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
decode.c: decode.c:247:14: warning: AVX vector argument of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI [-Wpsabi]
decode.c: A0 = sub(mulhiconst(A0,-259),mulhiconst(mulloconst(A0,-3971),4225)); /* -2178...2112 */
decode.c: ^
decode.c: decode.c:247:45: warning: AVX vector argument of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI [-Wpsabi]
decode.c: A0 = sub(mulhiconst(A0,-259),mulhiconst(mulloconst(A0,-3971),4225)); /* -2178...2112 */
decode.c: ^
decode.c: decode.c:247:34: warning: AVX vector argument of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI [-Wpsabi]
decode.c: A0 = sub(mulhiconst(A0,-259),mulhiconst(mulloconst(A0,-3971),4225)); /* -2178...2112 */
decode.c: ^
decode.c: ...
Number of similar (implementation,compiler) pairs: 1, namely:
Implementation | Compiler |
avx | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
Compiler output
decode.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/immintrin.h:47,
decode.c: from decode.c:3:
decode.c: decode.c: In function 'add':
decode.c: /usr/lib/gcc/x86_64-linux-gnu/11/include/avx2intrin.h:112:1: error: inlining failed in call to 'always_inline' '_mm256_add_epi16': target specific option mismatch
decode.c: 112 | _mm256_add_epi16 (__m256i __A, __m256i __B)
decode.c: | ^~~~~~~~~~~~~~~~
decode.c: decode.c:22:10: note: called from here
decode.c: 22 | return _mm256_add_epi16(x,y);
decode.c: | ^~~~~~~~~~~~~~~~~~~~~
Number of similar (implementation,compiler) pairs: 4, namely:
Implementation | Compiler |
avx | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0) |
avx | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0) |
avx | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0) |
avx | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0) |
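GCC's "target specific option mismatch" means that `_mm256_add_epi16` was called from a function compiled without AVX2. One general way to keep such a call compilable is to enable AVX2 for a single function with a target attribute and select it at run time. The sketch below assumes GCC or Clang on x86-64; the names `add16`, `add16_avx2`, `add16_scalar` and `HAVE_AVX2_DISPATCH` are hypothetical and are not taken from decode.c or SUPERCOP.

```c
#include <stddef.h>
#include <stdint.h>

#if (defined(__GNUC__) || defined(__clang__)) && defined(__x86_64__)
#include <immintrin.h>
#define HAVE_AVX2_DISPATCH 1

/* AVX2 is enabled for this one function only, regardless of -march. */
__attribute__((target("avx2")))
static void add16_avx2(int16_t *out, const int16_t *x, const int16_t *y)
{
  __m256i a = _mm256_loadu_si256((const __m256i *) x);
  __m256i b = _mm256_loadu_si256((const __m256i *) y);
  _mm256_storeu_si256((__m256i *) out, _mm256_add_epi16(a, b));
}
#endif

/* Portable fallback: the same 16-lane addition, wrapping mod 2^16. */
static void add16_scalar(int16_t *out, const int16_t *x, const int16_t *y)
{
  for (size_t i = 0; i < 16; i++)
    out[i] = (int16_t) (uint16_t) ((uint16_t) x[i] + (uint16_t) y[i]);
}

/* Hypothetical entry point: adds exactly 16 int16 values. */
void add16(int16_t *out, const int16_t *x, const int16_t *y)
{
#ifdef HAVE_AVX2_DISPATCH
  /* Take the AVX2 path only if the running CPU reports AVX2. */
  if (__builtin_cpu_supports("avx2")) {
    add16_avx2(out, x, y);
    return;
  }
#endif
  add16_scalar(out, x, y);
}
```

On a CPU without AVX2, such as the Bulldozer machine measured here, the run-time check would fail and the scalar path would run, so this avoids the build error but gives no vector speedup.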
TIMECOP error (can be valgrind bug)
error 111
Process terminating with default action of signal 4 (SIGILL)
Illegal opcode at address 0x109C3D
at 0x...: salsa20.part.0 (try-anything.c:102)
by 0x...: salsa20 (try-anything.c:85)
by 0x...: canary (try-anything.c:148)
by 0x...: input_prepare (try-anything.c:163)
by 0x...: test (try.c:95)
by 0x...: main (try-anything.c:345)
Number of similar (implementation,compiler) pairs: 3, namely:
Implementation | Compiler |
int16 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0) |
portable | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0) |
ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0) |
TIMECOP error (can be valgrind bug)
error 111
Process terminating with default action of signal 4 (SIGILL)
Illegal opcode at address 0x10988A
at 0x...: core (try-anything.c:53)
by 0x...: salsa20.part.0 (try-anything.c:89)
by 0x...: salsa20 (try-anything.c:85)
by 0x...: testvector (try-anything.c:124)
by 0x...: input_prepare (try-anything.c:162)
by 0x...: test (try.c:95)
by 0x...: main (try-anything.c:345)
Number of similar (implementation,compiler) pairs: 3, namely:
Implementation | Compiler |
int16 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0) |
portable | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0) |
ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0) |
TIMECOP error (can be valgrind bug)
error 111
Process terminating with default action of signal 4 (SIGILL)
Illegal opcode at address 0x10A700
at 0x...: crypto_decode_857x1723_ref_constbranchindex (wrapper.c:17)
by 0x...: test (try.c:99)
by 0x...: main (try-anything.c:345)
Number of similar (implementation,compiler) pairs: 1, namely:
Implementation | Compiler |
ref | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
TIMECOP error (can be valgrind bug)
error 111
Process terminating with default action of signal 4 (SIGILL)
Illegal opcode at address 0x10A7A0
at 0x...: crypto_decode_857x1723_ref_constbranchindex (wrapper.c:17)
by 0x...: test (try.c:99)
by 0x...: main (try-anything.c:345)
Number of similar (implementation,compiler) pairs: 1, namely:
Implementation | Compiler |
ref | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
TIMECOP error (can be valgrind bug)
error 111
Process terminating with default action of signal 4 (SIGILL)
Illegal opcode at address 0x109E14
at 0x...: crypto_decode_857x1723_ref_constbranchindex (wrapper.c:10)
by 0x...: test (try.c:99)
by 0x...: main (try-anything.c:345)
Number of similar (implementation,compiler) pairs: 1, namely:
Implementation | Compiler |
ref | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
TIMECOP error (can be valgrind bug)
error 111
Process terminating with default action of signal 4 (SIGILL)
Illegal opcode at address 0x10A277
at 0x...: crypto_decode_857x1723_ref_constbranchindex_Decode (Decode.c:32)
by 0x...: crypto_decode_857x1723_ref_constbranchindex (wrapper.c:16)
by 0x...: test (try.c:99)
by 0x...: main (try-anything.c:345)
Number of similar (implementation,compiler) pairs: 1, namely:
Implementation | Compiler |
ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0) |
Passed TIMECOP
TIMECOP iterations: 10
Number of similar (implementation,compiler) pairs: 17, namely:
Implementation | Compiler |
int16 | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
int16 | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
int16 | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
int16 | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
int16 | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
int16 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0) |
int16 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0) |
portable | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
portable | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
portable | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
portable | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
portable | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
portable | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0) |
portable | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0) |
ref | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
ref | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
ref | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0) |