Implementation notes: amd64, panther, crypto_core/scale3sntrup953
Computer: panther
Microarchitecture: amd64; Tiger Lake (806c1)
Architecture: amd64
CPU ID: GenuineIntel-000806c1-00-bfebfbff
SUPERCOP version: 20240625
Operation: crypto_core
Primitive: scale3sntrup953
Time (cycles) | Object size (bytes: text data bss) | Test size (bytes: text data bss) | Implementation | Compiler | Benchmark date | SUPERCOP version |
279 | 278 0 0 | 12376 780 984 | avx | gcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
279 | 270 0 0 | 10835 756 952 | avx | gcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
281 | 274 0 0 | 11935 772 984 | avx | gcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
282 | 278 0 0 | 14448 780 984 | avx | gcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
297 | 197 0 0 | 11467 820 920 | avx | clang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
307 | 327 0 0 | 14741 828 952 | avx | clang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
310 | 327 0 0 | 14725 828 952 | avx | clang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
311 | 327 0 0 | 11763 820 920 | avx | clang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
359 | 497 0 0 | 14800 780 984 | ref | gcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
546 | 707 0 0 | 15253 828 952 | ref | clang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
595 | 905 0 0 | 14821 828 920 | ref | clang_-mcpu=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
611 | 707 0 0 | 15237 828 952 | ref | clang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
768 | 478 0 0 | 11859 820 920 | ref | clang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
3434 | 189 0 0 | 12384 780 984 | ref | gcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
3564 | 184 0 0 | 11927 772 984 | ref | gcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
4524 | 189 0 0 | 11731 820 920 | ref | clang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
9926 | 186 0 0 | 10943 764 952 | ref | gcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
Compiler output
core.c: core.c:20:18: error: always_inline function '_mm256_loadu_si256' requires target feature 'avx', but would be inlined into function 'crypto_core_scale3sntrup953_avx_constbranchindex' that is compiled without support for 'avx'
core.c: __m256i save = _mm256_loadu_si256((__m256i *) (inbytes+2*i));
core.c: ^
core.c: core.c:20:18: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
core.c: core.c:25:19: error: always_inline function '_mm256_loadu_si256' requires target feature 'avx', but would be inlined into function 'crypto_core_scale3sntrup953_avx_constbranchindex' that is compiled without support for 'avx'
core.c: __m256i x = _mm256_loadu_si256((__m256i *) inbytes);
core.c: ^
core.c: core.c:25:19: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
core.c: core.c:27:32: error: always_inline function '_mm256_set1_epi16' requires target feature 'avx', but would be inlined into function 'crypto_core_scale3sntrup953_avx_constbranchindex' that is compiled without support for 'avx'
core.c: x = _mm256_mullo_epi16(x,_mm256_set1_epi16(3));
core.c: ^
core.c: core.c:27:32: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
core.c: core.c:27:11: error: always_inline function '_mm256_mullo_epi16' requires target feature 'avx2', but would be inlined into function 'crypto_core_scale3sntrup953_avx_constbranchindex' that is compiled without support for 'avx2'
core.c: x = _mm256_mullo_epi16(x,_mm256_set1_epi16(3));
core.c: ^
core.c: core.c:27:11: error: AVX vector argument of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
core.c: core.c:28:30: error: always_inline function '_mm256_set1_epi16' requires target feature 'avx', but would be inlined into function 'crypto_core_scale3sntrup953_avx_constbranchindex' that is compiled without support for 'avx'
core.c: x = _mm256_sub_epi16(x,_mm256_set1_epi16((q+1)/2));
core.c: ^
core.c: core.c:28:30: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
core.c: core.c:28:11: error: always_inline function '_mm256_sub_epi16' requires target feature 'avx2', but would be inlined into function 'crypto_core_scale3sntrup953_avx_constbranchindex' that is compiled without support for 'avx2'
core.c: x = _mm256_sub_epi16(x,_mm256_set1_epi16((q+1)/2));
core.c: ^
core.c: core.c:28:11: error: AVX vector argument of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
core.c: core.c:29:14: error: always_inline function '_mm256_srai_epi16' requires target feature 'avx2', but would be inlined into function 'crypto_core_scale3sntrup953_avx_constbranchindex' that is compiled without support for 'avx2'
core.c: ...
Number of similar (implementation,compiler) pairs: 1, namely:
Implementation | Compiler |
avx | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
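Note on the diagnostics above: they say that `crypto_core_scale3sntrup953_avx_constbranchindex` was compiled without the `avx` and `avx2` target features, so the `always_inline` intrinsics could not be inlined into it. The following is a minimal sketch, not the SUPERCOP source, of one way such code can be made independent of the command-line flags: a per-function `target` attribute plus a runtime CPU check. The names `triple16`, `triple16_avx2` and `triple16_scalar` are hypothetical, and the sketch only reproduces the multiply-by-3 step visible at core.c:27. It assumes gcc or clang on x86.

```c
#include <immintrin.h>
#include <stdint.h>

/* Hypothetical helper, not part of SUPERCOP: multiplies 16 int16 lanes by 3.
   The target attribute enables AVX2 for this one function, so the
   always_inline intrinsics can be inlined even when the command line
   does not enable AVX2. */
__attribute__((target("avx2")))
static void triple16_avx2(int16_t *out, const int16_t *in)
{
  __m256i x = _mm256_loadu_si256((const __m256i *) in);
  x = _mm256_mullo_epi16(x, _mm256_set1_epi16(3));
  _mm256_storeu_si256((__m256i *) out, x);
}

/* Portable fallback: keeps the low 16 bits of each product,
   matching what _mm256_mullo_epi16 stores per lane. */
static void triple16_scalar(int16_t *out, const int16_t *in)
{
  for (int i = 0; i < 16; ++i)
    out[i] = (int16_t) (uint16_t) (3u * (uint16_t) in[i]);
}

/* Runtime dispatch: take the AVX2 path only if the CPU reports AVX2. */
void triple16(int16_t *out, const int16_t *in)
{
  __builtin_cpu_init();
  if (__builtin_cpu_supports("avx2"))
    triple16_avx2(out, in);
  else
    triple16_scalar(out, in);
}
```

Whether this is appropriate for a given SUPERCOP implementation is a separate question; the `avx` implementation directory is presumably meant to be built only with flags that enable AVX2, in which case excluding the failing compiler line is the simpler remedy.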
TIMECOP error (can be valgrind bug)
Process terminating with default action of signal 4 (SIGILL)
Illegal opcode at address 0x402330
at 0x...: st32 (try-anything.c:47)
by 0x...: core (try-anything.c:78)
by 0x...: salsa20 (try-anything.c:89)
by 0x...: canary (try-anything.c:148)
by 0x...: output_prepare (try-anything.c:178)
by 0x...: test (try.c:99)
by 0x...: main (try-anything.c:345)
Number of similar (implementation,compiler) pairs: 2, namely:
Implementation | Compiler |
avx | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
ref | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
TIMECOP error (can be valgrind bug)
Process terminating with default action of signal 4 (SIGILL)
Illegal opcode at address 0x401D7E
at 0x...: core (try-anything.c:73)
by 0x...: salsa20 (try-anything.c:89)
by 0x...: canary (try-anything.c:148)
by 0x...: output_prepare (try-anything.c:178)
by 0x...: test (try.c:99)
by 0x...: main (try-anything.c:345)
Number of similar (implementation,compiler) pairs: 2, namely:
Implementation | Compiler |
avx | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
ref | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
TIMECOP error (can be valgrind bug)
Process terminating with default action of signal 4 (SIGILL)
Illegal opcode at address 0x109FD9
at 0x...: salsa20.part.0 (try-anything.c:102)
by 0x...: salsa20 (try-anything.c:85)
by 0x...: canary (try-anything.c:148)
by 0x...: output_prepare (try-anything.c:178)
by 0x...: test (try.c:99)
by 0x...: main (try-anything.c:345)
Number of similar (implementation,compiler) pairs: 2, namely:
Implementation | Compiler |
avx | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |
ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |
TIMECOP error (can be valgrind bug)
Process terminating with default action of signal 4 (SIGILL)
Illegal opcode at address 0x109E7A
at 0x...: st32 (try-anything.c:47)
by 0x...: core (try-anything.c:78)
by 0x...: salsa20.part.0 (try-anything.c:89)
by 0x...: salsa20 (try-anything.c:85)
by 0x...: canary (try-anything.c:148)
by 0x...: output_prepare (try-anything.c:178)
by 0x...: test (try.c:99)
by 0x...: main (try-anything.c:345)
Number of similar (implementation,compiler) pairs: 2, namely:
Implementation | Compiler |
avx | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |
ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |
TIMECOP error (can be valgrind bug)
Process terminating with default action of signal 4 (SIGILL)
Illegal opcode at address 0x10A168
at 0x...: _mm256_mullo_epi16 (avx2intrin.h:555)
by 0x...: crypto_core_scale3sntrup953_avx_constbranchindex (core.c:27)
by 0x...: test (try.c:106)
by 0x...: main (try-anything.c:345)
Number of similar (implementation,compiler) pairs: 1, namely:
Implementation | Compiler |
avx | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |
Passed TIMECOP
TIMECOP iterations: 1
Number of similar (implementation,compiler) pairs: 8, namely:
Implementation | Compiler |
avx | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
avx | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
avx | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |
ref | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
ref | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
ref | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
ref | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |
ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |