Test results for amd64, bolero, crypto_core/invsntrup761
[Page version: 20250926 09:38:42]
Measurements for amd64, bolero, crypto_core
Test results for amd64, bolero, crypto_core
Test results for crypto_core/invsntrup761
Computer: bolero
Microarchitecture: amd64; Broadwell+AES (406f1)
Architecture: amd64
CPU ID: GenuineIntel-000406f1-1fc9cbf5
SUPERCOP version: 20250922
Operation: crypto_core
Primitive: invsntrup761
Time | Object size | Test size | Implementation | Compiler | Benchmark date | SUPERCOP version |
667152 | 236950 0 0 | 245823 800 856 | jumpdivsteps | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250923 | 20250922 |
697516 | 246097 0 0 | 260037 784 832 | jumpdivsteps | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250923 | 20250922 |
700792 | 250042 0 0 | 259007 800 856 | jumpdivsteps | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250923 | 20250922 |
703228 | 260744 0 0 | 272645 784 832 | jumpdivsteps | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250923 | 20250922 |
717020 | 241016 0 0 | 245936 800 760 | jumpdivsteps | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250923 | 20250922 |
724204 | 211857 0 0 | 224057 792 856 | jumpdivsteps | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250923 | 20250922 |
790980 | 3359 0 0 | 18204 816 776 | avx | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 |
826508 | 263371 0 0 | 275293 784 832 | jumpdivsteps | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250923 | 20250922 |
835972 | 4303 0 0 | 19460 816 776 | avx | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 |
838044 | 1705 0 0 | 12812 816 760 | avx | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 |
840352 | 1563 0 0 | 13422 808 856 | avx | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 |
849988 | 1797 0 0 | 13453 768 832 | avx | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 |
884756 | 204750 0 0 | 215636 768 800 | jumpdivsteps | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250923 | 20250922 |
911440 | 3689 0 0 | 17622 776 832 | avx | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 |
1031100 | 1753 0 0 | 13678 776 832 | avx | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 |
1041892 | 1477 0 0 | 12117 760 800 | avx | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 |
4599072 | 8525 0 0 | 22454 776 832 | portable | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 |
8603572 | 3943 0 0 | 18804 816 776 | portable | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 |
8639820 | 4951 0 0 | 20124 816 776 | portable | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 |
9204992 | 4076 0 0 | 18080 784 832 | ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 |
11655896 | 3796 0 0 | 17284 816 760 | portable | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 |
15807332 | 1689 0 0 | 12780 816 760 | portable | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 |
15822396 | 1585 0 0 | 13446 808 856 | portable | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 |
17191888 | 1812 0 0 | 13702 776 832 | portable | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 |
18920220 | 1798 0 0 | 13413 768 832 | portable | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 |
25037928 | 4104 0 0 | 19342 824 776 | ref | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 |
25135056 | 3112 0 0 | 18038 824 776 | ref | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 |
32980528 | 2894 0 0 | 16454 824 760 | ref | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 |
35455780 | 1076 0 0 | 13000 816 856 | ref | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 |
35658720 | 1048 0 0 | 13008 784 832 | ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 |
35658960 | 1179 0 0 | 12342 824 760 | ref | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 |
38517628 | 1155 0 0 | 12765 768 832 | ref | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 |
38605460 | 1510 0 0 | 12125 760 800 | portable | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 |
39713372 | 951 0 0 | 11607 768 800 | ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall | 20250416 | 20250415 |
Compiler output
recip.c: recip.c:73:19: error: always_inline function '_mm256_set1_epi16' requires target feature 'avx', but would be inlined into function 'vectormodq_swapeliminate' that is compiled without support for 'avx'
recip.c: __m256i f0vec = _mm256_set1_epi16(f0);
recip.c: ^
recip.c: recip.c:73:19: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
recip.c: recip.c:74:19: error: always_inline function '_mm256_set1_epi16' requires target feature 'avx', but would be inlined into function 'vectormodq_swapeliminate' that is compiled without support for 'avx'
recip.c: __m256i g0vec = _mm256_set1_epi16(g0);
recip.c: ^
recip.c: recip.c:74:19: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
recip.c: recip.c:75:48: error: always_inline function '_mm256_set1_epi16' requires target feature 'avx', but would be inlined into function 'vectormodq_swapeliminate' that is compiled without support for 'avx'
recip.c: __m256i f0vecqinv = _mm256_mullo_epi16(f0vec,qinvvec);
recip.c: ^
recip.c: recip.c:59:17: note: expanded from macro 'qinvvec'
recip.c: #define qinvvec _mm256_set1_epi16(qinv)
recip.c: ^
recip.c: recip.c:75:48: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
recip.c: recip.c:59:17: note: expanded from macro 'qinvvec'
recip.c: #define qinvvec _mm256_set1_epi16(qinv)
recip.c: ^
recip.c: recip.c:75:23: error: always_inline function '_mm256_mullo_epi16' requires target feature 'avx2', but would be inlined into function 'vectormodq_swapeliminate' that is compiled without support for 'avx2'
recip.c: __m256i f0vecqinv = _mm256_mullo_epi16(f0vec,qinvvec);
recip.c: ^
recip.c: recip.c:75:23: error: AVX vector argument of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
recip.c: recip.c:76:48: error: always_inline function '_mm256_set1_epi16' requires target feature 'avx', but would be inlined into function 'vectormodq_swapeliminate' that is compiled without support for 'avx'
recip.c: __m256i g0vecqinv = _mm256_mullo_epi16(g0vec,qinvvec);
recip.c: ^
recip.c: ...
Number of similar (implementation,compiler) pairs: 1, namely:
Implementation | Compiler |
avx | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) |
Compiler output
avx-768.c: avx-768.c:716:35: warning: unused variable 'h1' [-Wunused-variable]
avx-768.c: 716 | __m256i f0,f1,f2,f3,g0,g1,g2,g3,h1;
avx-768.c: | ^~
avx-768.c: avx-768.c:847:35: warning: unused variable 'h1' [-Wunused-variable]
avx-768.c: 847 | __m256i f0,f1,f2,f3,g0,g1,g2,g3,h1;
avx-768.c: | ^~
avx-768.c: avx-768.c:973:35: warning: unused variable 'h1' [-Wunused-variable]
avx-768.c: 973 | __m256i f0,f1,f2,f3,g0,g1,g2,g3,h1;
avx-768.c: | ^~
avx-768.c: avx-768.c:1176:23: warning: unused variable 'f4' [-Wunused-variable]
avx-768.c: 1176 | __m256i f0,f1,f2,f3,f4,f5,g0,g1,g2,g3,g4,g5;
avx-768.c: | ^~
avx-768.c: avx-768.c:1176:26: warning: unused variable 'f5' [-Wunused-variable]
avx-768.c: 1176 | __m256i f0,f1,f2,f3,f4,f5,g0,g1,g2,g3,g4,g5;
avx-768.c: | ^~
avx-768.c: avx-768.c:1237:17: warning: unused variable 'f2' [-Wunused-variable]
avx-768.c: 1237 | __m256i f0,f1,f2,f3,f4,f5,g0,g1,g2,g3,g4,g5;
avx-768.c: | ^~
avx-768.c: avx-768.c:1237:20: warning: unused variable 'f3' [-Wunused-variable]
avx-768.c: 1237 | __m256i f0,f1,f2,f3,f4,f5,g0,g1,g2,g3,g4,g5;
avx-768.c: | ^~
avx-768.c: avx-768.c:1237:23: warning: unused variable 'f4' [-Wunused-variable]
avx-768.c: 1237 | __m256i f0,f1,f2,f3,f4,f5,g0,g1,g2,g3,g4,g5;
avx-768.c: | ^~
avx-768.c: avx-768.c:1237:26: warning: unused variable 'f5' [-Wunused-variable]
avx-768.c: ...
avx.c: avx.c:23:21: warning: unused function 'mullo' [-Wunused-function]
avx.c: 23 | static inline int16 mullo(int16 x,int16 y)
avx.c: | ^~~~~
avx.c: avx.c:28:21: warning: unused function 'mulhi' [-Wunused-function]
avx.c: 28 | static inline int16 mulhi(int16 x,int16 y)
avx.c: | ^~~~~
avx.c: avx.c:33:21: warning: unused function 'mulhrs' [-Wunused-function]
avx.c: 33 | static inline int16 mulhrs(int16 x,int16 y)
avx.c: | ^~~~~~
avx.c: 3 warnings generated.
avx3-512.c: avx3-512.c:1055:35: warning: unused variable 'h0' [-Wunused-variable]
avx3-512.c: 1055 | __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx3-512.c: | ^~
avx3-512.c: avx3-512.c:1055:38: warning: unused variable 'h1' [-Wunused-variable]
avx3-512.c: 1055 | __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx3-512.c: | ^~
avx3-512.c: avx3-512.c:1055:41: warning: unused variable 'h2' [-Wunused-variable]
avx3-512.c: 1055 | __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx3-512.c: | ^~
avx3-512.c: avx3-512.c:1055:44: warning: unused variable 'h3' [-Wunused-variable]
avx3-512.c: 1055 | __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx3-512.c: | ^~
avx3-512.c: avx3-512.c:1056:10: warning: unused variable 'origf' [-Wunused-variable]
avx3-512.c: 1056 | int16 *origf = f;
avx3-512.c: | ^~~~~
avx3-512.c: avx3-512.c:1019:13: warning: unused function 'ntt512_7681' [-Wunused-function]
avx3-512.c: 1019 | static void ntt512_7681(int16 *f,int reps)
avx3-512.c: | ^~~~~~~~~~~
avx3-512.c: avx3-512.c:1030:13: warning: unused function 'ntt512_10753' [-Wunused-function]
avx3-512.c: 1030 | static void ntt512_10753(int16 *f,int reps)
avx3-512.c: | ^~~~~~~~~~~~
avx3-512.c: avx3-512.c:1041:13: warning: unused function 'ntt512_12289' [-Wunused-function]
avx3-512.c: 1041 | static void ntt512_12289(int16 *f,int reps)
avx3-512.c: | ^~~~~~~~~~~~
avx3-512.c: 8 warnings generated.
jump32divsteps.c: jump32divsteps.c:113:31: warning: unused variable 'gx' [-Wunused-variable]
jump32divsteps.c: 113 | __m256i minusdelta_v, temp, gx, ux, vx, qx, rx;
jump32divsteps.c: | ^~
jump32divsteps.c: 1 warning generated.
polymul_ntt.c: polymul_ntt.c:166:13: warning: unused function 'polymulntt512_256x256_test' [-Wunused-function]
polymul_ntt.c: 166 | static void polymulntt512_256x256_test(__m256i *h, __m256i *f, __m256i *g)
polymul_ntt.c: | ^~~~~~~~~~~~~~~~~~~~~~~~~~
polymul_ntt.c: 1 warning generated.
Number of similar (implementation,compiler) pairs: 4, namely:
Implementation | Compiler |
jumpdivsteps | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) |
jumpdivsteps | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) |
jumpdivsteps | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
jumpdivsteps | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) |
Compiler output
avx-768.c: avx-768.c:544:36: error: invalid output size for constraint '+x'
avx-768.c: 544 | __asm__("vpsubw %1,%0,%0" : "+x"(a),"+x"(b));
avx-768.c: | ^
avx-768.c: avx-768.c:550:36: error: invalid output size for constraint '+x'
avx-768.c: 550 | __asm__("vpaddw %1,%0,%0" : "+x"(a),"+x"(b));
avx-768.c: | ^
avx-768.c: avx-768.c:716:35: warning: unused variable 'h1' [-Wunused-variable]
avx-768.c: 716 | __m256i f0,f1,f2,f3,g0,g1,g2,g3,h1;
avx-768.c: | ^~
avx-768.c: avx-768.c:847:35: warning: unused variable 'h1' [-Wunused-variable]
avx-768.c: 847 | __m256i f0,f1,f2,f3,g0,g1,g2,g3,h1;
avx-768.c: | ^~
avx-768.c: avx-768.c:973:35: warning: unused variable 'h1' [-Wunused-variable]
avx-768.c: 973 | __m256i f0,f1,f2,f3,g0,g1,g2,g3,h1;
avx-768.c: | ^~
avx-768.c: avx-768.c:1176:23: warning: unused variable 'f4' [-Wunused-variable]
avx-768.c: 1176 | __m256i f0,f1,f2,f3,f4,f5,g0,g1,g2,g3,g4,g5;
avx-768.c: | ^~
avx-768.c: avx-768.c:1176:26: warning: unused variable 'f5' [-Wunused-variable]
avx-768.c: 1176 | __m256i f0,f1,f2,f3,f4,f5,g0,g1,g2,g3,g4,g5;
avx-768.c: | ^~
avx-768.c: avx-768.c:1237:17: warning: unused variable 'f2' [-Wunused-variable]
avx-768.c: 1237 | __m256i f0,f1,f2,f3,f4,f5,g0,g1,g2,g3,g4,g5;
avx-768.c: | ^~
avx-768.c: avx-768.c:1237:20: warning: unused variable 'f3' [-Wunused-variable]
avx-768.c: ...
Number of similar (implementation,compiler) pairs: 1, namely:
Implementation | Compiler |
jumpdivsteps | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) |
Compiler output
avx-768.c: avx-768.c: In function 'invntt64':
avx-768.c: avx-768.c:716:35: warning: unused variable 'h1' [-Wunused-variable]
avx-768.c: 716 | __m256i f0,f1,f2,f3,g0,g1,g2,g3,h1;
avx-768.c: | ^~
avx-768.c: avx-768.c: In function 'ntt256':
avx-768.c: avx-768.c:847:35: warning: unused variable 'h1' [-Wunused-variable]
avx-768.c: 847 | __m256i f0,f1,f2,f3,g0,g1,g2,g3,h1;
avx-768.c: | ^~
avx-768.c: avx-768.c: In function 'invntt256':
avx-768.c: avx-768.c:973:35: warning: unused variable 'h1' [-Wunused-variable]
avx-768.c: 973 | __m256i f0,f1,f2,f3,g0,g1,g2,g3,h1;
avx-768.c: | ^~
avx-768.c: avx-768.c: In function 'ntt768_512':
avx-768.c: avx-768.c:1176:26: warning: unused variable 'f5' [-Wunused-variable]
avx-768.c: 1176 | __m256i f0,f1,f2,f3,f4,f5,g0,g1,g2,g3,g4,g5;
avx-768.c: | ^~
avx-768.c: avx-768.c:1176:23: warning: unused variable 'f4' [-Wunused-variable]
avx-768.c: 1176 | __m256i f0,f1,f2,f3,f4,f5,g0,g1,g2,g3,g4,g5;
avx-768.c: | ^~
avx-768.c: avx-768.c: In function 'ntt768_256':
avx-768.c: avx-768.c:1237:32: warning: unused variable 'g1' [-Wunused-variable]
avx-768.c: 1237 | __m256i f0,f1,f2,f3,f4,f5,g0,g1,g2,g3,g4,g5;
avx-768.c: | ^~
avx-768.c: avx-768.c:1237:26: warning: unused variable 'f5' [-Wunused-variable]
avx-768.c: 1237 | __m256i f0,f1,f2,f3,f4,f5,g0,g1,g2,g3,g4,g5;
avx-768.c: ...
avx.c: avx.c:437:21: warning: argument 1 of type 'int16x16[16][8]' {aka '__m256i[16][8]'} with mismatched bound [-Warray-parameter=]
avx.c: 437 | void fft64(int16x16 fpad[16][8],const int16x16 f[64])
avx.c: | ~~~~~~~~~^~~~~~~~~~~
avx.c: In file included from avx.c:5:
avx.c: avx.h:53:19: note: previously declared as 'int16x16 (*)[8]' {aka '__m256i (*)[8]'}
avx.c: 53 | extern void fft64(int16x16 (*)[8],const int16x16 *);
avx.c: | ^~~~~~~~~~~~~~~
avx.c: avx.c:437:48: warning: argument 2 of type 'const int16x16[64]' {aka 'const __m256i[64]'} with mismatched bound [-Warray-parameter=]
avx.c: 437 | void fft64(int16x16 fpad[16][8],const int16x16 f[64])
avx.c: | ~~~~~~~~~~~~~~~^~~~~
avx.c: avx.h:53:35: note: previously declared as 'const int16x16 *' {aka 'const __m256i *'}
avx.c: 53 | extern void fft64(int16x16 (*)[8],const int16x16 *);
avx.c: | ^~~~~~~~~~~~~~~~
avx.c: avx.c:687:31: warning: argument 1 of type 'int16x16[64]' {aka '__m256i[64]'} with mismatched bound [-Warray-parameter=]
avx.c: 687 | void unfft64_scale16(int16x16 f[64],int16x16 fpad[16][8])
avx.c: | ~~~~~~~~~^~~~~
avx.c: avx.h:54:29: note: previously declared as 'int16x16 *' {aka '__m256i *'}
avx.c: 54 | extern void unfft64_scale16(int16x16 *,int16x16 (*)[8]);
avx.c: | ^~~~~~~~~~
avx.c: avx.c:687:46: warning: argument 2 of type 'int16x16[16][8]' {aka '__m256i[16][8]'} with mismatched bound [-Warray-parameter=]
avx.c: 687 | void unfft64_scale16(int16x16 f[64],int16x16 fpad[16][8])
avx.c: | ~~~~~~~~~^~~~~~~~~~~
avx.c: avx.h:54:40: note: previously declared as 'int16x16 (*)[8]' {aka '__m256i (*)[8]'}
avx.c: 54 | extern void unfft64_scale16(int16x16 *,int16x16 (*)[8]);
avx.c: | ^~~~~~~~~~~~~~~
avx.c: ...
avx3-512.c: avx3-512.c: In function 'invntt512':
avx3-512.c: avx3-512.c:1056:10: warning: unused variable 'origf' [-Wunused-variable]
avx3-512.c: 1056 | int16 *origf = f;
avx3-512.c: | ^~~~~
avx3-512.c: avx3-512.c:1055:44: warning: unused variable 'h3' [-Wunused-variable]
avx3-512.c: 1055 | __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx3-512.c: | ^~
avx3-512.c: avx3-512.c:1055:41: warning: unused variable 'h2' [-Wunused-variable]
avx3-512.c: 1055 | __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx3-512.c: | ^~
avx3-512.c: avx3-512.c:1055:38: warning: unused variable 'h1' [-Wunused-variable]
avx3-512.c: 1055 | __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx3-512.c: | ^~
avx3-512.c: avx3-512.c:1055:35: warning: unused variable 'h0' [-Wunused-variable]
avx3-512.c: 1055 | __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx3-512.c: | ^~
avx3-512.c: avx3-512.c: At top level:
avx3-512.c: avx3-512.c:1041:13: warning: 'ntt512_12289' defined but not used [-Wunused-function]
avx3-512.c: 1041 | static void ntt512_12289(int16 *f,int reps)
avx3-512.c: | ^~~~~~~~~~~~
avx3-512.c: avx3-512.c:1030:13: warning: 'ntt512_10753' defined but not used [-Wunused-function]
avx3-512.c: 1030 | static void ntt512_10753(int16 *f,int reps)
avx3-512.c: | ^~~~~~~~~~~~
avx3-512.c: avx3-512.c:1019:13: warning: 'ntt512_7681' defined but not used [-Wunused-function]
avx3-512.c: 1019 | static void ntt512_7681(int16 *f,int reps)
avx3-512.c: ...
jump32divsteps.c: jump32divsteps.c: In function 'crypto_core_invsntrup761_jumpdivsteps_constbranchindex_jump32xdivsteps':
jump32divsteps.c: jump32divsteps.c:113:31: warning: unused variable 'gx' [-Wunused-variable]
jump32divsteps.c: 113 | __m256i minusdelta_v, temp, gx, ux, vx, qx, rx;
jump32divsteps.c: | ^~
polymul_ntt.c: polymul_ntt.c:166:13: warning: 'polymulntt512_256x256_test' defined but not used [-Wunused-function]
polymul_ntt.c: 166 | static void polymulntt512_256x256_test(__m256i *h, __m256i *f, __m256i *g)
polymul_ntt.c: | ^~~~~~~~~~~~~~~~~~~~~~~~~~
Number of similar (implementation,compiler) pairs: 4, namely:
Implementation | Compiler |
jumpdivsteps | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) |
jumpdivsteps | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) |
jumpdivsteps | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) |
jumpdivsteps | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) |
TIMECOP error (can be valgrind bug)
error 1
VEX temporary storage exhausted.
Pool = TEMP, start 0x597cd5e8 curr 0x59c92118 end 0x59c92127 (size 5000000)
vex: the `impossible' happened:
VEX temporary storage exhausted.
Increase N_{TEMPORARY,PERMANENT}_BYTES and recompile.
vex storage: T total 493373008 bytes allocated
vex storage: P total 512 bytes allocated
valgrind: the 'impossible' happened:
LibVEX called failure_exit().
host stacktrace:
at 0x...: ??? (in /usr/libexec/valgrind/memcheck-amd64-linux)
by 0x...: ??? (in /usr/libexec/valgrind/memcheck-amd64-linux)
by 0x...: ??? (in /usr/libexec/valgrind/memcheck-amd64-linux)
by 0x...: ??? (in /usr/libexec/valgrind/memcheck-amd64-linux)
by 0x...: ??? (in /usr/libexec/valgrind/memcheck-amd64-linux)
by 0x...: ??? (in /usr/libexec/valgrind/memcheck-amd64-linux)
by 0x...: ??? (in /usr/libexec/valgrind/memcheck-amd64-linux)
by 0x...: ??? (in /usr/libexec/valgrind/memcheck-amd64-linux)
by 0x...: ??? (in /usr/libexec/valgrind/memcheck-amd64-linux)
by 0x...: ??? (in /usr/libexec/valgrind/memcheck-amd64-linux)
by 0x...: ??? (in /usr/libexec/valgrind/memcheck-amd64-linux)
by 0x...: ??? (in /usr/libexec/valgrind/memcheck-amd64-linux)
by 0x...: ??? (in /usr/libexec/valgrind/memcheck-amd64-linux)
by 0x...: ??? (in /usr/libexec/valgrind/memcheck-amd64-linux)
sched status:
running_tid=1
Thread 1: status = VgTs_Runnable (lwpid 2006388)
at 0x...: _mm256_mullo_epi16 (avx2intrin.h:555)
by 0x...: barrett_fake (gf4591_avx2.h:58)
by 0x...: crypto_core_invsntrup761_jumpdivsteps_constbranchindex_gf_polymul_128x128_avx2_divR (polymul_NxN.c:431)
by 0x...: crypto_core_invsntrup761_jumpdivsteps_constbranchindex_gf_polymul_128x128_avx2_uvqr_fg (polymul_NxN.c:537)
by 0x...: crypto_core_invsntrup761_jumpdivsteps_constbranchindex_jump256divsteps (jumpNdivsteps.c:271)
by 0x...: crypto_core_invsntrup761_jumpdivsteps_constbranchindex_rq_recip3 (rq_recip3_new.c:200)
by 0x...: crypto_core_invsntrup761_jumpdivsteps_constbranchindex (wrapper.c:10)
by 0x...: test (try.c:106)
by 0x...: main (try-anything.c:345)
client stack range: [0x1FFEFEF000 0x1FFF000FFF] client SP: 0x1FFEFEF7E0
valgrind stack range: [0x10090AE000 0x10091ADFFF] top usage: 18232 of 1048576
Note: see also the FAQ in the source distribution.
It contains workarounds to several common problems.
In particular, if Valgrind aborted or crashed after
identifying problems in your program, there's a good chance
that fixing those problems will prevent Valgrind aborting or
crashing, especially if it happened in m_mallocfree.c.
If that doesn't help, please report this bug to: www.valgrind.org
In the bug report, send all the above text, the valgrind
version, and what OS and version you are using. Thanks.
Number of similar (implementation,compiler) pairs: 1, namely:
Implementation | Compiler |
jumpdivsteps | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) |
Passed TIMECOP
TIMECOP iterations: 1
Number of similar (implementation,compiler) pairs: 33, namely:
Implementation | Compiler |
avx | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) |
avx | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) |
avx | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
avx | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) |
avx | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) |
avx | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) |
avx | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) |
avx | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) |
jumpdivsteps | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) |
jumpdivsteps | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) |
jumpdivsteps | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
jumpdivsteps | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) |
jumpdivsteps | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) |
jumpdivsteps | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) |
jumpdivsteps | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) |
portable | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) |
portable | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) |
portable | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
portable | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) |
portable | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) |
portable | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) |
portable | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) |
portable | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) |
portable | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) |
ref | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) |
ref | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) |
ref | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
ref | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) |
ref | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1)) |
ref | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) |
ref | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) |
ref | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) |
ref | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0) |