Test results for amd64, r24000, crypto_core/invsntrup761

[Page version: 20250503 15:35:47]

Measurements for amd64, r24000, crypto_core Test results for amd64, r24000, crypto_core Test results for crypto_core/invsntrup761

Computer: r24000
Microarchitecture: amd64; Coffee Lake (906ea)
Architecture: amd64
CPU ID: GenuineIntel-000906ea-bfebfbff
SUPERCOP version: 20250415
Operation: crypto_core
Primitive: invsntrup761

Time	Object size	Test size	Implementation	Compiler	Benchmark date	SUPERCOP version
565200	249844 0 0	259126 828 1032	`jumpdivsteps`	`clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20250429	20250415
571317	236496 0 0	245654 828 1032	`jumpdivsteps`	`clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20250429	20250415
582952	211484 0 0	224016 820 1032	`jumpdivsteps`	`clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20250429	20250415
600162	245232 0 0	259676 812 1000	`jumpdivsteps`	`gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20250429	20250415
618781	260998 0 0	273388 812 1000	`jumpdivsteps`	`gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20250429	20250415
631399	240869 0 0	246215 828 936	`jumpdivsteps`	`clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20250429	20250415
687081	3403 0 0	18766 828 1032	`avx`	`clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20250429	20250415
687879	4403 0 0	19846 828 1032	`avx`	`clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20250429	20250415
715380	263108 0 0	275460 812 1000	`jumpdivsteps`	`gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20250429	20250415
725435	1703 0 0	13215 828 936	`avx`	`clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20250429	20250415
728476	4164 0 0	18525 804 1000	`avx`	`gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20250429	20250415
747627	1811 0 0	13804 796 1000	`avx`	`gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20250429	20250415
756730	1770 0 0	14037 804 1000	`avx`	`gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20250429	20250415
758564	1580 0 0	14000 820 1032	`avx`	`clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20250429	20250415
791643	204810 0 0	216131 796 968	`jumpdivsteps`	`gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20250429	20250415
810091	1484 0 0	12516 788 968	`avx`	`gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20250429	20250415
1140345	7526 0 0	21917 804 1000	`portable`	`gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20250429	20250415
2026630	5777 0 0	21158 828 1032	`portable`	`clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20250429	20250415
2027400	6769 0 0	22230 828 1032	`portable`	`clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20250429	20250415
7730661	5055 0 0	20553 836 1032	`ref`	`clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20250429	20250415
7735193	4015 0 0	19433 836 1032	`ref`	`clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20250429	20250415
7753467	5731 0 0	20007 828 936	`portable`	`clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20250429	20250415
8001768	4042 0 0	18471 812 1000	`ref`	`gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20250429	20250415
15112689	1811 0 0	13295 828 936	`portable`	`clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20250429	20250415
15357196	1742 0 0	14168 820 1032	`portable`	`clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20250429	20250415
15660836	1948 0 0	14253 804 1000	`portable`	`gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20250429	20250415
17022466	1780 0 0	13796 796 1000	`portable`	`gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20250429	20250415
19782008	3801 0 0	18153 836 936	`ref`	`clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20250429	20250415
29147035	1100 0 0	13431 812 1000	`ref`	`gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20250429	20250415
31613051	1179 0 0	12737 836 936	`ref`	`clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20250429	20250415
33505137	1049 0 0	13562 828 1032	`ref`	`clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20250429	20250415
36666900	954 0 0	12030 796 968	`ref`	`gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20250429	20250415
37746424	1158 0 0	13132 796 1000	`ref`	`gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20250429	20250415
38271614	1513 0 0	12548 788 968	`portable`	`gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20250429	20250415

Compiler output

recip.c: recip.c:73:19: error: always_inline function '_mm256_set1_epi16' requires target feature 'avx', but would be inlined into function 'vectormodq_swapeliminate' that is compiled without support for 'avx'
recip.c:    73 |   __m256i f0vec = _mm256_set1_epi16(f0);
recip.c:       |                   ^
recip.c: recip.c:73:19: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
recip.c: recip.c:74:19: error: always_inline function '_mm256_set1_epi16' requires target feature 'avx', but would be inlined into function 'vectormodq_swapeliminate' that is compiled without support for 'avx'
recip.c:    74 |   __m256i g0vec = _mm256_set1_epi16(g0);
recip.c:       |                   ^
recip.c: recip.c:74:19: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
recip.c: recip.c:75:48: error: always_inline function '_mm256_set1_epi16' requires target feature 'avx', but would be inlined into function 'vectormodq_swapeliminate' that is compiled without support for 'avx'
recip.c:    75 |   __m256i f0vecqinv = _mm256_mullo_epi16(f0vec,qinvvec);
recip.c:       |                                                ^
recip.c: recip.c:59:17: note: expanded from macro 'qinvvec'
recip.c:    59 | #define qinvvec _mm256_set1_epi16(qinv)
recip.c:       |                 ^
recip.c: recip.c:75:48: error: AVX vector return of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
recip.c: recip.c:59:17: note: expanded from macro 'qinvvec'
recip.c:    59 | #define qinvvec _mm256_set1_epi16(qinv)
recip.c:       |                 ^
recip.c: recip.c:75:23: error: always_inline function '_mm256_mullo_epi16' requires target feature 'avx2', but would be inlined into function 'vectormodq_swapeliminate' that is compiled without support for 'avx2'
recip.c:    75 |   __m256i f0vecqinv = _mm256_mullo_epi16(f0vec,qinvvec);
recip.c:       |                       ^
recip.c: recip.c:75:23: error: AVX vector argument of type '__m256i' (vector of 4 'long long' values) without 'avx' enabled changes the ABI
recip.c: recip.c:76:48: error: always_inline function '_mm256_set1_epi16' requires target feature 'avx', but would be inlined into function 'vectormodq_swapeliminate' that is compiled without support for 'avx'
recip.c:    76 |   __m256i g0vecqinv = _mm256_mullo_epi16(g0vec,qinvvec);
recip.c:       |                                                ^
recip.c: ...

Number of similar (implementation,compiler) pairs: 1, namely:

Implementation	Compiler
`avx`	`clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`

Compiler output

avx-768.c: avx-768.c:586:35: warning: unused variable 'h0' [-Wunused-variable]
avx-768.c:   586 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx-768.c:       |                                   ^~
avx-768.c: avx-768.c:586:41: warning: unused variable 'h2' [-Wunused-variable]
avx-768.c:   586 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx-768.c:       |                                         ^~
avx-768.c: avx-768.c:586:44: warning: unused variable 'h3' [-Wunused-variable]
avx-768.c:   586 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx-768.c:       |                                            ^~
avx-768.c: avx-768.c:716:35: warning: unused variable 'h0' [-Wunused-variable]
avx-768.c:   716 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx-768.c:       |                                   ^~
avx-768.c: avx-768.c:716:38: warning: unused variable 'h1' [-Wunused-variable]
avx-768.c:   716 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx-768.c:       |                                      ^~
avx-768.c: avx-768.c:716:41: warning: unused variable 'h2' [-Wunused-variable]
avx-768.c:   716 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx-768.c:       |                                         ^~
avx-768.c: avx-768.c:716:44: warning: unused variable 'h3' [-Wunused-variable]
avx-768.c:   716 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx-768.c:       |                                            ^~
avx-768.c: avx-768.c:847:35: warning: unused variable 'h0' [-Wunused-variable]
avx-768.c:   847 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx-768.c:       |                                   ^~
avx-768.c: avx-768.c:847:38: warning: unused variable 'h1' [-Wunused-variable]
avx-768.c: ...
avx.c: avx.c:23:21: warning: unused function 'mullo' [-Wunused-function]
avx.c:    23 | static inline int16 mullo(int16 x,int16 y)
avx.c:       |                     ^~~~~
avx.c: avx.c:28:21: warning: unused function 'mulhi' [-Wunused-function]
avx.c:    28 | static inline int16 mulhi(int16 x,int16 y)
avx.c:       |                     ^~~~~
avx.c: avx.c:33:21: warning: unused function 'mulhrs' [-Wunused-function]
avx.c:    33 | static inline int16 mulhrs(int16 x,int16 y)
avx.c:       |                     ^~~~~~
avx.c: 3 warnings generated.
avx3-512.c: avx3-512.c:935:35: warning: unused variable 'h0' [-Wunused-variable]
avx3-512.c:   935 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx3-512.c:       |                                   ^~
avx3-512.c: avx3-512.c:935:38: warning: unused variable 'h1' [-Wunused-variable]
avx3-512.c:   935 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx3-512.c:       |                                      ^~
avx3-512.c: avx3-512.c:935:41: warning: unused variable 'h2' [-Wunused-variable]
avx3-512.c:   935 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx3-512.c:       |                                         ^~
avx3-512.c: avx3-512.c:935:44: warning: unused variable 'h3' [-Wunused-variable]
avx3-512.c:   935 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx3-512.c:       |                                            ^~
avx3-512.c: avx3-512.c:978:35: warning: unused variable 'h0' [-Wunused-variable]
avx3-512.c:   978 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx3-512.c:       |                                   ^~
avx3-512.c: avx3-512.c:978:38: warning: unused variable 'h1' [-Wunused-variable]
avx3-512.c:   978 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx3-512.c:       |                                      ^~
avx3-512.c: avx3-512.c:978:41: warning: unused variable 'h2' [-Wunused-variable]
avx3-512.c:   978 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx3-512.c:       |                                         ^~
avx3-512.c: avx3-512.c:978:44: warning: unused variable 'h3' [-Wunused-variable]
avx3-512.c:   978 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx3-512.c:       |                                            ^~
avx3-512.c: avx3-512.c:1055:35: warning: unused variable 'h0' [-Wunused-variable]
avx3-512.c: ...
jump32divsteps.c: jump32divsteps.c:14:8: warning: unused variable 'f0' [-Wunused-variable]
jump32divsteps.c:    14 |   modq f0, g0;
jump32divsteps.c:       |        ^~
jump32divsteps.c: jump32divsteps.c:14:12: warning: unused variable 'g0' [-Wunused-variable]
jump32divsteps.c:    14 |   modq f0, g0;
jump32divsteps.c:       |            ^~
jump32divsteps.c: jump32divsteps.c:18:25: warning: unused variable 'mask' [-Wunused-variable]
jump32divsteps.c:    18 |   __m256i minusdelta_v, mask, temp, gx, ux, uy, vx, vy, qx, rx;
jump32divsteps.c:       |                         ^~~~
jump32divsteps.c: jump32divsteps.c:18:37: warning: unused variable 'gx' [-Wunused-variable]
jump32divsteps.c:    18 |   __m256i minusdelta_v, mask, temp, gx, ux, uy, vx, vy, qx, rx;
jump32divsteps.c:       |                                     ^~
jump32divsteps.c: jump32divsteps.c:18:45: warning: unused variable 'uy' [-Wunused-variable]
jump32divsteps.c:    18 |   __m256i minusdelta_v, mask, temp, gx, ux, uy, vx, vy, qx, rx;
jump32divsteps.c:       |                                             ^~
jump32divsteps.c: jump32divsteps.c:18:53: warning: unused variable 'vy' [-Wunused-variable]
jump32divsteps.c:    18 |   __m256i minusdelta_v, mask, temp, gx, ux, uy, vx, vy, qx, rx;
jump32divsteps.c:       |                                                     ^~
jump32divsteps.c: jump32divsteps.c:21:11: warning: unused variable 'delta_v' [-Wunused-variable]
jump32divsteps.c:    21 |   __m256i delta_v = _mm256_set1_epi16((int16_t)delta);
jump32divsteps.c:       |           ^~~~~~~
jump32divsteps.c: jump32divsteps.c:115:25: warning: unused variable 'mask' [-Wunused-variable]
jump32divsteps.c:   115 |   __m256i minusdelta_v, mask, temp, gx, ux, uy, vx, vy, qx, rx;
jump32divsteps.c:       |                         ^~~~
jump32divsteps.c: jump32divsteps.c:115:37: warning: unused variable 'gx' [-Wunused-variable]
jump32divsteps.c: ...
jumpNdivsteps.c: jumpNdivsteps.c:23:11: warning: unused variable 'mask' [-Wunused-variable]
jumpNdivsteps.c:    23 |   __m256i mask, mask1, temp;
jumpNdivsteps.c:       |           ^~~~
jumpNdivsteps.c: jumpNdivsteps.c:23:17: warning: unused variable 'mask1' [-Wunused-variable]
jumpNdivsteps.c:    23 |   __m256i mask, mask1, temp;
jumpNdivsteps.c:       |                 ^~~~~
jumpNdivsteps.c: jumpNdivsteps.c:23:24: warning: unused variable 'temp' [-Wunused-variable]
jumpNdivsteps.c:    23 |   __m256i mask, mask1, temp;
jumpNdivsteps.c:       |                        ^~~~
jumpNdivsteps.c: jumpNdivsteps.c:61:11: warning: unused variable 'mask' [-Wunused-variable]
jumpNdivsteps.c:    61 |   __m256i mask,mask1,temp;
jumpNdivsteps.c:       |           ^~~~
jumpNdivsteps.c: jumpNdivsteps.c:61:16: warning: unused variable 'mask1' [-Wunused-variable]
jumpNdivsteps.c:    61 |   __m256i mask,mask1,temp;
jumpNdivsteps.c:       |                ^~~~~
jumpNdivsteps.c: jumpNdivsteps.c:61:22: warning: unused variable 'temp' [-Wunused-variable]
jumpNdivsteps.c:    61 |   __m256i mask,mask1,temp;
jumpNdivsteps.c:       |                      ^~~~
jumpNdivsteps.c: jumpNdivsteps.c:62:7: warning: unused variable 'i' [-Wunused-variable]
jumpNdivsteps.c:    62 |   int i;
jumpNdivsteps.c:       |       ^
jumpNdivsteps.c: jumpNdivsteps.c:133:11: warning: unused variable 'mask' [-Wunused-variable]
jumpNdivsteps.c:   133 |   __m256i mask, mask1, temp;
jumpNdivsteps.c:       |           ^~~~
jumpNdivsteps.c: jumpNdivsteps.c:133:17: warning: unused variable 'mask1' [-Wunused-variable]
jumpNdivsteps.c: ...
polymul_ntt.c: In file included from polymul_ntt.c:12:
polymul_ntt.c: ./crt3modq.h:158:12: warning: unused function 'crt3__inside' [-Wunused-function]
polymul_ntt.c:   158 | static int crt3__inside(short u1, short u2, short u3) {
polymul_ntt.c:       |            ^~~~~~~~~~~~
polymul_ntt.c: polymul_ntt.c:166:13: warning: unused function 'polymulntt512_256x256_test' [-Wunused-function]
polymul_ntt.c:   166 | static void polymulntt512_256x256_test(__m256i *h, __m256i *f, __m256i *g)
polymul_ntt.c:       |             ^~~~~~~~~~~~~~~~~~~~~~~~~~
polymul_ntt.c: polymul_ntt.c:324:13: warning: unused function 'polymulntt768_256x256_test' [-Wunused-function]
polymul_ntt.c:   324 | static void polymulntt768_256x256_test(__m256i *h, __m256i *f, __m256i *g)
polymul_ntt.c:       |             ^~~~~~~~~~~~~~~~~~~~~~~~~~
polymul_ntt.c: 3 warnings generated.

Number of similar (implementation,compiler) pairs: 4, namely:

Implementation	Compiler
`jumpdivsteps`	`clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`jumpdivsteps`	`clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`jumpdivsteps`	`clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`jumpdivsteps`	`clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`

Compiler output

avx-768.c: avx-768.c:544:36: error: invalid output size for constraint '+x'
avx-768.c:   544 |   __asm__("vpsubw %1,%0,%0" : "+x"(a),"+x"(b));
avx-768.c:       |                                    ^
avx-768.c: avx-768.c:550:36: error: invalid output size for constraint '+x'
avx-768.c:   550 |   __asm__("vpaddw %1,%0,%0" : "+x"(a),"+x"(b));
avx-768.c:       |                                    ^
avx-768.c: avx-768.c:586:35: warning: unused variable 'h0' [-Wunused-variable]
avx-768.c:   586 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx-768.c:       |                                   ^~
avx-768.c: avx-768.c:586:41: warning: unused variable 'h2' [-Wunused-variable]
avx-768.c:   586 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx-768.c:       |                                         ^~
avx-768.c: avx-768.c:586:44: warning: unused variable 'h3' [-Wunused-variable]
avx-768.c:   586 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx-768.c:       |                                            ^~
avx-768.c: avx-768.c:716:35: warning: unused variable 'h0' [-Wunused-variable]
avx-768.c:   716 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx-768.c:       |                                   ^~
avx-768.c: avx-768.c:716:38: warning: unused variable 'h1' [-Wunused-variable]
avx-768.c:   716 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx-768.c:       |                                      ^~
avx-768.c: avx-768.c:716:41: warning: unused variable 'h2' [-Wunused-variable]
avx-768.c:   716 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx-768.c:       |                                         ^~
avx-768.c: avx-768.c:716:44: warning: unused variable 'h3' [-Wunused-variable]
avx-768.c: ...

Number of similar (implementation,compiler) pairs: 1, namely:

Implementation	Compiler
`jumpdivsteps`	`clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`

Compiler output

avx-768.c: avx-768.c: In function 'ntt64':
avx-768.c: avx-768.c:586:44: warning: unused variable 'h3' [-Wunused-variable]
avx-768.c:   586 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx-768.c:       |                                            ^~
avx-768.c: avx-768.c:586:41: warning: unused variable 'h2' [-Wunused-variable]
avx-768.c:   586 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx-768.c:       |                                         ^~
avx-768.c: avx-768.c:586:35: warning: unused variable 'h0' [-Wunused-variable]
avx-768.c:   586 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx-768.c:       |                                   ^~
avx-768.c: avx-768.c: In function 'invntt64':
avx-768.c: avx-768.c:716:44: warning: unused variable 'h3' [-Wunused-variable]
avx-768.c:   716 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx-768.c:       |                                            ^~
avx-768.c: avx-768.c:716:41: warning: unused variable 'h2' [-Wunused-variable]
avx-768.c:   716 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx-768.c:       |                                         ^~
avx-768.c: avx-768.c:716:38: warning: unused variable 'h1' [-Wunused-variable]
avx-768.c:   716 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx-768.c:       |                                      ^~
avx-768.c: avx-768.c:716:35: warning: unused variable 'h0' [-Wunused-variable]
avx-768.c:   716 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx-768.c:       |                                   ^~
avx-768.c: avx-768.c: In function 'ntt256':
avx-768.c: avx-768.c:847:44: warning: unused variable 'h3' [-Wunused-variable]
avx-768.c: ...
avx.c: avx.c:106:42: warning: argument 1 of type 'int16x16[1]' {aka '__m256i[1]'} with mismatched bound [-Warray-parameter=]
avx.c:   106 | inline void mult1_over65536_x16(int16x16 h[1],const int16x16 f[1],const int16x16 g[1])
avx.c:       |                                 ~~~~~~~~~^~~~
avx.c: In file included from avx.c:5:
avx.c: avx.h:43:33: note: previously declared as 'int16x16 *' {aka '__m256i *'}
avx.c:    43 | extern void mult1_over65536_x16(int16x16 *,const int16x16 *,const int16x16 *);
avx.c:       |                                 ^~~~~~~~~~
avx.c: avx.c:106:62: warning: argument 2 of type 'const int16x16[1]' {aka 'const __m256i[1]'} with mismatched bound [-Warray-parameter=]
avx.c:   106 | inline void mult1_over65536_x16(int16x16 h[1],const int16x16 f[1],const int16x16 g[1])
avx.c:       |                                               ~~~~~~~~~~~~~~~^~~~
avx.c: avx.h:43:44: note: previously declared as 'const int16x16 *' {aka 'const __m256i *'}
avx.c:    43 | extern void mult1_over65536_x16(int16x16 *,const int16x16 *,const int16x16 *);
avx.c:       |                                            ^~~~~~~~~~~~~~~~
avx.c: avx.c:106:82: warning: argument 3 of type 'const int16x16[1]' {aka 'const __m256i[1]'} with mismatched bound [-Warray-parameter=]
avx.c:   106 | inline void mult1_over65536_x16(int16x16 h[1],const int16x16 f[1],const int16x16 g[1])
avx.c:       |                                                                   ~~~~~~~~~~~~~~~^~~~
avx.c: avx.h:43:61: note: previously declared as 'const int16x16 *' {aka 'const __m256i *'}
avx.c:    43 | extern void mult1_over65536_x16(int16x16 *,const int16x16 *,const int16x16 *);
avx.c:       |                                                             ^~~~~~~~~~~~~~~~
avx.c: avx.c:128:42: warning: argument 1 of type 'int16x16[3]' {aka '__m256i[3]'} with mismatched bound [-Warray-parameter=]
avx.c:   128 | inline void mult2_over65536_x16(int16x16 h[3],const int16x16 f[2],const int16x16 g[2])
avx.c:       |                                 ~~~~~~~~~^~~~
avx.c: avx.h:44:33: note: previously declared as 'int16x16 *' {aka '__m256i *'}
avx.c:    44 | extern void mult2_over65536_x16(int16x16 *,const int16x16 *,const int16x16 *);
avx.c:       |                                 ^~~~~~~~~~
avx.c: ...
avx3-512.c: avx3-512.c: In function 'ntt512':
avx3-512.c: avx3-512.c:935:44: warning: unused variable 'h3' [-Wunused-variable]
avx3-512.c:   935 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx3-512.c:       |                                            ^~
avx3-512.c: avx3-512.c:935:41: warning: unused variable 'h2' [-Wunused-variable]
avx3-512.c:   935 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx3-512.c:       |                                         ^~
avx3-512.c: avx3-512.c:935:38: warning: unused variable 'h1' [-Wunused-variable]
avx3-512.c:   935 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx3-512.c:       |                                      ^~
avx3-512.c: avx3-512.c:935:35: warning: unused variable 'h0' [-Wunused-variable]
avx3-512.c:   935 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx3-512.c:       |                                   ^~
avx3-512.c: avx3-512.c: In function 'ntt512_256':
avx3-512.c: avx3-512.c:978:44: warning: unused variable 'h3' [-Wunused-variable]
avx3-512.c:   978 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx3-512.c:       |                                            ^~
avx3-512.c: avx3-512.c:978:41: warning: unused variable 'h2' [-Wunused-variable]
avx3-512.c:   978 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx3-512.c:       |                                         ^~
avx3-512.c: avx3-512.c:978:38: warning: unused variable 'h1' [-Wunused-variable]
avx3-512.c:   978 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx3-512.c:       |                                      ^~
avx3-512.c: avx3-512.c:978:35: warning: unused variable 'h0' [-Wunused-variable]
avx3-512.c:   978 |   __m256i f0,f1,f2,f3,g0,g1,g2,g3,h0,h1,h2,h3;
avx3-512.c: ...
jump32divsteps.c: jump32divsteps.c: In function 'crypto_core_invsntrup761_jumpdivsteps_constbranchindex_jump32divsteps':
jump32divsteps.c: jump32divsteps.c:21:11: warning: unused variable 'delta_v' [-Wunused-variable]
jump32divsteps.c:    21 |   __m256i delta_v = _mm256_set1_epi16((int16_t)delta);
jump32divsteps.c:       |           ^~~~~~~
jump32divsteps.c: jump32divsteps.c:18:53: warning: unused variable 'vy' [-Wunused-variable]
jump32divsteps.c:    18 |   __m256i minusdelta_v, mask, temp, gx, ux, uy, vx, vy, qx, rx;
jump32divsteps.c:       |                                                     ^~
jump32divsteps.c: jump32divsteps.c:18:45: warning: unused variable 'uy' [-Wunused-variable]
jump32divsteps.c:    18 |   __m256i minusdelta_v, mask, temp, gx, ux, uy, vx, vy, qx, rx;
jump32divsteps.c:       |                                             ^~
jump32divsteps.c: jump32divsteps.c:18:37: warning: unused variable 'gx' [-Wunused-variable]
jump32divsteps.c:    18 |   __m256i minusdelta_v, mask, temp, gx, ux, uy, vx, vy, qx, rx;
jump32divsteps.c:       |                                     ^~
jump32divsteps.c: jump32divsteps.c:18:25: warning: unused variable 'mask' [-Wunused-variable]
jump32divsteps.c:    18 |   __m256i minusdelta_v, mask, temp, gx, ux, uy, vx, vy, qx, rx;
jump32divsteps.c:       |                         ^~~~
jump32divsteps.c: jump32divsteps.c:14:12: warning: unused variable 'g0' [-Wunused-variable]
jump32divsteps.c:    14 |   modq f0, g0;
jump32divsteps.c:       |            ^~
jump32divsteps.c: jump32divsteps.c:14:8: warning: unused variable 'f0' [-Wunused-variable]
jump32divsteps.c:    14 |   modq f0, g0;
jump32divsteps.c:       |        ^~
jump32divsteps.c: jump32divsteps.c: In function 'crypto_core_invsntrup761_jumpdivsteps_constbranchindex_jump32xdivsteps':
jump32divsteps.c: jump32divsteps.c:115:53: warning: unused variable 'vy' [-Wunused-variable]
jump32divsteps.c:   115 |   __m256i minusdelta_v, mask, temp, gx, ux, uy, vx, vy, qx, rx;
jump32divsteps.c: ...
jumpNdivsteps.c: jumpNdivsteps.c: In function 'crypto_core_invsntrup761_jumpdivsteps_constbranchindex_jump64divsteps':
jumpNdivsteps.c: jumpNdivsteps.c:23:24: warning: unused variable 'temp' [-Wunused-variable]
jumpNdivsteps.c:    23 |   __m256i mask, mask1, temp;
jumpNdivsteps.c:       |                        ^~~~
jumpNdivsteps.c: jumpNdivsteps.c:23:17: warning: unused variable 'mask1' [-Wunused-variable]
jumpNdivsteps.c:    23 |   __m256i mask, mask1, temp;
jumpNdivsteps.c:       |                 ^~~~~
jumpNdivsteps.c: jumpNdivsteps.c:23:11: warning: unused variable 'mask' [-Wunused-variable]
jumpNdivsteps.c:    23 |   __m256i mask, mask1, temp;
jumpNdivsteps.c:       |           ^~~~
jumpNdivsteps.c: jumpNdivsteps.c: In function 'crypto_core_invsntrup761_jumpdivsteps_constbranchindex_jump64xdivsteps':
jumpNdivsteps.c: jumpNdivsteps.c:62:7: warning: unused variable 'i' [-Wunused-variable]
jumpNdivsteps.c:    62 |   int i;
jumpNdivsteps.c:       |       ^
jumpNdivsteps.c: jumpNdivsteps.c:61:22: warning: unused variable 'temp' [-Wunused-variable]
jumpNdivsteps.c:    61 |   __m256i mask,mask1,temp;
jumpNdivsteps.c:       |                      ^~~~
jumpNdivsteps.c: jumpNdivsteps.c:61:16: warning: unused variable 'mask1' [-Wunused-variable]
jumpNdivsteps.c:    61 |   __m256i mask,mask1,temp;
jumpNdivsteps.c:       |                ^~~~~
jumpNdivsteps.c: jumpNdivsteps.c:61:11: warning: unused variable 'mask' [-Wunused-variable]
jumpNdivsteps.c:    61 |   __m256i mask,mask1,temp;
jumpNdivsteps.c:       |           ^~~~
jumpNdivsteps.c: jumpNdivsteps.c: In function 'crypto_core_invsntrup761_jumpdivsteps_constbranchindex_jump128divsteps':
jumpNdivsteps.c: jumpNdivsteps.c:133:24: warning: unused variable 'temp' [-Wunused-variable]
jumpNdivsteps.c: ...
polymul_ntt.c: polymul_ntt.c:324:13: warning: 'polymulntt768_256x256_test' defined but not used [-Wunused-function]
polymul_ntt.c:   324 | static void polymulntt768_256x256_test(__m256i *h, __m256i *f, __m256i *g)
polymul_ntt.c:       |             ^~~~~~~~~~~~~~~~~~~~~~~~~~
polymul_ntt.c: polymul_ntt.c:166:13: warning: 'polymulntt512_256x256_test' defined but not used [-Wunused-function]
polymul_ntt.c:   166 | static void polymulntt512_256x256_test(__m256i *h, __m256i *f, __m256i *g)
polymul_ntt.c:       |             ^~~~~~~~~~~~~~~~~~~~~~~~~~
polymul_ntt.c: In file included from polymul_ntt.c:12:
polymul_ntt.c: crt3modq.h:158:12: warning: 'crt3__inside' defined but not used [-Wunused-function]
polymul_ntt.c:   158 | static int crt3__inside(short u1, short u2, short u3) {
polymul_ntt.c:       |            ^~~~~~~~~~~~

Number of similar (implementation,compiler) pairs: 4, namely:

Implementation	Compiler
`jumpdivsteps`	`gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`
`jumpdivsteps`	`gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`
`jumpdivsteps`	`gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`
`jumpdivsteps`	`gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`

TIMECOP error (can be valgrind bug)

error 1
VEX temporary storage exhausted.
Pool = TEMP,  start 0x597cd5e8 curr 0x59c92118 end 0x59c92127 (size 5000000)
vex: the `impossible' happened:
VEX temporary storage exhausted.
Increase N_{TEMPORARY,PERMANENT}_BYTES and recompile.
vex storage: T total 478083664 bytes allocated
vex storage: P total 512 bytes allocated
valgrind: the 'impossible' happened:
LibVEX called failure_exit().
host stacktrace:
   at 0x...: ??? (in /usr/libexec/valgrind/memcheck-amd64-linux)
   by 0x...: ??? (in /usr/libexec/valgrind/memcheck-amd64-linux)
   by 0x...: ??? (in /usr/libexec/valgrind/memcheck-amd64-linux)
   by 0x...: ??? (in /usr/libexec/valgrind/memcheck-amd64-linux)
   by 0x...: ??? (in /usr/libexec/valgrind/memcheck-amd64-linux)
   by 0x...: ??? (in /usr/libexec/valgrind/memcheck-amd64-linux)
   by 0x...: ??? (in /usr/libexec/valgrind/memcheck-amd64-linux)
   by 0x...: ??? (in /usr/libexec/valgrind/memcheck-amd64-linux)
   by 0x...: ??? (in /usr/libexec/valgrind/memcheck-amd64-linux)
   by 0x...: ??? (in /usr/libexec/valgrind/memcheck-amd64-linux)
   by 0x...: ??? (in /usr/libexec/valgrind/memcheck-amd64-linux)
   by 0x...: ??? (in /usr/libexec/valgrind/memcheck-amd64-linux)
   by 0x...: ??? (in /usr/libexec/valgrind/memcheck-amd64-linux)
   by 0x...: ??? (in /usr/libexec/valgrind/memcheck-amd64-linux)
sched status:
running_tid=1
Thread 1: status = VgTs_Runnable (lwpid 3941940)
   at 0x...: _mm256_mullo_epi16 (avx2intrin.h:555)
   by 0x...: barrett_fake (gf4591_avx2.h:58)
   by 0x...: crypto_core_invsntrup761_jumpdivsteps_constbranchindex_gf_polymul_128x128_avx2_divR (polymul_NxN.c:431)
   by 0x...: crypto_core_invsntrup761_jumpdivsteps_constbranchindex_gf_polymul_128x128_avx2_uvqr_fg (polymul_NxN.c:537)
   by 0x...: crypto_core_invsntrup761_jumpdivsteps_constbranchindex_jump256divsteps (jumpNdivsteps.c:278)
   by 0x...: crypto_core_invsntrup761_jumpdivsteps_constbranchindex_rq_recip3 (rq_recip3_new.c:200)
   by 0x...: crypto_core_invsntrup761_jumpdivsteps_constbranchindex (wrapper.c:10)
   by 0x...: test (try.c:106)
   by 0x...: main (try-anything.c:345)
client stack range: [0x1FFEFEE000 0x1FFF000FFF] client SP: 0x1FFEFEEC00
valgrind stack range: [0x10090AE000 0x10091ADFFF] top usage: 18232 of 1048576
Note: see also the FAQ in the source distribution.
It contains workarounds to several common problems.
In particular, if Valgrind aborted or crashed after
identifying problems in your program, there's a good chance
that fixing those problems will prevent Valgrind aborting or
crashing, especially if it happened in m_mallocfree.c.
If that doesn't help, please report this bug to: www.valgrind.org
In the bug report, send all the above text, the valgrind
version, and what OS and version you are using.  Thanks.

Number of similar (implementation,compiler) pairs: 1, namely:

Implementation	Compiler
`jumpdivsteps`	`gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`

Passed TIMECOP

TIMECOP iterations: 1

Number of similar (implementation,compiler) pairs: 33, namely:

Implementation	Compiler
`avx`	`clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`avx`	`clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`avx`	`clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`avx`	`clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`avx`	`gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`
`avx`	`gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`
`avx`	`gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`
`avx`	`gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`
`jumpdivsteps`	`clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`jumpdivsteps`	`clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`jumpdivsteps`	`clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`jumpdivsteps`	`clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`jumpdivsteps`	`gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`
`jumpdivsteps`	`gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`
`jumpdivsteps`	`gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`
`portable`	`clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`portable`	`clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`portable`	`clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`portable`	`clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`portable`	`clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`portable`	`gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`
`portable`	`gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`
`portable`	`gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`
`portable`	`gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`
`ref`	`clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`ref`	`clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`ref`	`clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`ref`	`clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`ref`	`clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`ref`	`gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`
`ref`	`gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`
`ref`	`gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`
`ref`	`gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`