Implementation notes: x86, titan0, crypto_core/inv3sntrup653

Computer: titan0
Architecture: x86
CPU ID: GenuineIntel-000306c3-bfebfbff
SUPERCOP version: 20190803
Operation: crypto_core
Primitive: inv3sntrup653
TimeImplementationCompilerBenchmark dateSUPERCOP version
3077232refgcc -m32 -march=core-avx2 -O3 -fomit-frame-pointer2019073020190729
3130024refgcc -m32 -march=native -mtune=native -O3 -fomit-frame-pointer2019073020190729
4011360refgcc -m32 -march=corei7-avx -O3 -fomit-frame-pointer2019073020190729
4022132refgcc -m32 -march=core-avx-i -O3 -fomit-frame-pointer2019073020190729
4042676refgcc -m32 -march=core2 -msse4 -O3 -fomit-frame-pointer2019073020190729
4049860refgcc -m32 -march=corei7 -O3 -fomit-frame-pointer2019073020190729
4060732refgcc -m32 -march=core2 -msse4.1 -O3 -fomit-frame-pointer2019073020190729
4073456refgcc -m32 -march=core2 -O3 -fomit-frame-pointer2019073020190729
5049880refgcc -funroll-loops -m32 -march=pentium4 -O3 -fomit-frame-pointer2019073020190729
5059636refgcc -funroll-loops -m32 -march=barcelona -O3 -fomit-frame-pointer2019073020190729
5083176refgcc -m32 -march=pentium4 -O3 -fomit-frame-pointer2019073020190729
5090828refgcc -funroll-loops -m32 -march=k8 -O3 -fomit-frame-pointer2019073020190729
5102780refgcc -m32 -march=k8 -O3 -fomit-frame-pointer2019073020190729
5147864refgcc -m32 -march=barcelona -O3 -fomit-frame-pointer2019073020190729
5244656refgcc -m32 -march=pentium-m -O3 -fomit-frame-pointer2019073020190729
5255608refgcc -m32 -march=nocona -O3 -fomit-frame-pointer2019073020190729
5256632refgcc -funroll-loops -m32 -march=prescott -O3 -fomit-frame-pointer2019073020190729
5257052refgcc -m32 -march=prescott -O3 -fomit-frame-pointer2019073020190729
5262460refgcc -funroll-loops -m32 -march=pentium-m -O3 -fomit-frame-pointer2019073020190729
5282480refgcc -funroll-loops -m32 -march=nocona -O3 -fomit-frame-pointer2019073020190729
20909668refgcc -m32 -march=barcelona -O2 -fomit-frame-pointer2019073020190729
20961884refgcc -m32 -march=k8 -O2 -fomit-frame-pointer2019073020190729
25687896refgcc -m32 -march=athlon -O3 -fomit-frame-pointer2019073020190729
26719836refgcc -funroll-loops -m32 -march=k8 -O2 -fomit-frame-pointer2019073020190729
26823996refgcc -funroll-loops -m32 -march=barcelona -O2 -fomit-frame-pointer2019073020190729
26833280refgcc -m32 -march=k6-2 -O2 -fomit-frame-pointer2019073020190729
26839456refgcc -m32 -march=k6 -O2 -fomit-frame-pointer2019073020190729
26842212refgcc -m32 -march=k6-3 -O2 -fomit-frame-pointer2019073020190729
27576432refgcc -funroll-loops -m32 -march=pentium4 -O2 -fomit-frame-pointer2019073020190729
27595312refgcc -funroll-loops -m32 -march=k6-2 -Os -fomit-frame-pointer2019073020190729
27597288refgcc -funroll-loops -m32 -march=k6 -Os -fomit-frame-pointer2019073020190729
27597488refgcc -funroll-loops -m32 -march=k6-3 -Os -fomit-frame-pointer2019073020190729
27698252refgcc -funroll-loops -m32 -march=pentium4 -O -fomit-frame-pointer2019073020190729
28015316refgcc -funroll-loops -m32 -march=athlon -O3 -fomit-frame-pointer2019073020190729
28610924refgcc -funroll-loops -m32 -march=i386 -Os -fomit-frame-pointer2019073020190729
28611804refgcc -funroll-loops -m32 -march=i486 -Os -fomit-frame-pointer2019073020190729
28810952refgcc -m32 -march=k6 -O3 -fomit-frame-pointer2019073020190729
28811756refgcc -m32 -march=k6-2 -O3 -fomit-frame-pointer2019073020190729
28821504refgcc -m32 -march=k6-3 -O3 -fomit-frame-pointer2019073020190729
28892288refgcc -funroll-loops -m32 -march=athlon -O -fomit-frame-pointer2019073020190729
28978588refgcc -m32 -march=pentium4 -O -fomit-frame-pointer2019073020190729
28978648refgcc -m32 -O3 -fomit-frame-pointer2019073020190729
29173644refgcc -funroll-loops -m32 -O3 -fomit-frame-pointer2019073020190729
29369912refgcc -funroll-loops -m32 -O -fomit-frame-pointer2019073020190729
29529188refgcc -m32 -march=core-avx2 -Os -fomit-frame-pointer2019073020190729
29529284refgcc -m32 -march=native -mtune=native -Os -fomit-frame-pointer2019073020190729
29538616refgcc -m32 -march=core-avx-i -Os -fomit-frame-pointer2019073020190729
29541752refgcc -m32 -march=corei7-avx -Os -fomit-frame-pointer2019073020190729
29544208refgcc -m32 -march=core2 -Os -fomit-frame-pointer2019073020190729
29583864refgcc -m32 -march=core2 -msse4.1 -Os -fomit-frame-pointer2019073020190729
29591404refgcc -m32 -march=corei7 -Os -fomit-frame-pointer2019073020190729
29591984refgcc -m32 -march=core2 -msse4 -Os -fomit-frame-pointer2019073020190729
29618604refgcc -m32 -march=i486 -Os -fomit-frame-pointer2019073020190729
29621208refgcc -m32 -march=prescott -Os -fomit-frame-pointer2019073020190729
29623748refgcc -m32 -march=i386 -Os -fomit-frame-pointer2019073020190729
29629148refgcc -m32 -march=nocona -Os -fomit-frame-pointer2019073020190729
29636796refgcc -m32 -march=pentium4 -Os -fomit-frame-pointer2019073020190729
29856300refgcc -m32 -march=k8 -Os -fomit-frame-pointer2019073020190729
29867120refgcc -m32 -Os -fomit-frame-pointer2019073020190729
29873984refgcc -m32 -march=barcelona -Os -fomit-frame-pointer2019073020190729
29883228refgcc -m32 -march=athlon -Os -fomit-frame-pointer2019073020190729
30001112refgcc -m32 -march=pentium -Os -fomit-frame-pointer2019073020190729
30005608refgcc -m32 -march=pentium-mmx -Os -fomit-frame-pointer2019073020190729
30120872refgcc -m32 -march=core-avx2 -O -fomit-frame-pointer2019073020190729
30127124refgcc -m32 -march=native -mtune=native -O -fomit-frame-pointer2019073020190729
30151080refgcc -m32 -O -fomit-frame-pointer2019073020190729
30171888refgcc -m32 -march=core2 -msse4.1 -O -fomit-frame-pointer2019073020190729
30174796refgcc -m32 -march=core-avx-i -O -fomit-frame-pointer2019073020190729
30176424refgcc -m32 -march=core2 -msse4 -O -fomit-frame-pointer2019073020190729
30177856refgcc -m32 -march=corei7-avx -O -fomit-frame-pointer2019073020190729
30195584refgcc -m32 -march=core2 -O -fomit-frame-pointer2019073020190729
30229332refgcc -m32 -march=corei7 -O -fomit-frame-pointer2019073020190729
30230908refgcc -funroll-loops -m32 -march=athlon -O2 -fomit-frame-pointer2019073020190729
30837472refgcc -m32 -march=athlon -O2 -fomit-frame-pointer2019073020190729
30968528refgcc -funroll-loops -m32 -march=pentium-m -O -fomit-frame-pointer2019073020190729
31031144refgcc -funroll-loops -m32 -march=pentium3 -O -fomit-frame-pointer2019073020190729
31032140refgcc -funroll-loops -m32 -march=pentiumpro -O -fomit-frame-pointer2019073020190729
31032596refgcc -funroll-loops -m32 -march=pentium2 -O -fomit-frame-pointer2019073020190729
31299424refgcc -m32 -march=pentium4 -O2 -fomit-frame-pointer2019073020190729
31804344refgcc -m32 -march=athlon -O -fomit-frame-pointer2019073020190729
31959796refgcc -funroll-loops -m32 -march=barcelona -O -fomit-frame-pointer2019073020190729
32039480refgcc -funroll-loops -m32 -march=k8 -O -fomit-frame-pointer2019073020190729
32055436refgcc -m32 -march=core-avx-i -O2 -fomit-frame-pointer2019073020190729
32084056refgcc -m32 -march=corei7-avx -O2 -fomit-frame-pointer2019073020190729
32090324refgcc -m32 -march=corei7 -O2 -fomit-frame-pointer2019073020190729
32091648refgcc -m32 -march=core2 -msse4 -O2 -fomit-frame-pointer2019073020190729
32099712refgcc -m32 -march=native -mtune=native -O2 -fomit-frame-pointer2019073020190729
32103264refgcc -m32 -O2 -fomit-frame-pointer2019073020190729
32163200refgcc -m32 -march=core2 -msse4.1 -O2 -fomit-frame-pointer2019073020190729
32171456refgcc -m32 -march=core2 -O2 -fomit-frame-pointer2019073020190729
32172088refgcc -m32 -march=core-avx2 -O2 -fomit-frame-pointer2019073020190729
32181748refgcc -funroll-loops -m32 -march=pentium-m -O2 -fomit-frame-pointer2019073020190729
32233872refgcc -m32 -march=pentium-mmx -O3 -fomit-frame-pointer2019073020190729
32235528refgcc -m32 -march=pentium -O3 -fomit-frame-pointer2019073020190729
32351732refgcc -funroll-loops -m32 -march=pentiumpro -O2 -fomit-frame-pointer2019073020190729
32379664refgcc -funroll-loops -m32 -march=pentium3 -O2 -fomit-frame-pointer2019073020190729
32395780refgcc -funroll-loops -m32 -march=pentium2 -O2 -fomit-frame-pointer2019073020190729
32554756refgcc -funroll-loops -m32 -march=nocona -O2 -fomit-frame-pointer2019073020190729
32562172refgcc -funroll-loops -m32 -march=prescott -O2 -fomit-frame-pointer2019073020190729
32750240refgcc -funroll-loops -m32 -O2 -fomit-frame-pointer2019073020190729
32959908refgcc -m32 -march=pentium2 -Os -fomit-frame-pointer2019073020190729
32961060refgcc -m32 -march=pentium-m -Os -fomit-frame-pointer2019073020190729
32976864refgcc -m32 -march=pentium3 -Os -fomit-frame-pointer2019073020190729
32977088refgcc -m32 -march=pentiumpro -Os -fomit-frame-pointer2019073020190729
33367596refgcc -funroll-loops -m32 -march=k6-3 -O3 -fomit-frame-pointer2019073020190729
33375576refgcc -funroll-loops -m32 -march=k6 -O3 -fomit-frame-pointer2019073020190729
33384656refgcc -funroll-loops -m32 -march=k6-2 -O3 -fomit-frame-pointer2019073020190729
33435044refgcc -m32 -march=barcelona -O -fomit-frame-pointer2019073020190729
33488980refgcc -m32 -march=k8 -O -fomit-frame-pointer2019073020190729
33601704refgcc -funroll-loops -m32 -march=nocona -O -fomit-frame-pointer2019073020190729
33604444refgcc -funroll-loops -m32 -march=prescott -O -fomit-frame-pointer2019073020190729
33973480refgcc -m32 -march=pentiumpro -O3 -fomit-frame-pointer2019073020190729
33991084refgcc -m32 -march=pentium2 -O3 -fomit-frame-pointer2019073020190729
34004772refgcc -m32 -march=pentium3 -O3 -fomit-frame-pointer2019073020190729
34671188refgcc -funroll-loops -m32 -march=pentium -O3 -fomit-frame-pointer2019073020190729
34678856refgcc -funroll-loops -m32 -march=pentium-mmx -O3 -fomit-frame-pointer2019073020190729
35181924refgcc -funroll-loops -m32 -march=pentium -O2 -fomit-frame-pointer2019073020190729
35183116refgcc -funroll-loops -m32 -march=pentium-mmx -O2 -fomit-frame-pointer2019073020190729
35377268refgcc -m32 -march=pentium-m -O -fomit-frame-pointer2019073020190729
35463620refgcc -m32 -march=pentium3 -O -fomit-frame-pointer2019073020190729
35469324refgcc -m32 -march=pentium2 -O -fomit-frame-pointer2019073020190729
35470752refgcc -m32 -march=pentiumpro -O -fomit-frame-pointer2019073020190729
35557472refgcc -funroll-loops -m32 -march=pentium2 -O3 -fomit-frame-pointer2019073020190729
35568736refgcc -funroll-loops -m32 -march=pentiumpro -O3 -fomit-frame-pointer2019073020190729
35651992refgcc -funroll-loops -m32 -march=pentium3 -O3 -fomit-frame-pointer2019073020190729
35905692refgcc -funroll-loops -m32 -march=k6 -O2 -fomit-frame-pointer2019073020190729
35906524refgcc -funroll-loops -m32 -march=k6-3 -O2 -fomit-frame-pointer2019073020190729
35907804refgcc -funroll-loops -m32 -march=k6-2 -O2 -fomit-frame-pointer2019073020190729
36528756refgcc -funroll-loops -m32 -march=i486 -O3 -fomit-frame-pointer2019073020190729
36531276refgcc -funroll-loops -m32 -march=i386 -O3 -fomit-frame-pointer2019073020190729
36987304refgcc -m32 -march=prescott -O -fomit-frame-pointer2019073020190729
36995900refgcc -m32 -march=nocona -O -fomit-frame-pointer2019073020190729
37065612refgcc -funroll-loops -m32 -march=pentium4 -Os -fomit-frame-pointer2019073020190729
37067036refgcc -funroll-loops -m32 -march=nocona -Os -fomit-frame-pointer2019073020190729
37069188refgcc -funroll-loops -m32 -march=prescott -Os -fomit-frame-pointer2019073020190729
37168076refgcc -m32 -march=pentium2 -O2 -fomit-frame-pointer2019073020190729
37169920refgcc -m32 -march=pentium3 -O2 -fomit-frame-pointer2019073020190729
37171288refgcc -m32 -march=pentiumpro -O2 -fomit-frame-pointer2019073020190729
37189212refgcc -m32 -march=pentium-mmx -O -fomit-frame-pointer2019073020190729
37189944refgcc -m32 -march=pentium -O -fomit-frame-pointer2019073020190729
37194492refgcc -funroll-loops -m32 -march=k6-2 -O -fomit-frame-pointer2019073020190729
37196372refgcc -funroll-loops -m32 -march=k6-3 -O -fomit-frame-pointer2019073020190729
37197908refgcc -funroll-loops -m32 -march=k6 -O -fomit-frame-pointer2019073020190729
37204956refgcc -m32 -march=pentium-m -O2 -fomit-frame-pointer2019073020190729
37387708refgcc -funroll-loops -m32 -march=athlon -Os -fomit-frame-pointer2019073020190729
37388224refgcc -funroll-loops -m32 -march=barcelona -Os -fomit-frame-pointer2019073020190729
37391728refgcc -funroll-loops -m32 -march=k8 -Os -fomit-frame-pointer2019073020190729
37398256refgcc -funroll-loops -m32 -Os -fomit-frame-pointer2019073020190729
37558928refgcc -funroll-loops -m32 -march=pentium -Os -fomit-frame-pointer2019073020190729
37559588refgcc -funroll-loops -m32 -march=pentium-mmx -Os -fomit-frame-pointer2019073020190729
37659200refgcc -m32 -march=k6 -Os -fomit-frame-pointer2019073020190729
37663232refgcc -m32 -march=k6-3 -Os -fomit-frame-pointer2019073020190729
37688248refgcc -m32 -march=k6-2 -Os -fomit-frame-pointer2019073020190729
37808388refgcc -funroll-loops -m32 -march=pentiumpro -Os -fomit-frame-pointer2019073020190729
37809648refgcc -funroll-loops -m32 -march=pentium3 -Os -fomit-frame-pointer2019073020190729
37811160refgcc -funroll-loops -m32 -march=pentium2 -Os -fomit-frame-pointer2019073020190729
37974480refgcc -funroll-loops -m32 -march=pentium-m -Os -fomit-frame-pointer2019073020190729
39084632refgcc -m32 -march=nocona -O2 -fomit-frame-pointer2019073020190729
39108736refgcc -m32 -march=prescott -O2 -fomit-frame-pointer2019073020190729
40200816refgcc -funroll-loops -m32 -march=pentium -O -fomit-frame-pointer2019073020190729
40201368refgcc -funroll-loops -m32 -march=pentium-mmx -O -fomit-frame-pointer2019073020190729
40496832refgcc -m32 -march=k6-3 -O -fomit-frame-pointer2019073020190729
40505240refgcc -m32 -march=k6-2 -O -fomit-frame-pointer2019073020190729
40507304refgcc -m32 -march=k6 -O -fomit-frame-pointer2019073020190729
41476848refgcc -m32 -march=pentium-mmx -O2 -fomit-frame-pointer2019073020190729
41478412refgcc -m32 -march=pentium -O2 -fomit-frame-pointer2019073020190729
41520828refgcc -funroll-loops -m32 -march=i386 -O -fomit-frame-pointer2019073020190729
41525256refgcc -funroll-loops -m32 -march=i486 -O -fomit-frame-pointer2019073020190729
41813352refgcc -funroll-loops -m32 -march=i486 -O2 -fomit-frame-pointer2019073020190729
41852288refgcc -funroll-loops -m32 -march=i386 -O2 -fomit-frame-pointer2019073020190729
43976724refgcc -m32 -march=i386 -O -fomit-frame-pointer2019073020190729
44003816refgcc -m32 -march=i486 -O -fomit-frame-pointer2019073020190729
45541744refgcc -m32 -march=i386 -O3 -fomit-frame-pointer2019073020190729
45561240refgcc -m32 -march=i486 -O3 -fomit-frame-pointer2019073020190729
59506944refgcc -m32 -march=i486 -O2 -fomit-frame-pointer2019073020190729
59512552refgcc -m32 -march=i386 -O2 -fomit-frame-pointer2019073020190729

Compiler output

Implementation: crypto_core/inv3sntrup653/avx
Compiler: gcc -funroll-loops -m32 -O2 -fomit-frame-pointer
r3_recip.c: r3_recip.c: In function 'vec256_divx_1':
r3_recip.c: r3_recip.c:237:29: warning: implicit declaration of function '_mm_cvtsi128_si64'; did you mean '_mm_cvtsi128_si32'? [-Wimplicit-function-declaration]
r3_recip.c: unsigned long long low0 = _mm_cvtsi128_si64(_mm256_castsi256_si128(f0));
r3_recip.c: ^~~~~~~~~~~~~~~~~
r3_recip.c: _mm_cvtsi128_si32
r3_recip.c: r3_recip.c: In function 'crypto_core_inv3sntrup653_avx':
r3_recip.c: r3_recip.c:362:9: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
r3_recip.c: F0[0] = _mm256_set_epi32(0,0,0,0,0,0,0,1);
r3_recip.c: ~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
r3_recip.c: r3_recip.c: In function 'vec256_timesx_1':
r3_recip.c: r3_recip.c:291:29: warning: SSE vector return without SSE enabled changes the ABI [-Wpsabi]
r3_recip.c: unsigned long long low0 = _mm_cvtsi128_si64(_mm256_castsi256_si128(f0));
r3_recip.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
r3_recip.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:43:0,
r3_recip.c: from r3_recip.c:3:
r3_recip.c: r3_recip.c: In function 'vec256_frombits':
r3_recip.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx2intrin.h:597:1: error: inlining failed in call to always_inline '_mm256_shuffle_epi32': target specific option mismatch
r3_recip.c: _mm256_shuffle_epi32 (__m256i __A, const int __mask)
r3_recip.c: ^~~~~~~~~~~~~~~~~~~~
r3_recip.c: r3_recip.c:69:7: note: called from here
r3_recip.c: h = _mm256_shuffle_epi32(h,0xd8);
r3_recip.c: ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
r3_recip.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:43:0,
r3_recip.c: from r3_recip.c:3:
r3_recip.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx2intrin.h:1068:1: error: inlining failed in call to always_inline '_mm256_permute4x64_epi64': target specific option mismatch
r3_recip.c: ...

Number of similar (compiler,implementation) pairs: 88, namely:
CompilerImplementations
gcc -funroll-loops -m32 -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -O -fomit-frame-pointer avx
gcc -funroll-loops -m32 -Os -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=athlon -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=athlon -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=athlon -O -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=athlon -Os -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=i386 -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=i386 -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=i386 -O -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=i386 -Os -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=i486 -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=i486 -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=i486 -O -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=i486 -Os -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=k6-2 -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=k6-2 -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=k6-2 -O -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=k6-2 -Os -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=k6-3 -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=k6-3 -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=k6-3 -O -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=k6-3 -Os -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=k6 -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=k6 -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=k6 -O -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=k6 -Os -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium-mmx -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium-mmx -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium-mmx -O -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium-mmx -Os -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium2 -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium2 -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium2 -O -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium2 -Os -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium -O -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium -Os -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentiumpro -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentiumpro -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentiumpro -O -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentiumpro -Os -fomit-frame-pointer avx
gcc -m32 -O2 -fomit-frame-pointer avx
gcc -m32 -O3 -fomit-frame-pointer avx
gcc -m32 -O -fomit-frame-pointer avx
gcc -m32 -Os -fomit-frame-pointer avx
gcc -m32 -march=athlon -O2 -fomit-frame-pointer avx
gcc -m32 -march=athlon -O3 -fomit-frame-pointer avx
gcc -m32 -march=athlon -O -fomit-frame-pointer avx
gcc -m32 -march=athlon -Os -fomit-frame-pointer avx
gcc -m32 -march=i386 -O2 -fomit-frame-pointer avx
gcc -m32 -march=i386 -O3 -fomit-frame-pointer avx
gcc -m32 -march=i386 -O -fomit-frame-pointer avx
gcc -m32 -march=i386 -Os -fomit-frame-pointer avx
gcc -m32 -march=i486 -O2 -fomit-frame-pointer avx
gcc -m32 -march=i486 -O3 -fomit-frame-pointer avx
gcc -m32 -march=i486 -O -fomit-frame-pointer avx
gcc -m32 -march=i486 -Os -fomit-frame-pointer avx
gcc -m32 -march=k6-2 -O2 -fomit-frame-pointer avx
gcc -m32 -march=k6-2 -O3 -fomit-frame-pointer avx
gcc -m32 -march=k6-2 -O -fomit-frame-pointer avx
gcc -m32 -march=k6-2 -Os -fomit-frame-pointer avx
gcc -m32 -march=k6-3 -O2 -fomit-frame-pointer avx
gcc -m32 -march=k6-3 -O3 -fomit-frame-pointer avx
gcc -m32 -march=k6-3 -O -fomit-frame-pointer avx
gcc -m32 -march=k6-3 -Os -fomit-frame-pointer avx
gcc -m32 -march=k6 -O2 -fomit-frame-pointer avx
gcc -m32 -march=k6 -O3 -fomit-frame-pointer avx
gcc -m32 -march=k6 -O -fomit-frame-pointer avx
gcc -m32 -march=k6 -Os -fomit-frame-pointer avx
gcc -m32 -march=pentium-mmx -O2 -fomit-frame-pointer avx
gcc -m32 -march=pentium-mmx -O3 -fomit-frame-pointer avx
gcc -m32 -march=pentium-mmx -O -fomit-frame-pointer avx
gcc -m32 -march=pentium-mmx -Os -fomit-frame-pointer avx
gcc -m32 -march=pentium2 -O2 -fomit-frame-pointer avx
gcc -m32 -march=pentium2 -O3 -fomit-frame-pointer avx
gcc -m32 -march=pentium2 -O -fomit-frame-pointer avx
gcc -m32 -march=pentium2 -Os -fomit-frame-pointer avx
gcc -m32 -march=pentium -O2 -fomit-frame-pointer avx
gcc -m32 -march=pentium -O3 -fomit-frame-pointer avx
gcc -m32 -march=pentium -O -fomit-frame-pointer avx
gcc -m32 -march=pentium -Os -fomit-frame-pointer avx
gcc -m32 -march=pentiumpro -O2 -fomit-frame-pointer avx
gcc -m32 -march=pentiumpro -O3 -fomit-frame-pointer avx
gcc -m32 -march=pentiumpro -O -fomit-frame-pointer avx
gcc -m32 -march=pentiumpro -Os -fomit-frame-pointer avx

Compiler output

Implementation: crypto_core/inv3sntrup653/avx
Compiler: gcc -funroll-loops -m32 -march=barcelona -O2 -fomit-frame-pointer
r3_recip.c: r3_recip.c: In function 'vec256_divx_1':
r3_recip.c: r3_recip.c:237:29: warning: implicit declaration of function '_mm_cvtsi128_si64'; did you mean '_mm_cvtsi128_si32'? [-Wimplicit-function-declaration]
r3_recip.c: unsigned long long low0 = _mm_cvtsi128_si64(_mm256_castsi256_si128(f0));
r3_recip.c: ^~~~~~~~~~~~~~~~~
r3_recip.c: _mm_cvtsi128_si32
r3_recip.c: r3_recip.c: In function 'crypto_core_inv3sntrup653_avx':
r3_recip.c: r3_recip.c:362:9: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
r3_recip.c: F0[0] = _mm256_set_epi32(0,0,0,0,0,0,0,1);
r3_recip.c: ~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
r3_recip.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:43:0,
r3_recip.c: from r3_recip.c:3:
r3_recip.c: r3_recip.c: In function 'vec256_frombits':
r3_recip.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx2intrin.h:597:1: error: inlining failed in call to always_inline '_mm256_shuffle_epi32': target specific option mismatch
r3_recip.c: _mm256_shuffle_epi32 (__m256i __A, const int __mask)
r3_recip.c: ^~~~~~~~~~~~~~~~~~~~
r3_recip.c: r3_recip.c:69:7: note: called from here
r3_recip.c: h = _mm256_shuffle_epi32(h,0xd8);
r3_recip.c: ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
r3_recip.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:43:0,
r3_recip.c: from r3_recip.c:3:
r3_recip.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx2intrin.h:1068:1: error: inlining failed in call to always_inline '_mm256_permute4x64_epi64': target specific option mismatch
r3_recip.c: _mm256_permute4x64_epi64 (__m256i __X, const int __M)
r3_recip.c: ^~~~~~~~~~~~~~~~~~~~~~~~
r3_recip.c: r3_recip.c:68:7: note: called from here
r3_recip.c: h = _mm256_permute4x64_epi64(h,0xd8);
r3_recip.c: ...

Number of similar (compiler,implementation) pairs: 68, namely:
CompilerImplementations
gcc -funroll-loops -m32 -march=barcelona -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=barcelona -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=barcelona -O -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=barcelona -Os -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=k8 -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=k8 -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=k8 -O -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=k8 -Os -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=nocona -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=nocona -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=nocona -O -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=nocona -Os -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium-m -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium-m -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium-m -O -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium-m -Os -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium3 -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium3 -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium3 -O -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium3 -Os -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium4 -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium4 -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium4 -O -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium4 -Os -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=prescott -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=prescott -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=prescott -O -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=prescott -Os -fomit-frame-pointer avx
gcc -m32 -march=core2 -O2 -fomit-frame-pointer avx
gcc -m32 -march=core2 -O3 -fomit-frame-pointer avx
gcc -m32 -march=core2 -O -fomit-frame-pointer avx
gcc -m32 -march=core2 -Os -fomit-frame-pointer avx
gcc -m32 -march=core2 -msse4.1 -O2 -fomit-frame-pointer avx
gcc -m32 -march=core2 -msse4.1 -O3 -fomit-frame-pointer avx
gcc -m32 -march=core2 -msse4.1 -O -fomit-frame-pointer avx
gcc -m32 -march=core2 -msse4.1 -Os -fomit-frame-pointer avx
gcc -m32 -march=core2 -msse4 -O2 -fomit-frame-pointer avx
gcc -m32 -march=core2 -msse4 -O3 -fomit-frame-pointer avx
gcc -m32 -march=core2 -msse4 -O -fomit-frame-pointer avx
gcc -m32 -march=core2 -msse4 -Os -fomit-frame-pointer avx
gcc -m32 -march=corei7 -O2 -fomit-frame-pointer avx
gcc -m32 -march=corei7 -O3 -fomit-frame-pointer avx
gcc -m32 -march=corei7 -O -fomit-frame-pointer avx
gcc -m32 -march=corei7 -Os -fomit-frame-pointer avx
gcc -m32 -march=k8 -O2 -fomit-frame-pointer avx
gcc -m32 -march=k8 -O3 -fomit-frame-pointer avx
gcc -m32 -march=k8 -O -fomit-frame-pointer avx
gcc -m32 -march=k8 -Os -fomit-frame-pointer avx
gcc -m32 -march=nocona -O2 -fomit-frame-pointer avx
gcc -m32 -march=nocona -O3 -fomit-frame-pointer avx
gcc -m32 -march=nocona -O -fomit-frame-pointer avx
gcc -m32 -march=nocona -Os -fomit-frame-pointer avx
gcc -m32 -march=pentium-m -O2 -fomit-frame-pointer avx
gcc -m32 -march=pentium-m -O3 -fomit-frame-pointer avx
gcc -m32 -march=pentium-m -O -fomit-frame-pointer avx
gcc -m32 -march=pentium-m -Os -fomit-frame-pointer avx
gcc -m32 -march=pentium3 -O2 -fomit-frame-pointer avx
gcc -m32 -march=pentium3 -O3 -fomit-frame-pointer avx
gcc -m32 -march=pentium3 -O -fomit-frame-pointer avx
gcc -m32 -march=pentium3 -Os -fomit-frame-pointer avx
gcc -m32 -march=pentium4 -O2 -fomit-frame-pointer avx
gcc -m32 -march=pentium4 -O3 -fomit-frame-pointer avx
gcc -m32 -march=pentium4 -O -fomit-frame-pointer avx
gcc -m32 -march=pentium4 -Os -fomit-frame-pointer avx
gcc -m32 -march=prescott -O2 -fomit-frame-pointer avx
gcc -m32 -march=prescott -O3 -fomit-frame-pointer avx
gcc -m32 -march=prescott -O -fomit-frame-pointer avx
gcc -m32 -march=prescott -Os -fomit-frame-pointer avx

Compiler output

Implementation: crypto_core/inv3sntrup653/avx
Compiler: gcc -m32 -march=barcelona -O2 -fomit-frame-pointer
r3_recip.c: r3_recip.c: In function 'vec256_divx_1':
r3_recip.c: r3_recip.c:237:29: warning: implicit declaration of function '_mm_cvtsi128_si64'; did you mean '_mm_cvtsi128_si32'? [-Wimplicit-function-declaration]
r3_recip.c: unsigned long long low0 = _mm_cvtsi128_si64(_mm256_castsi256_si128(f0));
r3_recip.c: ^~~~~~~~~~~~~~~~~
r3_recip.c: _mm_cvtsi128_si32
r3_recip.c: r3_recip.c: In function 'crypto_core_inv3sntrup653_avx':
r3_recip.c: r3_recip.c:362:9: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
r3_recip.c: F0[0] = _mm256_set_epi32(0,0,0,0,0,0,0,1);
r3_recip.c: ~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
r3_recip.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:43:0,
r3_recip.c: from r3_recip.c:3:
r3_recip.c: r3_recip.c: In function 'vec256_frombits':
r3_recip.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx2intrin.h:597:1: error: inlining failed in call to always_inline '_mm256_shuffle_epi32': target specific option mismatch
r3_recip.c: _mm256_shuffle_epi32 (__m256i __A, const int __mask)
r3_recip.c: ^~~~~~~~~~~~~~~~~~~~
r3_recip.c: r3_recip.c:69:7: note: called from here
r3_recip.c: h = _mm256_shuffle_epi32(h,0xd8);
r3_recip.c: ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
r3_recip.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:43:0,
r3_recip.c: from r3_recip.c:3:
r3_recip.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx2intrin.h:1068:1: error: inlining failed in call to always_inline '_mm256_permute4x64_epi64': target specific option mismatch
r3_recip.c: _mm256_permute4x64_epi64 (__m256i __X, const int __M)
r3_recip.c: ^~~~~~~~~~~~~~~~~~~~~~~~
r3_recip.c: r3_recip.c:68:7: note: called from here
r3_recip.c: h = _mm256_permute4x64_epi64(h,0xd8);
r3_recip.c: ...
r3_recip.c: r3_recip.c: In function 'vec256_divx_1':
r3_recip.c: r3_recip.c:237:29: warning: implicit declaration of function '_mm_cvtsi128_si64'; did you mean '_mm_cvtsi128_si32'? [-Wimplicit-function-declaration]
r3_recip.c: unsigned long long low0 = _mm_cvtsi128_si64(_mm256_castsi256_si128(f0));
r3_recip.c: ^~~~~~~~~~~~~~~~~
r3_recip.c: _mm_cvtsi128_si32
r3_recip.c: r3_recip.c: In function 'crypto_core_inv3sntrup653_avx':
r3_recip.c: r3_recip.c:362:9: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
r3_recip.c: F0[0] = _mm256_set_epi32(0,0,0,0,0,0,0,1);
r3_recip.c: ~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
r3_recip.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:43:0,
r3_recip.c: from r3_recip.c:3:
r3_recip.c: r3_recip.c: In function 'vec256_frombits':
r3_recip.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx2intrin.h:597:1: error: inlining failed in call to always_inline '_mm256_shuffle_epi32': target specific option mismatch
r3_recip.c: _mm256_shuffle_epi32 (__m256i __A, const int __mask)
r3_recip.c: ^~~~~~~~~~~~~~~~~~~~
r3_recip.c: r3_recip.c:69:7: note: called from here
r3_recip.c: h = _mm256_shuffle_epi32(h,0xd8);
r3_recip.c: ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
r3_recip.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:43:0,
r3_recip.c: from r3_recip.c:3:
r3_recip.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx2intrin.h:1068:1: error: inlining failed in call to always_inline '_mm256_permute4x64_epi64': target specific option mismatch
r3_recip.c: _mm256_permute4x64_epi64 (__m256i __X, const int __M)
r3_recip.c: ^~~~~~~~~~~~~~~~~~~~~~~~
r3_recip.c: r3_recip.c:68:7: note: called from here
r3_recip.c: h = _mm256_permute4x64_epi64(h,0xd8);
r3_recip.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -m32 -march=barcelona -O2 -fomit-frame-pointer avx
gcc -m32 -march=barcelona -O3 -fomit-frame-pointer avx
gcc -m32 -march=barcelona -O -fomit-frame-pointer avx
gcc -m32 -march=barcelona -Os -fomit-frame-pointer avx

Compiler output

Implementation: crypto_core/inv3sntrup653/avx
Compiler: gcc -m32 -march=core-avx-i -O2 -fomit-frame-pointer
r3_recip.c: r3_recip.c: In function 'vec256_divx_1':
r3_recip.c: r3_recip.c:237:29: warning: implicit declaration of function '_mm_cvtsi128_si64'; did you mean '_mm_cvtsi128_si32'? [-Wimplicit-function-declaration]
r3_recip.c: unsigned long long low0 = _mm_cvtsi128_si64(_mm256_castsi256_si128(f0));
r3_recip.c: ^~~~~~~~~~~~~~~~~
r3_recip.c: _mm_cvtsi128_si32
r3_recip.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:43:0,
r3_recip.c: from r3_recip.c:3:
r3_recip.c: r3_recip.c: In function 'vec256_frombits':
r3_recip.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx2intrin.h:597:1: error: inlining failed in call to always_inline '_mm256_shuffle_epi32': target specific option mismatch
r3_recip.c: _mm256_shuffle_epi32 (__m256i __A, const int __mask)
r3_recip.c: ^~~~~~~~~~~~~~~~~~~~
r3_recip.c: r3_recip.c:69:7: note: called from here
r3_recip.c: h = _mm256_shuffle_epi32(h,0xd8);
r3_recip.c: ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
r3_recip.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:43:0,
r3_recip.c: from r3_recip.c:3:
r3_recip.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx2intrin.h:1068:1: error: inlining failed in call to always_inline '_mm256_permute4x64_epi64': target specific option mismatch
r3_recip.c: _mm256_permute4x64_epi64 (__m256i __X, const int __M)
r3_recip.c: ^~~~~~~~~~~~~~~~~~~~~~~~
r3_recip.c: r3_recip.c:68:7: note: called from here
r3_recip.c: h = _mm256_permute4x64_epi64(h,0xd8);
r3_recip.c: ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
r3_recip.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:43:0,
r3_recip.c: from r3_recip.c:3:
r3_recip.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx2intrin.h:588:1: error: inlining failed in call to always_inline '_mm256_shuffle_epi8': target specific option mismatch
r3_recip.c: ...

Number of similar (compiler,implementation) pairs: 8, namely:
CompilerImplementations
gcc -m32 -march=core-avx-i -O2 -fomit-frame-pointer avx
gcc -m32 -march=core-avx-i -O3 -fomit-frame-pointer avx
gcc -m32 -march=core-avx-i -O -fomit-frame-pointer avx
gcc -m32 -march=core-avx-i -Os -fomit-frame-pointer avx
gcc -m32 -march=corei7-avx -O2 -fomit-frame-pointer avx
gcc -m32 -march=corei7-avx -O3 -fomit-frame-pointer avx
gcc -m32 -march=corei7-avx -O -fomit-frame-pointer avx
gcc -m32 -march=corei7-avx -Os -fomit-frame-pointer avx

Compiler output

Implementation: crypto_core/inv3sntrup653/avx
Compiler: gcc -m32 -march=core-avx2 -O2 -fomit-frame-pointer
r3_recip.c: r3_recip.c: In function 'vec256_divx_1':
r3_recip.c: r3_recip.c:237:29: warning: implicit declaration of function '_mm_cvtsi128_si64'; did you mean '_mm_cvtsi128_si32'? [-Wimplicit-function-declaration]
r3_recip.c: unsigned long long low0 = _mm_cvtsi128_si64(_mm256_castsi256_si128(f0));
r3_recip.c: ^~~~~~~~~~~~~~~~~
r3_recip.c: _mm_cvtsi128_si32
try.c: crypto_core_inv3sntrup653.a(r3_recip.o): In function `crypto_core_inv3sntrup653_avx':
try.c: r3_recip.c:(.text+0x...): undefined reference to `_mm_cvtsi128_si64'
try.c: r3_recip.c:(.text+0x...): undefined reference to `_mm_cvtsi128_si64'
try.c: r3_recip.c:(.text+0x...): undefined reference to `_mm_cvtsi128_si64'
try.c: r3_recip.c:(.text+0x...): undefined reference to `_mm_cvtsi128_si64'
try.c: r3_recip.c:(.text+0x...): undefined reference to `_mm_cvtsi128_si64'
try.c: crypto_core_inv3sntrup653.a(r3_recip.o):r3_recip.c:(.text+0x...): more undefined references to `_mm_cvtsi128_si64' follow
try.c: collect2: error: ld returned 1 exit status

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
gcc -m32 -march=core-avx2 -O2 -fomit-frame-pointer avx
gcc -m32 -march=core-avx2 -O3 -fomit-frame-pointer avx
gcc -m32 -march=core-avx2 -O -fomit-frame-pointer avx
gcc -m32 -march=native -mtune=native -O2 -fomit-frame-pointer avx
gcc -m32 -march=native -mtune=native -O3 -fomit-frame-pointer avx
gcc -m32 -march=native -mtune=native -O -fomit-frame-pointer avx

Compiler output

Implementation: crypto_core/inv3sntrup653/avx
Compiler: gcc -m32 -march=core-avx2 -Os -fomit-frame-pointer
r3_recip.c: r3_recip.c: In function 'vec256_divx_1':
r3_recip.c: r3_recip.c:237:29: warning: implicit declaration of function '_mm_cvtsi128_si64'; did you mean '_mm_cvtsi128_si32'? [-Wimplicit-function-declaration]
r3_recip.c: unsigned long long low0 = _mm_cvtsi128_si64(_mm256_castsi256_si128(f0));
r3_recip.c: ^~~~~~~~~~~~~~~~~
r3_recip.c: _mm_cvtsi128_si32
try.c: crypto_core_inv3sntrup653.a(r3_recip.o): In function `vec256_timesx_2':
try.c: r3_recip.c:(.text+0x...): undefined reference to `_mm_cvtsi128_si64'
try.c: r3_recip.c:(.text+0x...): undefined reference to `_mm_cvtsi128_si64'
try.c: crypto_core_inv3sntrup653.a(r3_recip.o): In function `vec256_divx_3':
try.c: r3_recip.c:(.text+0x...): undefined reference to `_mm_cvtsi128_si64'
try.c: r3_recip.c:(.text+0x...): undefined reference to `_mm_cvtsi128_si64'
try.c: r3_recip.c:(.text+0x...): undefined reference to `_mm_cvtsi128_si64'
try.c: crypto_core_inv3sntrup653.a(r3_recip.o):r3_recip.c:(.text+0x...): more undefined references to `_mm_cvtsi128_si64' follow
try.c: collect2: error: ld returned 1 exit status

Number of similar (compiler,implementation) pairs: 2, namely:
CompilerImplementations
gcc -m32 -march=core-avx2 -Os -fomit-frame-pointer avx
gcc -m32 -march=native -mtune=native -Os -fomit-frame-pointer avx