Implementation notes: x86, samba, crypto_core/inv3sntrup653

Computer: samba
Architecture: x86
CPU ID: GenuineIntel-000506e3-bfebfbff
SUPERCOP version: 20190803
Operation: crypto_core
Primitive: inv3sntrup653
TimeImplementationCompilerBenchmark dateSUPERCOP version
2778966refgcc -m32 -march=core-avx2 -O3 -fomit-frame-pointer2019072920190729
2782530refgcc -m32 -march=native -mtune=native -O3 -fomit-frame-pointer2019072920190729
3729151refgcc -m32 -march=core2 -msse4.1 -O3 -fomit-frame-pointer2019072920190729
3735246refgcc -m32 -march=core2 -O3 -fomit-frame-pointer2019072920190729
3743863refgcc -m32 -march=core-avx-i -O3 -fomit-frame-pointer2019072920190729
3746610refgcc -m32 -march=core2 -msse4 -O3 -fomit-frame-pointer2019072920190729
3747289refgcc -m32 -march=corei7-avx -O3 -fomit-frame-pointer2019072920190729
3750738refgcc -m32 -march=corei7 -O3 -fomit-frame-pointer2019072920190729
4714822refgcc -funroll-loops -m32 -march=pentium4 -O3 -fomit-frame-pointer2019072920190729
4737081refgcc -m32 -march=pentium4 -O3 -fomit-frame-pointer2019072920190729
4771549refgcc -funroll-loops -m32 -march=barcelona -O3 -fomit-frame-pointer2019072920190729
4810475refgcc -funroll-loops -m32 -march=k8 -O3 -fomit-frame-pointer2019072920190729
4852692refgcc -m32 -march=k8 -O3 -fomit-frame-pointer2019072920190729
4857360refgcc -m32 -march=pentium-m -O3 -fomit-frame-pointer2019072920190729
4866167refgcc -funroll-loops -m32 -march=pentium-m -O3 -fomit-frame-pointer2019072920190729
4878443refgcc -m32 -march=barcelona -O3 -fomit-frame-pointer2019072920190729
4902305refgcc -m32 -march=prescott -O3 -fomit-frame-pointer2019072920190729
4903669refgcc -m32 -march=nocona -O3 -fomit-frame-pointer2019072920190729
4907589refgcc -funroll-loops -m32 -march=prescott -O3 -fomit-frame-pointer2019072920190729
4909208refgcc -funroll-loops -m32 -march=nocona -O3 -fomit-frame-pointer2019072920190729
19448373refgcc -m32 -march=barcelona -O2 -fomit-frame-pointer2019072920190729
19456571refgcc -m32 -march=k8 -O2 -fomit-frame-pointer2019072920190729
24012340refgcc -m32 -march=k6-2 -O2 -fomit-frame-pointer2019072920190729
24175324refgcc -m32 -march=k6-3 -O2 -fomit-frame-pointer2019072920190729
24186361refgcc -m32 -march=k6 -O2 -fomit-frame-pointer2019072920190729
24709013refgcc -funroll-loops -m32 -march=k8 -O2 -fomit-frame-pointer2019072920190729
24711625refgcc -funroll-loops -m32 -march=barcelona -O2 -fomit-frame-pointer2019072920190729
25394483refgcc -m32 -march=athlon -O3 -fomit-frame-pointer2019072920190729
26751694refgcc -m32 -march=k6-3 -O3 -fomit-frame-pointer2019072920190729
26756351refgcc -m32 -march=k6 -O3 -fomit-frame-pointer2019072920190729
26757450refgcc -m32 -march=k6-2 -O3 -fomit-frame-pointer2019072920190729
26885954refgcc -funroll-loops -m32 -march=pentium4 -O2 -fomit-frame-pointer2019072920190729
27004865refgcc -funroll-loops -m32 -march=k6 -Os -fomit-frame-pointer2019072920190729
27005117refgcc -funroll-loops -m32 -march=k6-2 -Os -fomit-frame-pointer2019072920190729
27005204refgcc -funroll-loops -m32 -march=k6-3 -Os -fomit-frame-pointer2019072920190729
27165569refgcc -funroll-loops -m32 -march=pentium4 -O -fomit-frame-pointer2019072920190729
27466414refgcc -funroll-loops -m32 -march=athlon -O3 -fomit-frame-pointer2019072920190729
28221705refgcc -funroll-loops -m32 -march=athlon -O -fomit-frame-pointer2019072920190729
28246191refgcc -funroll-loops -m32 -march=i486 -Os -fomit-frame-pointer2019072920190729
28246913refgcc -funroll-loops -m32 -march=i386 -Os -fomit-frame-pointer2019072920190729
28338383refgcc -m32 -march=pentium4 -O -fomit-frame-pointer2019072920190729
28543929refgcc -funroll-loops -m32 -O3 -fomit-frame-pointer2019072920190729
28602612refgcc -m32 -O3 -fomit-frame-pointer2019072920190729
28801040refgcc -funroll-loops -m32 -O -fomit-frame-pointer2019072920190729
29310458refgcc -m32 -march=native -mtune=native -Os -fomit-frame-pointer2019072920190729
29315133refgcc -m32 -march=core-avx2 -Os -fomit-frame-pointer2019072920190729
29360652refgcc -m32 -march=core2 -Os -fomit-frame-pointer2019072920190729
29369192refgcc -m32 -march=core-avx-i -Os -fomit-frame-pointer2019072920190729
29370567refgcc -m32 -march=corei7-avx -Os -fomit-frame-pointer2019072920190729
29402290refgcc -m32 -march=nocona -Os -fomit-frame-pointer2019072920190729
29402607refgcc -m32 -march=pentium4 -Os -fomit-frame-pointer2019072920190729
29407254refgcc -m32 -march=prescott -Os -fomit-frame-pointer2019072920190729
29420070refgcc -m32 -O -fomit-frame-pointer2019072920190729
29429476refgcc -m32 -march=core-avx2 -O -fomit-frame-pointer2019072920190729
29429794refgcc -m32 -march=native -mtune=native -O -fomit-frame-pointer2019072920190729
29518377refgcc -funroll-loops -m32 -march=athlon -O2 -fomit-frame-pointer2019072920190729
29772460refgcc -m32 -march=corei7 -Os -fomit-frame-pointer2019072920190729
29772762refgcc -m32 -march=core2 -msse4.1 -Os -fomit-frame-pointer2019072920190729
29772813refgcc -m32 -march=core2 -msse4 -Os -fomit-frame-pointer2019072920190729
29930293refgcc -m32 -march=athlon -O2 -fomit-frame-pointer2019072920190729
29934956refgcc -m32 -march=k8 -Os -fomit-frame-pointer2019072920190729
29937159refgcc -m32 -march=barcelona -Os -fomit-frame-pointer2019072920190729
29950256refgcc -m32 -march=i486 -Os -fomit-frame-pointer2019072920190729
29953625refgcc -m32 -march=i386 -Os -fomit-frame-pointer2019072920190729
30015936refgcc -m32 -Os -fomit-frame-pointer2019072920190729
30016500refgcc -m32 -march=athlon -Os -fomit-frame-pointer2019072920190729
30153925refgcc -funroll-loops -m32 -march=pentium-m -O -fomit-frame-pointer2019072920190729
30192343refgcc -m32 -march=pentium4 -O2 -fomit-frame-pointer2019072920190729
30204956refgcc -funroll-loops -m32 -march=pentium2 -O -fomit-frame-pointer2019072920190729
30205832refgcc -funroll-loops -m32 -march=pentiumpro -O -fomit-frame-pointer2019072920190729
30208061refgcc -funroll-loops -m32 -march=pentium3 -O -fomit-frame-pointer2019072920190729
30237683refgcc -m32 -march=pentium -Os -fomit-frame-pointer2019072920190729
30240689refgcc -m32 -march=pentium-mmx -Os -fomit-frame-pointer2019072920190729
30260389refgcc -m32 -march=core2 -msse4 -O -fomit-frame-pointer2019072920190729
30260663refgcc -m32 -march=corei7-avx -O -fomit-frame-pointer2019072920190729
30261522refgcc -m32 -march=core-avx-i -O -fomit-frame-pointer2019072920190729
30261522refgcc -m32 -march=core2 -O -fomit-frame-pointer2019072920190729
30261526refgcc -m32 -march=corei7 -O -fomit-frame-pointer2019072920190729
30261530refgcc -m32 -march=core2 -msse4.1 -O -fomit-frame-pointer2019072920190729
30504393refgcc -funroll-loops -m32 -O2 -fomit-frame-pointer2019072920190729
30895463refgcc -funroll-loops -m32 -march=pentium3 -O2 -fomit-frame-pointer2019072920190729
30895502refgcc -funroll-loops -m32 -march=pentiumpro -O2 -fomit-frame-pointer2019072920190729
30897283refgcc -funroll-loops -m32 -march=pentium2 -O2 -fomit-frame-pointer2019072920190729
30952621refgcc -funroll-loops -m32 -march=pentium-m -O2 -fomit-frame-pointer2019072920190729
31024752refgcc -m32 -march=corei7-avx -O2 -fomit-frame-pointer2019072920190729
31025162refgcc -m32 -march=core-avx-i -O2 -fomit-frame-pointer2019072920190729
31029535refgcc -m32 -O2 -fomit-frame-pointer2019072920190729
31030861refgcc -m32 -march=core2 -O2 -fomit-frame-pointer2019072920190729
31039992refgcc -m32 -march=core-avx2 -O2 -fomit-frame-pointer2019072920190729
31040522refgcc -m32 -march=native -mtune=native -O2 -fomit-frame-pointer2019072920190729
31042238refgcc -m32 -march=corei7 -O2 -fomit-frame-pointer2019072920190729
31042838refgcc -m32 -march=core2 -msse4 -O2 -fomit-frame-pointer2019072920190729
31043663refgcc -m32 -march=core2 -msse4.1 -O2 -fomit-frame-pointer2019072920190729
31319089refgcc -funroll-loops -m32 -march=barcelona -O -fomit-frame-pointer2019072920190729
31324228refgcc -funroll-loops -m32 -march=k8 -O -fomit-frame-pointer2019072920190729
31866018refgcc -m32 -march=pentium -O3 -fomit-frame-pointer2019072920190729
31884161refgcc -m32 -march=pentium-mmx -O3 -fomit-frame-pointer2019072920190729
31939117refgcc -m32 -march=athlon -O -fomit-frame-pointer2019072920190729
32087917refgcc -funroll-loops -m32 -march=prescott -O2 -fomit-frame-pointer2019072920190729
32088766refgcc -funroll-loops -m32 -march=nocona -O2 -fomit-frame-pointer2019072920190729
32715284refgcc -funroll-loops -m32 -march=prescott -O -fomit-frame-pointer2019072920190729
32717934refgcc -funroll-loops -m32 -march=nocona -O -fomit-frame-pointer2019072920190729
32757472refgcc -m32 -march=k8 -O -fomit-frame-pointer2019072920190729
32760195refgcc -m32 -march=barcelona -O -fomit-frame-pointer2019072920190729
32851142refgcc -funroll-loops -m32 -march=k6 -O3 -fomit-frame-pointer2019072920190729
32854732refgcc -funroll-loops -m32 -march=k6-3 -O3 -fomit-frame-pointer2019072920190729
32858609refgcc -funroll-loops -m32 -march=k6-2 -O3 -fomit-frame-pointer2019072920190729
33228201refgcc -m32 -march=pentiumpro -Os -fomit-frame-pointer2019072920190729
33228395refgcc -m32 -march=pentium3 -Os -fomit-frame-pointer2019072920190729
33228452refgcc -m32 -march=pentium2 -Os -fomit-frame-pointer2019072920190729
33239791refgcc -m32 -march=pentium-m -Os -fomit-frame-pointer2019072920190729
33507983refgcc -funroll-loops -m32 -march=pentium-mmx -O2 -fomit-frame-pointer2019072920190729
33509221refgcc -funroll-loops -m32 -march=pentium -O2 -fomit-frame-pointer2019072920190729
33581811refgcc -m32 -march=pentium2 -O3 -fomit-frame-pointer2019072920190729
33582641refgcc -m32 -march=pentiumpro -O3 -fomit-frame-pointer2019072920190729
33679248refgcc -m32 -march=pentium3 -O3 -fomit-frame-pointer2019072920190729
33990850refgcc -funroll-loops -m32 -march=k6-3 -O2 -fomit-frame-pointer2019072920190729
33993526refgcc -funroll-loops -m32 -march=k6-2 -O2 -fomit-frame-pointer2019072920190729
33994046refgcc -funroll-loops -m32 -march=k6 -O2 -fomit-frame-pointer2019072920190729
34194470refgcc -funroll-loops -m32 -march=pentium3 -O3 -fomit-frame-pointer2019072920190729
34195329refgcc -funroll-loops -m32 -march=pentiumpro -O3 -fomit-frame-pointer2019072920190729
34229050refgcc -funroll-loops -m32 -march=pentium2 -O3 -fomit-frame-pointer2019072920190729
34620932refgcc -m32 -march=pentium-m -O -fomit-frame-pointer2019072920190729
34631974refgcc -funroll-loops -m32 -march=pentium -O3 -fomit-frame-pointer2019072920190729
34634562refgcc -funroll-loops -m32 -march=pentium-mmx -O3 -fomit-frame-pointer2019072920190729
34814430refgcc -funroll-loops -m32 -march=k6-2 -O -fomit-frame-pointer2019072920190729
34814994refgcc -funroll-loops -m32 -march=k6 -O -fomit-frame-pointer2019072920190729
34816326refgcc -funroll-loops -m32 -march=k6-3 -O -fomit-frame-pointer2019072920190729
35454730refgcc -m32 -march=pentiumpro -O -fomit-frame-pointer2019072920190729
35455106refgcc -m32 -march=pentium2 -O -fomit-frame-pointer2019072920190729
35458980refgcc -m32 -march=pentium3 -O -fomit-frame-pointer2019072920190729
35558478refgcc -m32 -march=pentium-m -O2 -fomit-frame-pointer2019072920190729
35614859refgcc -m32 -march=pentium2 -O2 -fomit-frame-pointer2019072920190729
35615117refgcc -m32 -march=pentium3 -O2 -fomit-frame-pointer2019072920190729
35620823refgcc -m32 -march=pentiumpro -O2 -fomit-frame-pointer2019072920190729
35781315refgcc -funroll-loops -m32 -march=i386 -O3 -fomit-frame-pointer2019072920190729
35783101refgcc -funroll-loops -m32 -march=i486 -O3 -fomit-frame-pointer2019072920190729
36978804refgcc -m32 -march=pentium -O -fomit-frame-pointer2019072920190729
36981056refgcc -m32 -march=pentium-mmx -O -fomit-frame-pointer2019072920190729
37101175refgcc -m32 -march=k6 -Os -fomit-frame-pointer2019072920190729
37102806refgcc -m32 -march=k6-2 -Os -fomit-frame-pointer2019072920190729
37102973refgcc -m32 -march=k6-3 -Os -fomit-frame-pointer2019072920190729
37123730refgcc -m32 -march=nocona -O -fomit-frame-pointer2019072920190729
37124502refgcc -m32 -march=prescott -O -fomit-frame-pointer2019072920190729
37189252refgcc -funroll-loops -m32 -march=prescott -Os -fomit-frame-pointer2019072920190729
37189942refgcc -funroll-loops -m32 -march=pentium4 -Os -fomit-frame-pointer2019072920190729
37190061refgcc -funroll-loops -m32 -march=nocona -Os -fomit-frame-pointer2019072920190729
37425062refgcc -funroll-loops -m32 -march=athlon -Os -fomit-frame-pointer2019072920190729
37432349refgcc -funroll-loops -m32 -march=k8 -Os -fomit-frame-pointer2019072920190729
37433012refgcc -funroll-loops -m32 -Os -fomit-frame-pointer2019072920190729
37433402refgcc -funroll-loops -m32 -march=barcelona -Os -fomit-frame-pointer2019072920190729
37711730refgcc -funroll-loops -m32 -march=pentium3 -Os -fomit-frame-pointer2019072920190729
37720383refgcc -funroll-loops -m32 -march=pentiumpro -Os -fomit-frame-pointer2019072920190729
37722734refgcc -funroll-loops -m32 -march=pentium2 -Os -fomit-frame-pointer2019072920190729
37842075refgcc -funroll-loops -m32 -march=pentium-mmx -Os -fomit-frame-pointer2019072920190729
37842234refgcc -funroll-loops -m32 -march=pentium -Os -fomit-frame-pointer2019072920190729
37954121refgcc -m32 -march=prescott -O2 -fomit-frame-pointer2019072920190729
37954278refgcc -m32 -march=nocona -O2 -fomit-frame-pointer2019072920190729
38139042refgcc -funroll-loops -m32 -march=pentium-m -Os -fomit-frame-pointer2019072920190729
39891421refgcc -m32 -march=k6 -O -fomit-frame-pointer2019072920190729
39894811refgcc -m32 -march=k6-2 -O -fomit-frame-pointer2019072920190729
39900093refgcc -m32 -march=k6-3 -O -fomit-frame-pointer2019072920190729
39967966refgcc -funroll-loops -m32 -march=pentium -O -fomit-frame-pointer2019072920190729
39969048refgcc -funroll-loops -m32 -march=pentium-mmx -O -fomit-frame-pointer2019072920190729
41083375refgcc -funroll-loops -m32 -march=i486 -O2 -fomit-frame-pointer2019072920190729
41107357refgcc -funroll-loops -m32 -march=i386 -O2 -fomit-frame-pointer2019072920190729
41115590refgcc -funroll-loops -m32 -march=i486 -O -fomit-frame-pointer2019072920190729
41122095refgcc -funroll-loops -m32 -march=i386 -O -fomit-frame-pointer2019072920190729
42268802refgcc -m32 -march=pentium -O2 -fomit-frame-pointer2019072920190729
42330915refgcc -m32 -march=pentium-mmx -O2 -fomit-frame-pointer2019072920190729
43961371refgcc -m32 -march=i386 -O -fomit-frame-pointer2019072920190729
44511676refgcc -m32 -march=i486 -O -fomit-frame-pointer2019072920190729
45286555refgcc -m32 -march=i386 -O3 -fomit-frame-pointer2019072920190729
45290637refgcc -m32 -march=i486 -O3 -fomit-frame-pointer2019072920190729
59380105refgcc -m32 -march=i486 -O2 -fomit-frame-pointer2019072920190729
59385873refgcc -m32 -march=i386 -O2 -fomit-frame-pointer2019072920190729

Compiler output

Implementation: crypto_core/inv3sntrup653/avx
Compiler: gcc -funroll-loops -m32 -O2 -fomit-frame-pointer
r3_recip.c: r3_recip.c: In function 'vec256_divx_1':
r3_recip.c: r3_recip.c:237:29: warning: implicit declaration of function '_mm_cvtsi128_si64'; did you mean '_mm_cvtsi128_si32'? [-Wimplicit-function-declaration]
r3_recip.c: unsigned long long low0 = _mm_cvtsi128_si64(_mm256_castsi256_si128(f0));
r3_recip.c: ^~~~~~~~~~~~~~~~~
r3_recip.c: _mm_cvtsi128_si32
r3_recip.c: r3_recip.c: In function 'crypto_core_inv3sntrup653_avx':
r3_recip.c: r3_recip.c:362:9: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
r3_recip.c: F0[0] = _mm256_set_epi32(0,0,0,0,0,0,0,1);
r3_recip.c: ~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
r3_recip.c: r3_recip.c: In function 'vec256_timesx_1':
r3_recip.c: r3_recip.c:291:29: warning: SSE vector return without SSE enabled changes the ABI [-Wpsabi]
r3_recip.c: unsigned long long low0 = _mm_cvtsi128_si64(_mm256_castsi256_si128(f0));
r3_recip.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
r3_recip.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:43:0,
r3_recip.c: from r3_recip.c:3:
r3_recip.c: r3_recip.c: In function 'vec256_frombits':
r3_recip.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx2intrin.h:597:1: error: inlining failed in call to always_inline '_mm256_shuffle_epi32': target specific option mismatch
r3_recip.c: _mm256_shuffle_epi32 (__m256i __A, const int __mask)
r3_recip.c: ^~~~~~~~~~~~~~~~~~~~
r3_recip.c: r3_recip.c:69:7: note: called from here
r3_recip.c: h = _mm256_shuffle_epi32(h,0xd8);
r3_recip.c: ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
r3_recip.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:43:0,
r3_recip.c: from r3_recip.c:3:
r3_recip.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx2intrin.h:1068:1: error: inlining failed in call to always_inline '_mm256_permute4x64_epi64': target specific option mismatch
r3_recip.c: ...

Number of similar (compiler,implementation) pairs: 88, namely:
CompilerImplementations
gcc -funroll-loops -m32 -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -O -fomit-frame-pointer avx
gcc -funroll-loops -m32 -Os -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=athlon -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=athlon -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=athlon -O -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=athlon -Os -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=i386 -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=i386 -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=i386 -O -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=i386 -Os -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=i486 -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=i486 -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=i486 -O -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=i486 -Os -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=k6-2 -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=k6-2 -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=k6-2 -O -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=k6-2 -Os -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=k6-3 -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=k6-3 -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=k6-3 -O -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=k6-3 -Os -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=k6 -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=k6 -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=k6 -O -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=k6 -Os -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium-mmx -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium-mmx -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium-mmx -O -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium-mmx -Os -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium2 -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium2 -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium2 -O -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium2 -Os -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium -O -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium -Os -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentiumpro -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentiumpro -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentiumpro -O -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentiumpro -Os -fomit-frame-pointer avx
gcc -m32 -O2 -fomit-frame-pointer avx
gcc -m32 -O3 -fomit-frame-pointer avx
gcc -m32 -O -fomit-frame-pointer avx
gcc -m32 -Os -fomit-frame-pointer avx
gcc -m32 -march=athlon -O2 -fomit-frame-pointer avx
gcc -m32 -march=athlon -O3 -fomit-frame-pointer avx
gcc -m32 -march=athlon -O -fomit-frame-pointer avx
gcc -m32 -march=athlon -Os -fomit-frame-pointer avx
gcc -m32 -march=i386 -O2 -fomit-frame-pointer avx
gcc -m32 -march=i386 -O3 -fomit-frame-pointer avx
gcc -m32 -march=i386 -O -fomit-frame-pointer avx
gcc -m32 -march=i386 -Os -fomit-frame-pointer avx
gcc -m32 -march=i486 -O2 -fomit-frame-pointer avx
gcc -m32 -march=i486 -O3 -fomit-frame-pointer avx
gcc -m32 -march=i486 -O -fomit-frame-pointer avx
gcc -m32 -march=i486 -Os -fomit-frame-pointer avx
gcc -m32 -march=k6-2 -O2 -fomit-frame-pointer avx
gcc -m32 -march=k6-2 -O3 -fomit-frame-pointer avx
gcc -m32 -march=k6-2 -O -fomit-frame-pointer avx
gcc -m32 -march=k6-2 -Os -fomit-frame-pointer avx
gcc -m32 -march=k6-3 -O2 -fomit-frame-pointer avx
gcc -m32 -march=k6-3 -O3 -fomit-frame-pointer avx
gcc -m32 -march=k6-3 -O -fomit-frame-pointer avx
gcc -m32 -march=k6-3 -Os -fomit-frame-pointer avx
gcc -m32 -march=k6 -O2 -fomit-frame-pointer avx
gcc -m32 -march=k6 -O3 -fomit-frame-pointer avx
gcc -m32 -march=k6 -O -fomit-frame-pointer avx
gcc -m32 -march=k6 -Os -fomit-frame-pointer avx
gcc -m32 -march=pentium-mmx -O2 -fomit-frame-pointer avx
gcc -m32 -march=pentium-mmx -O3 -fomit-frame-pointer avx
gcc -m32 -march=pentium-mmx -O -fomit-frame-pointer avx
gcc -m32 -march=pentium-mmx -Os -fomit-frame-pointer avx
gcc -m32 -march=pentium2 -O2 -fomit-frame-pointer avx
gcc -m32 -march=pentium2 -O3 -fomit-frame-pointer avx
gcc -m32 -march=pentium2 -O -fomit-frame-pointer avx
gcc -m32 -march=pentium2 -Os -fomit-frame-pointer avx
gcc -m32 -march=pentium -O2 -fomit-frame-pointer avx
gcc -m32 -march=pentium -O3 -fomit-frame-pointer avx
gcc -m32 -march=pentium -O -fomit-frame-pointer avx
gcc -m32 -march=pentium -Os -fomit-frame-pointer avx
gcc -m32 -march=pentiumpro -O2 -fomit-frame-pointer avx
gcc -m32 -march=pentiumpro -O3 -fomit-frame-pointer avx
gcc -m32 -march=pentiumpro -O -fomit-frame-pointer avx
gcc -m32 -march=pentiumpro -Os -fomit-frame-pointer avx

Compiler output

Implementation: crypto_core/inv3sntrup653/avx
Compiler: gcc -funroll-loops -m32 -march=barcelona -O2 -fomit-frame-pointer
r3_recip.c: r3_recip.c: In function 'vec256_divx_1':
r3_recip.c: r3_recip.c:237:29: warning: implicit declaration of function '_mm_cvtsi128_si64'; did you mean '_mm_cvtsi128_si32'? [-Wimplicit-function-declaration]
r3_recip.c: unsigned long long low0 = _mm_cvtsi128_si64(_mm256_castsi256_si128(f0));
r3_recip.c: ^~~~~~~~~~~~~~~~~
r3_recip.c: _mm_cvtsi128_si32
r3_recip.c: r3_recip.c: In function 'crypto_core_inv3sntrup653_avx':
r3_recip.c: r3_recip.c:362:9: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
r3_recip.c: F0[0] = _mm256_set_epi32(0,0,0,0,0,0,0,1);
r3_recip.c: ~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
r3_recip.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:43:0,
r3_recip.c: from r3_recip.c:3:
r3_recip.c: r3_recip.c: In function 'vec256_frombits':
r3_recip.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx2intrin.h:597:1: error: inlining failed in call to always_inline '_mm256_shuffle_epi32': target specific option mismatch
r3_recip.c: _mm256_shuffle_epi32 (__m256i __A, const int __mask)
r3_recip.c: ^~~~~~~~~~~~~~~~~~~~
r3_recip.c: r3_recip.c:69:7: note: called from here
r3_recip.c: h = _mm256_shuffle_epi32(h,0xd8);
r3_recip.c: ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
r3_recip.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:43:0,
r3_recip.c: from r3_recip.c:3:
r3_recip.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx2intrin.h:1068:1: error: inlining failed in call to always_inline '_mm256_permute4x64_epi64': target specific option mismatch
r3_recip.c: _mm256_permute4x64_epi64 (__m256i __X, const int __M)
r3_recip.c: ^~~~~~~~~~~~~~~~~~~~~~~~
r3_recip.c: r3_recip.c:68:7: note: called from here
r3_recip.c: h = _mm256_permute4x64_epi64(h,0xd8);
r3_recip.c: ...

Number of similar (compiler,implementation) pairs: 68, namely:
CompilerImplementations
gcc -funroll-loops -m32 -march=barcelona -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=barcelona -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=barcelona -O -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=barcelona -Os -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=k8 -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=k8 -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=k8 -O -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=k8 -Os -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=nocona -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=nocona -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=nocona -O -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=nocona -Os -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium-m -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium-m -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium-m -O -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium-m -Os -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium3 -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium3 -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium3 -O -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium3 -Os -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium4 -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium4 -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium4 -O -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=pentium4 -Os -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=prescott -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=prescott -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=prescott -O -fomit-frame-pointer avx
gcc -funroll-loops -m32 -march=prescott -Os -fomit-frame-pointer avx
gcc -m32 -march=core2 -O2 -fomit-frame-pointer avx
gcc -m32 -march=core2 -O3 -fomit-frame-pointer avx
gcc -m32 -march=core2 -O -fomit-frame-pointer avx
gcc -m32 -march=core2 -Os -fomit-frame-pointer avx
gcc -m32 -march=core2 -msse4.1 -O2 -fomit-frame-pointer avx
gcc -m32 -march=core2 -msse4.1 -O3 -fomit-frame-pointer avx
gcc -m32 -march=core2 -msse4.1 -O -fomit-frame-pointer avx
gcc -m32 -march=core2 -msse4.1 -Os -fomit-frame-pointer avx
gcc -m32 -march=core2 -msse4 -O2 -fomit-frame-pointer avx
gcc -m32 -march=core2 -msse4 -O3 -fomit-frame-pointer avx
gcc -m32 -march=core2 -msse4 -O -fomit-frame-pointer avx
gcc -m32 -march=core2 -msse4 -Os -fomit-frame-pointer avx
gcc -m32 -march=corei7 -O2 -fomit-frame-pointer avx
gcc -m32 -march=corei7 -O3 -fomit-frame-pointer avx
gcc -m32 -march=corei7 -O -fomit-frame-pointer avx
gcc -m32 -march=corei7 -Os -fomit-frame-pointer avx
gcc -m32 -march=k8 -O2 -fomit-frame-pointer avx
gcc -m32 -march=k8 -O3 -fomit-frame-pointer avx
gcc -m32 -march=k8 -O -fomit-frame-pointer avx
gcc -m32 -march=k8 -Os -fomit-frame-pointer avx
gcc -m32 -march=nocona -O2 -fomit-frame-pointer avx
gcc -m32 -march=nocona -O3 -fomit-frame-pointer avx
gcc -m32 -march=nocona -O -fomit-frame-pointer avx
gcc -m32 -march=nocona -Os -fomit-frame-pointer avx
gcc -m32 -march=pentium-m -O2 -fomit-frame-pointer avx
gcc -m32 -march=pentium-m -O3 -fomit-frame-pointer avx
gcc -m32 -march=pentium-m -O -fomit-frame-pointer avx
gcc -m32 -march=pentium-m -Os -fomit-frame-pointer avx
gcc -m32 -march=pentium3 -O2 -fomit-frame-pointer avx
gcc -m32 -march=pentium3 -O3 -fomit-frame-pointer avx
gcc -m32 -march=pentium3 -O -fomit-frame-pointer avx
gcc -m32 -march=pentium3 -Os -fomit-frame-pointer avx
gcc -m32 -march=pentium4 -O2 -fomit-frame-pointer avx
gcc -m32 -march=pentium4 -O3 -fomit-frame-pointer avx
gcc -m32 -march=pentium4 -O -fomit-frame-pointer avx
gcc -m32 -march=pentium4 -Os -fomit-frame-pointer avx
gcc -m32 -march=prescott -O2 -fomit-frame-pointer avx
gcc -m32 -march=prescott -O3 -fomit-frame-pointer avx
gcc -m32 -march=prescott -O -fomit-frame-pointer avx
gcc -m32 -march=prescott -Os -fomit-frame-pointer avx

Compiler output

Implementation: crypto_core/inv3sntrup653/avx
Compiler: gcc -m32 -march=barcelona -O2 -fomit-frame-pointer
r3_recip.c: r3_recip.c: In function 'vec256_divx_1':
r3_recip.c: r3_recip.c:237:29: warning: implicit declaration of function '_mm_cvtsi128_si64'; did you mean '_mm_cvtsi128_si32'? [-Wimplicit-function-declaration]
r3_recip.c: unsigned long long low0 = _mm_cvtsi128_si64(_mm256_castsi256_si128(f0));
r3_recip.c: ^~~~~~~~~~~~~~~~~
r3_recip.c: _mm_cvtsi128_si32
r3_recip.c: r3_recip.c: In function 'crypto_core_inv3sntrup653_avx':
r3_recip.c: r3_recip.c:362:9: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
r3_recip.c: F0[0] = _mm256_set_epi32(0,0,0,0,0,0,0,1);
r3_recip.c: ~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
r3_recip.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:43:0,
r3_recip.c: from r3_recip.c:3:
r3_recip.c: r3_recip.c: In function 'vec256_frombits':
r3_recip.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx2intrin.h:597:1: error: inlining failed in call to always_inline '_mm256_shuffle_epi32': target specific option mismatch
r3_recip.c: _mm256_shuffle_epi32 (__m256i __A, const int __mask)
r3_recip.c: ^~~~~~~~~~~~~~~~~~~~
r3_recip.c: r3_recip.c:69:7: note: called from here
r3_recip.c: h = _mm256_shuffle_epi32(h,0xd8);
r3_recip.c: ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
r3_recip.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:43:0,
r3_recip.c: from r3_recip.c:3:
r3_recip.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx2intrin.h:1068:1: error: inlining failed in call to always_inline '_mm256_permute4x64_epi64': target specific option mismatch
r3_recip.c: _mm256_permute4x64_epi64 (__m256i __X, const int __M)
r3_recip.c: ^~~~~~~~~~~~~~~~~~~~~~~~
r3_recip.c: r3_recip.c:68:7: note: called from here
r3_recip.c: h = _mm256_permute4x64_epi64(h,0xd8);
r3_recip.c: ...
r3_recip.c: r3_recip.c: In function 'vec256_divx_1':
r3_recip.c: r3_recip.c:237:29: warning: implicit declaration of function '_mm_cvtsi128_si64'; did you mean '_mm_cvtsi128_si32'? [-Wimplicit-function-declaration]
r3_recip.c: unsigned long long low0 = _mm_cvtsi128_si64(_mm256_castsi256_si128(f0));
r3_recip.c: ^~~~~~~~~~~~~~~~~
r3_recip.c: _mm_cvtsi128_si32
r3_recip.c: r3_recip.c: In function 'crypto_core_inv3sntrup653_avx':
r3_recip.c: r3_recip.c:362:9: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
r3_recip.c: F0[0] = _mm256_set_epi32(0,0,0,0,0,0,0,1);
r3_recip.c: ~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
r3_recip.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:43:0,
r3_recip.c: from r3_recip.c:3:
r3_recip.c: r3_recip.c: In function 'vec256_frombits':
r3_recip.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx2intrin.h:597:1: error: inlining failed in call to always_inline '_mm256_shuffle_epi32': target specific option mismatch
r3_recip.c: _mm256_shuffle_epi32 (__m256i __A, const int __mask)
r3_recip.c: ^~~~~~~~~~~~~~~~~~~~
r3_recip.c: r3_recip.c:69:7: note: called from here
r3_recip.c: h = _mm256_shuffle_epi32(h,0xd8);
r3_recip.c: ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
r3_recip.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:43:0,
r3_recip.c: from r3_recip.c:3:
r3_recip.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx2intrin.h:1068:1: error: inlining failed in call to always_inline '_mm256_permute4x64_epi64': target specific option mismatch
r3_recip.c: _mm256_permute4x64_epi64 (__m256i __X, const int __M)
r3_recip.c: ^~~~~~~~~~~~~~~~~~~~~~~~
r3_recip.c: r3_recip.c:68:7: note: called from here
r3_recip.c: h = _mm256_permute4x64_epi64(h,0xd8);
r3_recip.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -m32 -march=barcelona -O2 -fomit-frame-pointer avx
gcc -m32 -march=barcelona -O3 -fomit-frame-pointer avx
gcc -m32 -march=barcelona -O -fomit-frame-pointer avx
gcc -m32 -march=barcelona -Os -fomit-frame-pointer avx

Compiler output

Implementation: crypto_core/inv3sntrup653/avx
Compiler: gcc -m32 -march=core-avx-i -O2 -fomit-frame-pointer
r3_recip.c: r3_recip.c: In function 'vec256_divx_1':
r3_recip.c: r3_recip.c:237:29: warning: implicit declaration of function '_mm_cvtsi128_si64'; did you mean '_mm_cvtsi128_si32'? [-Wimplicit-function-declaration]
r3_recip.c: unsigned long long low0 = _mm_cvtsi128_si64(_mm256_castsi256_si128(f0));
r3_recip.c: ^~~~~~~~~~~~~~~~~
r3_recip.c: _mm_cvtsi128_si32
r3_recip.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:43:0,
r3_recip.c: from r3_recip.c:3:
r3_recip.c: r3_recip.c: In function 'vec256_frombits':
r3_recip.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx2intrin.h:597:1: error: inlining failed in call to always_inline '_mm256_shuffle_epi32': target specific option mismatch
r3_recip.c: _mm256_shuffle_epi32 (__m256i __A, const int __mask)
r3_recip.c: ^~~~~~~~~~~~~~~~~~~~
r3_recip.c: r3_recip.c:69:7: note: called from here
r3_recip.c: h = _mm256_shuffle_epi32(h,0xd8);
r3_recip.c: ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
r3_recip.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:43:0,
r3_recip.c: from r3_recip.c:3:
r3_recip.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx2intrin.h:1068:1: error: inlining failed in call to always_inline '_mm256_permute4x64_epi64': target specific option mismatch
r3_recip.c: _mm256_permute4x64_epi64 (__m256i __X, const int __M)
r3_recip.c: ^~~~~~~~~~~~~~~~~~~~~~~~
r3_recip.c: r3_recip.c:68:7: note: called from here
r3_recip.c: h = _mm256_permute4x64_epi64(h,0xd8);
r3_recip.c: ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
r3_recip.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:43:0,
r3_recip.c: from r3_recip.c:3:
r3_recip.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx2intrin.h:588:1: error: inlining failed in call to always_inline '_mm256_shuffle_epi8': target specific option mismatch
r3_recip.c: ...

Number of similar (compiler,implementation) pairs: 8, namely:
CompilerImplementations
gcc -m32 -march=core-avx-i -O2 -fomit-frame-pointer avx
gcc -m32 -march=core-avx-i -O3 -fomit-frame-pointer avx
gcc -m32 -march=core-avx-i -O -fomit-frame-pointer avx
gcc -m32 -march=core-avx-i -Os -fomit-frame-pointer avx
gcc -m32 -march=corei7-avx -O2 -fomit-frame-pointer avx
gcc -m32 -march=corei7-avx -O3 -fomit-frame-pointer avx
gcc -m32 -march=corei7-avx -O -fomit-frame-pointer avx
gcc -m32 -march=corei7-avx -Os -fomit-frame-pointer avx

Compiler output

Implementation: crypto_core/inv3sntrup653/avx
Compiler: gcc -m32 -march=core-avx2 -O2 -fomit-frame-pointer
r3_recip.c: r3_recip.c: In function 'vec256_divx_1':
r3_recip.c: r3_recip.c:237:29: warning: implicit declaration of function '_mm_cvtsi128_si64'; did you mean '_mm_cvtsi128_si32'? [-Wimplicit-function-declaration]
r3_recip.c: unsigned long long low0 = _mm_cvtsi128_si64(_mm256_castsi256_si128(f0));
r3_recip.c: ^~~~~~~~~~~~~~~~~
r3_recip.c: _mm_cvtsi128_si32
try.c: crypto_core_inv3sntrup653.a(r3_recip.o): In function `crypto_core_inv3sntrup653_avx':
try.c: r3_recip.c:(.text+0x...): undefined reference to `_mm_cvtsi128_si64'
try.c: r3_recip.c:(.text+0x...): undefined reference to `_mm_cvtsi128_si64'
try.c: r3_recip.c:(.text+0x...): undefined reference to `_mm_cvtsi128_si64'
try.c: r3_recip.c:(.text+0x...): undefined reference to `_mm_cvtsi128_si64'
try.c: r3_recip.c:(.text+0x...): undefined reference to `_mm_cvtsi128_si64'
try.c: crypto_core_inv3sntrup653.a(r3_recip.o):r3_recip.c:(.text+0x...): more undefined references to `_mm_cvtsi128_si64' follow
try.c: collect2: error: ld returned 1 exit status

Number of similar (compiler,implementation) pairs: 6, namely:
CompilerImplementations
gcc -m32 -march=core-avx2 -O2 -fomit-frame-pointer avx
gcc -m32 -march=core-avx2 -O3 -fomit-frame-pointer avx
gcc -m32 -march=core-avx2 -O -fomit-frame-pointer avx
gcc -m32 -march=native -mtune=native -O2 -fomit-frame-pointer avx
gcc -m32 -march=native -mtune=native -O3 -fomit-frame-pointer avx
gcc -m32 -march=native -mtune=native -O -fomit-frame-pointer avx

Compiler output

Implementation: crypto_core/inv3sntrup653/avx
Compiler: gcc -m32 -march=core-avx2 -Os -fomit-frame-pointer
r3_recip.c: r3_recip.c: In function 'vec256_divx_1':
r3_recip.c: r3_recip.c:237:29: warning: implicit declaration of function '_mm_cvtsi128_si64'; did you mean '_mm_cvtsi128_si32'? [-Wimplicit-function-declaration]
r3_recip.c: unsigned long long low0 = _mm_cvtsi128_si64(_mm256_castsi256_si128(f0));
r3_recip.c: ^~~~~~~~~~~~~~~~~
r3_recip.c: _mm_cvtsi128_si32
try.c: crypto_core_inv3sntrup653.a(r3_recip.o): In function `vec256_timesx_2':
try.c: r3_recip.c:(.text+0x...): undefined reference to `_mm_cvtsi128_si64'
try.c: r3_recip.c:(.text+0x...): undefined reference to `_mm_cvtsi128_si64'
try.c: crypto_core_inv3sntrup653.a(r3_recip.o): In function `vec256_divx_3':
try.c: r3_recip.c:(.text+0x...): undefined reference to `_mm_cvtsi128_si64'
try.c: r3_recip.c:(.text+0x...): undefined reference to `_mm_cvtsi128_si64'
try.c: r3_recip.c:(.text+0x...): undefined reference to `_mm_cvtsi128_si64'
try.c: crypto_core_inv3sntrup653.a(r3_recip.o):r3_recip.c:(.text+0x...): more undefined references to `_mm_cvtsi128_si64' follow
try.c: collect2: error: ld returned 1 exit status

Number of similar (compiler,implementation) pairs: 2, namely:
CompilerImplementations
gcc -m32 -march=core-avx2 -Os -fomit-frame-pointer avx
gcc -m32 -march=native -mtune=native -Os -fomit-frame-pointer avx