Implementation notes: x86, titan0, crypto_kem/ntruhps2048509

Computer: titan0
Architecture: x86
CPU ID: GenuineIntel-000306c3-bfebfbff
SUPERCOP version: 20190803
Operation: crypto_kem
Primitive: ntruhps2048509
Time | Implementation | Compiler | Benchmark date | SUPERCOP version
1618844 | ref | gcc -funroll-loops -m32 -march=pentium4 -O2 -fomit-frame-pointer | 20190810 | 20190803
1628788 | ref | gcc -funroll-loops -m32 -march=pentium4 -O3 -fomit-frame-pointer | 20190810 | 20190803
1698320 | ref | gcc -funroll-loops -m32 -march=nocona -O2 -fomit-frame-pointer | 20190810 | 20190803
1698840 | ref | gcc -funroll-loops -m32 -march=prescott -O2 -fomit-frame-pointer | 20190810 | 20190803
1706492 | ref | gcc -funroll-loops -m32 -O2 -fomit-frame-pointer | 20190810 | 20190803
1710368 | ref | gcc -funroll-loops -m32 -march=athlon -O2 -fomit-frame-pointer | 20190810 | 20190803
1710520 | ref | gcc -funroll-loops -m32 -march=nocona -O3 -fomit-frame-pointer | 20190810 | 20190803
1710992 | ref | gcc -funroll-loops -m32 -march=prescott -O3 -fomit-frame-pointer | 20190810 | 20190803
1715000 | ref | gcc -funroll-loops -m32 -march=k8 -O2 -fomit-frame-pointer | 20190810 | 20190803
1725836 | ref | gcc -funroll-loops -m32 -march=k8 -O3 -fomit-frame-pointer | 20190810 | 20190803
1731600 | ref | gcc -funroll-loops -m32 -march=barcelona -O3 -fomit-frame-pointer | 20190810 | 20190803
1736380 | ref | gcc -funroll-loops -m32 -march=barcelona -O2 -fomit-frame-pointer | 20190810 | 20190803
1744128 | ref | gcc -funroll-loops -m32 -O3 -fomit-frame-pointer | 20190810 | 20190803
1751488 | ref | gcc -funroll-loops -m32 -march=athlon -O3 -fomit-frame-pointer | 20190810 | 20190803
2126232 | ref | gcc -m32 -march=core2 -msse4.1 -O3 -fomit-frame-pointer | 20190810 | 20190803
2132700 | ref | gcc -m32 -march=corei7 -O3 -fomit-frame-pointer | 20190810 | 20190803
2132784 | ref | gcc -m32 -march=core2 -msse4 -O2 -fomit-frame-pointer | 20190810 | 20190803
2132824 | ref | gcc -m32 -march=nocona -O3 -fomit-frame-pointer | 20190810 | 20190803
2134124 | ref | gcc -m32 -march=core-avx-i -O3 -fomit-frame-pointer | 20190810 | 20190803
2135788 | ref | gcc -m32 -march=core2 -msse4.1 -O2 -fomit-frame-pointer | 20190810 | 20190803
2136208 | ref | gcc -m32 -march=corei7 -O2 -fomit-frame-pointer | 20190810 | 20190803
2136748 | ref | gcc -m32 -march=core2 -O3 -fomit-frame-pointer | 20190810 | 20190803
2136920 | ref | gcc -m32 -march=corei7-avx -O3 -fomit-frame-pointer | 20190810 | 20190803
2139212 | ref | gcc -m32 -march=corei7-avx -O2 -fomit-frame-pointer | 20190810 | 20190803
2141772 | ref | gcc -m32 -march=prescott -O3 -fomit-frame-pointer | 20190810 | 20190803
2141792 | ref | gcc -m32 -march=core-avx-i -O2 -fomit-frame-pointer | 20190810 | 20190803
2143944 | ref | gcc -m32 -march=core2 -O2 -fomit-frame-pointer | 20190810 | 20190803
2145284 | ref | gcc -m32 -march=core2 -msse4 -O3 -fomit-frame-pointer | 20190810 | 20190803
2146600 | ref | gcc -m32 -march=nocona -O2 -fomit-frame-pointer | 20190810 | 20190803
2149416 | ref | gcc -m32 -march=prescott -O2 -fomit-frame-pointer | 20190810 | 20190803
2168188 | ref | gcc -m32 -march=pentium4 -O2 -fomit-frame-pointer | 20190810 | 20190803
2169340 | ref | gcc -m32 -march=core-avx2 -O3 -fomit-frame-pointer | 20190810 | 20190803
2178072 | ref | gcc -m32 -march=core-avx2 -O2 -fomit-frame-pointer | 20190810 | 20190803
2178508 | ref | gcc -m32 -march=native -mtune=native -O2 -fomit-frame-pointer | 20190810 | 20190803
2180300 | ref | gcc -m32 -march=native -mtune=native -O3 -fomit-frame-pointer | 20190810 | 20190803
2217644 | ref | gcc -m32 -march=pentium4 -O3 -fomit-frame-pointer | 20190810 | 20190803
2509884 | ref | gcc -funroll-loops -m32 -march=pentium3 -O2 -fomit-frame-pointer | 20190810 | 20190803
2510336 | ref | gcc -funroll-loops -m32 -march=pentiumpro -O2 -fomit-frame-pointer | 20190810 | 20190803
2510512 | ref | gcc -funroll-loops -m32 -march=pentium2 -O2 -fomit-frame-pointer | 20190810 | 20190803
2516336 | ref | gcc -funroll-loops -m32 -march=pentium-m -O2 -fomit-frame-pointer | 20190810 | 20190803
2520584 | ref | gcc -funroll-loops -m32 -march=pentium-m -O3 -fomit-frame-pointer | 20190810 | 20190803
2542072 | ref | gcc -funroll-loops -m32 -march=pentium3 -O3 -fomit-frame-pointer | 20190810 | 20190803
2543880 | ref | gcc -funroll-loops -m32 -march=pentiumpro -O3 -fomit-frame-pointer | 20190810 | 20190803
2553016 | ref | gcc -funroll-loops -m32 -march=pentium2 -O3 -fomit-frame-pointer | 20190810 | 20190803
2596324 | ref | gcc -m32 -march=k8 -O3 -fomit-frame-pointer | 20190810 | 20190803
2606608 | ref | gcc -m32 -march=k8 -O2 -fomit-frame-pointer | 20190810 | 20190803
2610404 | ref | gcc -m32 -march=barcelona -O3 -fomit-frame-pointer | 20190810 | 20190803
2626008 | ref | gcc -m32 -march=athlon -O3 -fomit-frame-pointer | 20190810 | 20190803
2628172 | ref | gcc -m32 -march=barcelona -O2 -fomit-frame-pointer | 20190810 | 20190803
2632452 | ref | gcc -m32 -march=athlon -O2 -fomit-frame-pointer | 20190810 | 20190803
2635836 | ref | gcc -m32 -O3 -fomit-frame-pointer | 20190810 | 20190803
2651524 | ref | gcc -m32 -O2 -fomit-frame-pointer | 20190810 | 20190803
2955920 | ref | gcc -funroll-loops -m32 -march=i386 -O2 -fomit-frame-pointer | 20190810 | 20190803
2958788 | ref | gcc -funroll-loops -m32 -march=i486 -O2 -fomit-frame-pointer | 20190810 | 20190803
2974720 | ref | gcc -funroll-loops -m32 -march=i386 -O3 -fomit-frame-pointer | 20190810 | 20190803
3002624 | ref | gcc -funroll-loops -m32 -march=i486 -O3 -fomit-frame-pointer | 20190810 | 20190803
3023936 | ref | gcc -m32 -march=pentium-m -O3 -fomit-frame-pointer | 20190810 | 20190803
3030628 | ref | gcc -m32 -march=pentium3 -O3 -fomit-frame-pointer | 20190810 | 20190803
3034656 | ref | gcc -m32 -march=pentium-m -O2 -fomit-frame-pointer | 20190810 | 20190803
3035548 | ref | gcc -m32 -march=pentium3 -O2 -fomit-frame-pointer | 20190810 | 20190803
3036008 | ref | gcc -m32 -march=pentiumpro -O2 -fomit-frame-pointer | 20190810 | 20190803
3036420 | ref | gcc -m32 -march=pentium2 -O2 -fomit-frame-pointer | 20190810 | 20190803
3040420 | ref | gcc -m32 -march=pentiumpro -O3 -fomit-frame-pointer | 20190810 | 20190803
3049808 | ref | gcc -m32 -march=pentium2 -O3 -fomit-frame-pointer | 20190810 | 20190803
3147940 | ref | gcc -funroll-loops -m32 -march=k6-2 -O3 -fomit-frame-pointer | 20190810 | 20190803
3148456 | ref | gcc -m32 -march=k6-2 -O3 -fomit-frame-pointer | 20190810 | 20190803
3149404 | ref | gcc -m32 -march=k6-3 -O3 -fomit-frame-pointer | 20190810 | 20190803
3154720 | ref | gcc -funroll-loops -m32 -march=k6-3 -O3 -fomit-frame-pointer | 20190810 | 20190803
3200636 | ref | gcc -funroll-loops -m32 -march=k6-3 -O2 -fomit-frame-pointer | 20190810 | 20190803
3203000 | ref | gcc -funroll-loops -m32 -march=k6-2 -O2 -fomit-frame-pointer | 20190810 | 20190803
3204152 | ref | gcc -funroll-loops -m32 -march=k6 -O2 -fomit-frame-pointer | 20190810 | 20190803
3264168 | ref | gcc -funroll-loops -m32 -march=k6 -O3 -fomit-frame-pointer | 20190810 | 20190803
3509532 | ref | gcc -funroll-loops -m32 -march=pentium -O2 -fomit-frame-pointer | 20190810 | 20190803
3514536 | ref | gcc -funroll-loops -m32 -march=pentium-mmx -O2 -fomit-frame-pointer | 20190810 | 20190803
3561176 | ref | gcc -funroll-loops -m32 -march=pentium-mmx -O3 -fomit-frame-pointer | 20190810 | 20190803
3563676 | ref | gcc -funroll-loops -m32 -march=pentium -O3 -fomit-frame-pointer | 20190810 | 20190803
4513116 | ref | gcc -m32 -march=i386 -O2 -fomit-frame-pointer | 20190810 | 20190803
4515044 | ref | gcc -m32 -march=i486 -O2 -fomit-frame-pointer | 20190810 | 20190803
4542304 | ref | gcc -m32 -march=i386 -O3 -fomit-frame-pointer | 20190810 | 20190803
4547224 | ref | gcc -m32 -march=i486 -O3 -fomit-frame-pointer | 20190810 | 20190803
4621248 | ref | gcc -m32 -march=pentium-mmx -O2 -fomit-frame-pointer | 20190810 | 20190803
4636048 | ref | gcc -m32 -march=pentium -O2 -fomit-frame-pointer | 20190810 | 20190803
4646420 | ref | gcc -m32 -march=pentium-mmx -O3 -fomit-frame-pointer | 20190810 | 20190803
4646776 | ref | gcc -m32 -march=pentium -O3 -fomit-frame-pointer | 20190810 | 20190803
4920708 | ref | gcc -m32 -march=k6-3 -O2 -fomit-frame-pointer | 20190810 | 20190803
4922524 | ref | gcc -m32 -march=k6 -O3 -fomit-frame-pointer | 20190810 | 20190803
4930024 | ref | gcc -m32 -march=k6 -O2 -fomit-frame-pointer | 20190810 | 20190803
4935184 | ref | gcc -m32 -march=k6-2 -O2 -fomit-frame-pointer | 20190810 | 20190803
6261240 | ref | gcc -m32 -march=i386 -O -fomit-frame-pointer | 20190810 | 20190803
6270116 | ref | gcc -m32 -march=pentium -O -fomit-frame-pointer | 20190810 | 20190803
6274796 | ref | gcc -m32 -march=pentium-mmx -O -fomit-frame-pointer | 20190810 | 20190803
6281136 | ref | gcc -m32 -march=k6-2 -O -fomit-frame-pointer | 20190810 | 20190803
6282148 | ref | gcc -m32 -march=i486 -O -fomit-frame-pointer | 20190810 | 20190803
6282768 | ref | gcc -m32 -march=k6-3 -O -fomit-frame-pointer | 20190810 | 20190803
6293964 | ref | gcc -m32 -march=nocona -O -fomit-frame-pointer | 20190810 | 20190803
6296832 | ref | gcc -m32 -march=corei7 -O -fomit-frame-pointer | 20190810 | 20190803
6298648 | ref | gcc -m32 -march=athlon -O -fomit-frame-pointer | 20190810 | 20190803
6300564 | ref | gcc -m32 -march=pentium4 -O -fomit-frame-pointer | 20190810 | 20190803
6303552 | ref | gcc -m32 -march=prescott -O -fomit-frame-pointer | 20190810 | 20190803
6305328 | ref | gcc -m32 -O -fomit-frame-pointer | 20190810 | 20190803
6305508 | ref | gcc -m32 -march=core2 -msse4 -O -fomit-frame-pointer | 20190810 | 20190803
6305644 | ref | gcc -m32 -march=core2 -O -fomit-frame-pointer | 20190810 | 20190803
6307740 | ref | gcc -m32 -march=native -mtune=native -O -fomit-frame-pointer | 20190810 | 20190803
6311192 | ref | gcc -m32 -march=core-avx-i -O -fomit-frame-pointer | 20190810 | 20190803
6311744 | ref | gcc -m32 -march=corei7-avx -O -fomit-frame-pointer | 20190810 | 20190803
6317276 | ref | gcc -funroll-loops -m32 -march=pentium2 -O -fomit-frame-pointer | 20190810 | 20190803
6317856 | ref | gcc -funroll-loops -m32 -march=k6-2 -O -fomit-frame-pointer | 20190810 | 20190803
6317880 | ref | gcc -m32 -march=core-avx2 -O -fomit-frame-pointer | 20190810 | 20190803
6323468 | ref | gcc -m32 -march=barcelona -O -fomit-frame-pointer | 20190810 | 20190803
6324980 | ref | gcc -m32 -march=core2 -msse4.1 -O -fomit-frame-pointer | 20190810 | 20190803
6325812 | ref | gcc -m32 -march=k6 -O -fomit-frame-pointer | 20190810 | 20190803
6328684 | ref | gcc -funroll-loops -m32 -march=k6-3 -O -fomit-frame-pointer | 20190810 | 20190803
6329548 | ref | gcc -funroll-loops -m32 -march=prescott -O -fomit-frame-pointer | 20190810 | 20190803
6330204 | ref | gcc -funroll-loops -m32 -march=pentium4 -O -fomit-frame-pointer | 20190810 | 20190803
6331444 | ref | gcc -funroll-loops -m32 -march=athlon -O -fomit-frame-pointer | 20190810 | 20190803
6332512 | ref | gcc -funroll-loops -m32 -march=pentium -O -fomit-frame-pointer | 20190810 | 20190803
6336128 | ref | gcc -funroll-loops -m32 -march=i386 -O -fomit-frame-pointer | 20190810 | 20190803
6343716 | ref | gcc -funroll-loops -m32 -march=i486 -O -fomit-frame-pointer | 20190810 | 20190803
6344784 | ref | gcc -funroll-loops -m32 -march=k6 -O -fomit-frame-pointer | 20190810 | 20190803
6345044 | ref | gcc -funroll-loops -m32 -march=pentium-mmx -O -fomit-frame-pointer | 20190810 | 20190803
6346124 | ref | gcc -funroll-loops -m32 -march=nocona -O -fomit-frame-pointer | 20190810 | 20190803
6347548 | ref | gcc -funroll-loops -m32 -march=pentium3 -O -fomit-frame-pointer | 20190810 | 20190803
6348220 | ref | gcc -funroll-loops -m32 -O -fomit-frame-pointer | 20190810 | 20190803
6350744 | ref | gcc -funroll-loops -m32 -march=barcelona -O -fomit-frame-pointer | 20190810 | 20190803
6353920 | ref | gcc -funroll-loops -m32 -march=k8 -O -fomit-frame-pointer | 20190810 | 20190803
6354360 | ref | gcc -funroll-loops -m32 -march=pentiumpro -O -fomit-frame-pointer | 20190810 | 20190803
6366884 | ref | gcc -m32 -march=k8 -O -fomit-frame-pointer | 20190810 | 20190803
6375492 | ref | gcc -m32 -march=pentium2 -O -fomit-frame-pointer | 20190810 | 20190803
6378356 | ref | gcc -funroll-loops -m32 -march=pentium-m -O -fomit-frame-pointer | 20190810 | 20190803
6382460 | ref | gcc -m32 -march=pentiumpro -O -fomit-frame-pointer | 20190810 | 20190803
6387348 | ref | gcc -m32 -march=pentium3 -O -fomit-frame-pointer | 20190810 | 20190803
6414116 | ref | gcc -m32 -march=pentium-m -O -fomit-frame-pointer | 20190810 | 20190803
7447712 | ref | gcc -m32 -march=pentium3 -Os -fomit-frame-pointer | 20190810 | 20190803
7457132 | ref | gcc -m32 -march=pentiumpro -Os -fomit-frame-pointer | 20190810 | 20190803
7471972 | ref | gcc -m32 -march=pentium-m -Os -fomit-frame-pointer | 20190810 | 20190803
7476780 | ref | gcc -m32 -march=pentium2 -Os -fomit-frame-pointer | 20190810 | 20190803
7596408 | ref | gcc -m32 -march=i486 -Os -fomit-frame-pointer | 20190810 | 20190803
7608432 | ref | gcc -m32 -march=nocona -Os -fomit-frame-pointer | 20190810 | 20190803
7615540 | ref | gcc -m32 -march=prescott -Os -fomit-frame-pointer | 20190810 | 20190803
7629104 | ref | gcc -m32 -march=pentium4 -Os -fomit-frame-pointer | 20190810 | 20190803
7630060 | ref | gcc -m32 -march=i386 -Os -fomit-frame-pointer | 20190810 | 20190803
7640256 | ref | gcc -m32 -march=corei7-avx -Os -fomit-frame-pointer | 20190810 | 20190803
7644716 | ref | gcc -m32 -march=core2 -Os -fomit-frame-pointer | 20190810 | 20190803
7649424 | ref | gcc -m32 -Os -fomit-frame-pointer | 20190810 | 20190803
7654520 | ref | gcc -m32 -march=athlon -Os -fomit-frame-pointer | 20190810 | 20190803
7661228 | ref | gcc -m32 -march=core-avx-i -Os -fomit-frame-pointer | 20190810 | 20190803
7662180 | ref | gcc -m32 -march=pentium -Os -fomit-frame-pointer | 20190810 | 20190803
7666836 | ref | gcc -m32 -march=pentium-mmx -Os -fomit-frame-pointer | 20190810 | 20190803
7676012 | ref | gcc -m32 -march=corei7 -Os -fomit-frame-pointer | 20190810 | 20190803
7683716 | ref | gcc -m32 -march=k8 -Os -fomit-frame-pointer | 20190810 | 20190803
7684412 | ref | gcc -m32 -march=core2 -msse4.1 -Os -fomit-frame-pointer | 20190810 | 20190803
7690708 | ref | gcc -m32 -march=core2 -msse4 -Os -fomit-frame-pointer | 20190810 | 20190803
7690920 | ref | gcc -m32 -march=barcelona -Os -fomit-frame-pointer | 20190810 | 20190803
7703200 | ref | gcc -m32 -march=native -mtune=native -Os -fomit-frame-pointer | 20190810 | 20190803
7720924 | ref | gcc -m32 -march=core-avx2 -Os -fomit-frame-pointer | 20190810 | 20190803
7823292 | ref | gcc -funroll-loops -m32 -march=pentium3 -Os -fomit-frame-pointer | 20190810 | 20190803
7826416 | ref | gcc -funroll-loops -m32 -march=pentiumpro -Os -fomit-frame-pointer | 20190810 | 20190803
7859252 | ref | gcc -funroll-loops -m32 -march=pentium-m -Os -fomit-frame-pointer | 20190810 | 20190803
7885068 | ref | gcc -funroll-loops -m32 -march=pentium2 -Os -fomit-frame-pointer | 20190810 | 20190803
7981916 | ref | gcc -funroll-loops -m32 -march=pentium-mmx -Os -fomit-frame-pointer | 20190810 | 20190803
7984616 | ref | gcc -funroll-loops -m32 -march=pentium -Os -fomit-frame-pointer | 20190810 | 20190803
7991276 | ref | gcc -funroll-loops -m32 -march=athlon -Os -fomit-frame-pointer | 20190810 | 20190803
8005796 | ref | gcc -funroll-loops -m32 -march=pentium4 -Os -fomit-frame-pointer | 20190810 | 20190803
8015036 | ref | gcc -funroll-loops -m32 -march=k8 -Os -fomit-frame-pointer | 20190810 | 20190803
8027472 | ref | gcc -funroll-loops -m32 -march=i386 -Os -fomit-frame-pointer | 20190810 | 20190803
8033056 | ref | gcc -funroll-loops -m32 -Os -fomit-frame-pointer | 20190810 | 20190803
8043196 | ref | gcc -funroll-loops -m32 -march=nocona -Os -fomit-frame-pointer | 20190810 | 20190803
8046876 | ref | gcc -funroll-loops -m32 -march=prescott -Os -fomit-frame-pointer | 20190810 | 20190803
8077204 | ref | gcc -funroll-loops -m32 -march=barcelona -Os -fomit-frame-pointer | 20190810 | 20190803
8092620 | ref | gcc -funroll-loops -m32 -march=i486 -Os -fomit-frame-pointer | 20190810 | 20190803
9594984 | ref | gcc -m32 -march=k6-3 -Os -fomit-frame-pointer | 20190810 | 20190803
9599976 | ref | gcc -m32 -march=k6-2 -Os -fomit-frame-pointer | 20190810 | 20190803
9604320 | ref | gcc -funroll-loops -m32 -march=k6-3 -Os -fomit-frame-pointer | 20190810 | 20190803
9615352 | ref | gcc -funroll-loops -m32 -march=k6-2 -Os -fomit-frame-pointer | 20190810 | 20190803
9624136 | ref | gcc -m32 -march=k6 -Os -fomit-frame-pointer | 20190810 | 20190803
9636620 | ref | gcc -funroll-loops -m32 -march=k6 -Os -fomit-frame-pointer | 20190810 | 20190803

Compiler output

Implementation: crypto_kem/ntruhps2048509/avx2
Compiler: gcc -funroll-loops -m32 -O2 -fomit-frame-pointer
poly_r2_inv.c: poly_r2_inv.c: In function 'poly_R2_tobytes':
poly_r2_inv.c: poly_r2_inv.c:21:14: warning: implicit declaration of function '_pext_u64'; did you mean '_pext_u32'? [-Wimplicit-function-declaration]
poly_r2_inv.c: out[i] = _pext_u64(coeff_pointer[2*i], 0x1000100010001);
poly_r2_inv.c: ^~~~~~~~~
poly_r2_inv.c: _pext_u32
poly_r2_inv.c: poly_r2_inv.c: In function 'poly_R2_frombytes':
poly_r2_inv.c: poly_r2_inv.c:39:26: warning: implicit declaration of function '_pdep_u64'; did you mean '_pdep_u32'? [-Wimplicit-function-declaration]
poly_r2_inv.c: coeff_pointer[2*i] = _pdep_u64(in[i], 0x1000100010001);
poly_r2_inv.c: ^~~~~~~~~
poly_r2_inv.c: _pdep_u32
poly_s3_inv.c: poly_s3_inv.c: In function 'vec256_divx_1':
poly_s3_inv.c: poly_s3_inv.c:246:29: warning: implicit declaration of function '_mm_cvtsi128_si64'; did you mean '_mm_cvtsi128_si32'? [-Wimplicit-function-declaration]
poly_s3_inv.c: unsigned long long low0 = _mm_cvtsi128_si64(_mm256_castsi256_si128(f0));
poly_s3_inv.c: ^~~~~~~~~~~~~~~~~
poly_s3_inv.c: _mm_cvtsi128_si32
poly_s3_inv.c: poly_s3_inv.c: In function '__poly_S3_inv':
poly_s3_inv.c: poly_s3_inv.c:325:9: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
poly_s3_inv.c: F0[0] = _mm256_set_epi32(4294967295,4294967295,4294967295,4294967295,4294967295,4294967295,4294967295,4294967295);
poly_s3_inv.c: ~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
poly_s3_inv.c: poly_s3_inv.c: In function 'vec256_timesx_1':
poly_s3_inv.c: poly_s3_inv.c:277:29: warning: SSE vector return without SSE enabled changes the ABI [-Wpsabi]
poly_s3_inv.c: unsigned long long low0 = _mm_cvtsi128_si64(_mm256_castsi256_si128(f0));
poly_s3_inv.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
poly_s3_inv.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:43:0,
poly_s3_inv.c: from poly_s3_inv.c:12:
poly_s3_inv.c: poly_s3_inv.c: In function 'vec256_frombits':
poly_s3_inv.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx2intrin.h:597:1: error: inlining failed in call to always_inline '_mm256_shuffle_epi32': target specific option mismatch
poly_s3_inv.c: _mm256_shuffle_epi32 (__m256i __A, const int __mask)
poly_s3_inv.c: ^~~~~~~~~~~~~~~~~~~~
poly_s3_inv.c: poly_s3_inv.c:78:7: note: called from here
poly_s3_inv.c: h = _mm256_shuffle_epi32(h,0xd8);
poly_s3_inv.c: ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
poly_s3_inv.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:43:0,
poly_s3_inv.c: from poly_s3_inv.c:12:
poly_s3_inv.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx2intrin.h:1068:1: error: inlining failed in call to always_inline '_mm256_permute4x64_epi64': target specific option mismatch
poly_s3_inv.c: ...

Number of similar (compiler,implementation) pairs: 88, namely:
Compiler Implementations
gcc -funroll-loops -m32 -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=athlon -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=athlon -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=athlon -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=athlon -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=i386 -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=i386 -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=i386 -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=i386 -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=i486 -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=i486 -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=i486 -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=i486 -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k6-2 -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k6-2 -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k6-2 -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k6-2 -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k6-3 -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k6-3 -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k6-3 -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k6-3 -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k6 -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k6 -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k6 -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k6 -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium-mmx -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium-mmx -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium-mmx -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium-mmx -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium2 -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium2 -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium2 -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium2 -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentiumpro -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentiumpro -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentiumpro -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentiumpro -Os -fomit-frame-pointer avx2
gcc -m32 -O2 -fomit-frame-pointer avx2
gcc -m32 -O3 -fomit-frame-pointer avx2
gcc -m32 -O -fomit-frame-pointer avx2
gcc -m32 -Os -fomit-frame-pointer avx2
gcc -m32 -march=athlon -O2 -fomit-frame-pointer avx2
gcc -m32 -march=athlon -O3 -fomit-frame-pointer avx2
gcc -m32 -march=athlon -O -fomit-frame-pointer avx2
gcc -m32 -march=athlon -Os -fomit-frame-pointer avx2
gcc -m32 -march=i386 -O2 -fomit-frame-pointer avx2
gcc -m32 -march=i386 -O3 -fomit-frame-pointer avx2
gcc -m32 -march=i386 -O -fomit-frame-pointer avx2
gcc -m32 -march=i386 -Os -fomit-frame-pointer avx2
gcc -m32 -march=i486 -O2 -fomit-frame-pointer avx2
gcc -m32 -march=i486 -O3 -fomit-frame-pointer avx2
gcc -m32 -march=i486 -O -fomit-frame-pointer avx2
gcc -m32 -march=i486 -Os -fomit-frame-pointer avx2
gcc -m32 -march=k6-2 -O2 -fomit-frame-pointer avx2
gcc -m32 -march=k6-2 -O3 -fomit-frame-pointer avx2
gcc -m32 -march=k6-2 -O -fomit-frame-pointer avx2
gcc -m32 -march=k6-2 -Os -fomit-frame-pointer avx2
gcc -m32 -march=k6-3 -O2 -fomit-frame-pointer avx2
gcc -m32 -march=k6-3 -O3 -fomit-frame-pointer avx2
gcc -m32 -march=k6-3 -O -fomit-frame-pointer avx2
gcc -m32 -march=k6-3 -Os -fomit-frame-pointer avx2
gcc -m32 -march=k6 -O2 -fomit-frame-pointer avx2
gcc -m32 -march=k6 -O3 -fomit-frame-pointer avx2
gcc -m32 -march=k6 -O -fomit-frame-pointer avx2
gcc -m32 -march=k6 -Os -fomit-frame-pointer avx2
gcc -m32 -march=pentium-mmx -O2 -fomit-frame-pointer avx2
gcc -m32 -march=pentium-mmx -O3 -fomit-frame-pointer avx2
gcc -m32 -march=pentium-mmx -O -fomit-frame-pointer avx2
gcc -m32 -march=pentium-mmx -Os -fomit-frame-pointer avx2
gcc -m32 -march=pentium2 -O2 -fomit-frame-pointer avx2
gcc -m32 -march=pentium2 -O3 -fomit-frame-pointer avx2
gcc -m32 -march=pentium2 -O -fomit-frame-pointer avx2
gcc -m32 -march=pentium2 -Os -fomit-frame-pointer avx2
gcc -m32 -march=pentium -O2 -fomit-frame-pointer avx2
gcc -m32 -march=pentium -O3 -fomit-frame-pointer avx2
gcc -m32 -march=pentium -O -fomit-frame-pointer avx2
gcc -m32 -march=pentium -Os -fomit-frame-pointer avx2
gcc -m32 -march=pentiumpro -O2 -fomit-frame-pointer avx2
gcc -m32 -march=pentiumpro -O3 -fomit-frame-pointer avx2
gcc -m32 -march=pentiumpro -O -fomit-frame-pointer avx2
gcc -m32 -march=pentiumpro -Os -fomit-frame-pointer avx2
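
Note on the `_pext_u64` / `_pdep_u64` warnings in the compiler output above: GCC suggests the `_u32` variants, which is consistent with the 64-bit BMI2 intrinsics being declared only for 64-bit targets, so they are unavailable under `-m32`. The sketch below is a portable software equivalent for illustration only. The names `pext64_soft` and `pdep64_soft` are hypothetical and are not part of SUPERCOP or of the ntruhps2048509 sources.

```c
#include <stdint.h>

/* Hypothetical portable stand-in for _pext_u64: gather the bits of src
 * selected by mask into the low bits of the result.
 * The loop count depends only on mask, but the inner test branches on
 * src, so this is an illustration and not constant-time code. */
uint64_t pext64_soft(uint64_t src, uint64_t mask)
{
    uint64_t out = 0;
    unsigned k = 0;                        /* next output bit position */
    for (uint64_t m = mask; m != 0; m &= m - 1) {
        uint64_t lowest = m & (~m + 1);    /* isolate lowest set mask bit */
        if (src & lowest)
            out |= (uint64_t)1 << k;
        k++;
    }
    return out;
}

/* Hypothetical portable stand-in for _pdep_u64: scatter the low bits of
 * src to the bit positions selected by mask. */
uint64_t pdep64_soft(uint64_t src, uint64_t mask)
{
    uint64_t out = 0;
    unsigned k = 0;                        /* next input bit position */
    for (uint64_t m = mask; m != 0; m &= m - 1) {
        uint64_t lowest = m & (~m + 1);    /* isolate lowest set mask bit */
        if ((src >> k) & 1)
            out |= lowest;
        k++;
    }
    return out;
}
```

With the mask `0x1000100010001` that appears in `poly_r2_inv.c`, `pext64_soft` packs bits 0, 16, 32 and 48 of its input into a 4-bit value, and `pdep64_soft` reverses that packing. The logged errors about `_mm256_shuffle_epi32` and `_mm256_permute4x64_epi64` are a separate matter: those intrinsics need AVX2 code generation, which none of the compiler option sets listed above enable.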

Compiler output

Implementation: crypto_kem/ntruhps2048509/avx2
Compiler: gcc -funroll-loops -m32 -march=barcelona -O2 -fomit-frame-pointer
poly_r2_inv.c: poly_r2_inv.c: In function 'poly_R2_tobytes':
poly_r2_inv.c: poly_r2_inv.c:21:14: warning: implicit declaration of function '_pext_u64'; did you mean '_pext_u32'? [-Wimplicit-function-declaration]
poly_r2_inv.c: out[i] = _pext_u64(coeff_pointer[2*i], 0x1000100010001);
poly_r2_inv.c: ^~~~~~~~~
poly_r2_inv.c: _pext_u32
poly_r2_inv.c: poly_r2_inv.c: In function 'poly_R2_frombytes':
poly_r2_inv.c: poly_r2_inv.c:39:26: warning: implicit declaration of function '_pdep_u64'; did you mean '_pdep_u32'? [-Wimplicit-function-declaration]
poly_r2_inv.c: coeff_pointer[2*i] = _pdep_u64(in[i], 0x1000100010001);
poly_r2_inv.c: ^~~~~~~~~
poly_r2_inv.c: _pdep_u32
poly_s3_inv.c: poly_s3_inv.c: In function 'vec256_divx_1':
poly_s3_inv.c: poly_s3_inv.c:246:29: warning: implicit declaration of function '_mm_cvtsi128_si64'; did you mean '_mm_cvtsi128_si32'? [-Wimplicit-function-declaration]
poly_s3_inv.c: unsigned long long low0 = _mm_cvtsi128_si64(_mm256_castsi256_si128(f0));
poly_s3_inv.c: ^~~~~~~~~~~~~~~~~
poly_s3_inv.c: _mm_cvtsi128_si32
poly_s3_inv.c: poly_s3_inv.c: In function '__poly_S3_inv':
poly_s3_inv.c: poly_s3_inv.c:325:9: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
poly_s3_inv.c: F0[0] = _mm256_set_epi32(4294967295,4294967295,4294967295,4294967295,4294967295,4294967295,4294967295,4294967295);
poly_s3_inv.c: ~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
poly_s3_inv.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:43:0,
poly_s3_inv.c: from poly_s3_inv.c:12:
poly_s3_inv.c: poly_s3_inv.c: In function 'vec256_frombits':
poly_s3_inv.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx2intrin.h:597:1: error: inlining failed in call to always_inline '_mm256_shuffle_epi32': target specific option mismatch
poly_s3_inv.c: _mm256_shuffle_epi32 (__m256i __A, const int __mask)
poly_s3_inv.c: ^~~~~~~~~~~~~~~~~~~~
poly_s3_inv.c: poly_s3_inv.c:78:7: note: called from here
poly_s3_inv.c: h = _mm256_shuffle_epi32(h,0xd8);
poly_s3_inv.c: ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
poly_s3_inv.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:43:0,
poly_s3_inv.c: from poly_s3_inv.c:12:
poly_s3_inv.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx2intrin.h:1068:1: error: inlining failed in call to always_inline '_mm256_permute4x64_epi64': target specific option mismatch
poly_s3_inv.c: _mm256_permute4x64_epi64 (__m256i __X, const int __M)
poly_s3_inv.c: ^~~~~~~~~~~~~~~~~~~~~~~~
poly_s3_inv.c: poly_s3_inv.c:77:7: note: called from here
poly_s3_inv.c: h = _mm256_permute4x64_epi64(h,0xd8);
poly_s3_inv.c: ...

Number of similar (compiler,implementation) pairs: 68, namely:
Compiler Implementations
gcc -funroll-loops -m32 -march=barcelona -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=barcelona -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=barcelona -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=barcelona -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k8 -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k8 -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k8 -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=k8 -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=nocona -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=nocona -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=nocona -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=nocona -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium-m -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium-m -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium-m -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium-m -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium3 -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium3 -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium3 -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium3 -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium4 -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium4 -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium4 -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=pentium4 -Os -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=prescott -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=prescott -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=prescott -O -fomit-frame-pointer avx2
gcc -funroll-loops -m32 -march=prescott -Os -fomit-frame-pointer avx2
gcc -m32 -march=core2 -O2 -fomit-frame-pointer avx2
gcc -m32 -march=core2 -O3 -fomit-frame-pointer avx2
gcc -m32 -march=core2 -O -fomit-frame-pointer avx2
gcc -m32 -march=core2 -Os -fomit-frame-pointer avx2
gcc -m32 -march=core2 -msse4.1 -O2 -fomit-frame-pointer avx2
gcc -m32 -march=core2 -msse4.1 -O3 -fomit-frame-pointer avx2
gcc -m32 -march=core2 -msse4.1 -O -fomit-frame-pointer avx2
gcc -m32 -march=core2 -msse4.1 -Os -fomit-frame-pointer avx2
gcc -m32 -march=core2 -msse4 -O2 -fomit-frame-pointer avx2
gcc -m32 -march=core2 -msse4 -O3 -fomit-frame-pointer avx2
gcc -m32 -march=core2 -msse4 -O -fomit-frame-pointer avx2
gcc -m32 -march=core2 -msse4 -Os -fomit-frame-pointer avx2
gcc -m32 -march=corei7 -O2 -fomit-frame-pointer avx2
gcc -m32 -march=corei7 -O3 -fomit-frame-pointer avx2
gcc -m32 -march=corei7 -O -fomit-frame-pointer avx2
gcc -m32 -march=corei7 -Os -fomit-frame-pointer avx2
gcc -m32 -march=k8 -O2 -fomit-frame-pointer avx2
gcc -m32 -march=k8 -O3 -fomit-frame-pointer avx2
gcc -m32 -march=k8 -O -fomit-frame-pointer avx2
gcc -m32 -march=k8 -Os -fomit-frame-pointer avx2
gcc -m32 -march=nocona -O2 -fomit-frame-pointer avx2
gcc -m32 -march=nocona -O3 -fomit-frame-pointer avx2
gcc -m32 -march=nocona -O -fomit-frame-pointer avx2
gcc -m32 -march=nocona -Os -fomit-frame-pointer avx2
gcc -m32 -march=pentium-m -O2 -fomit-frame-pointer avx2
gcc -m32 -march=pentium-m -O3 -fomit-frame-pointer avx2
gcc -m32 -march=pentium-m -O -fomit-frame-pointer avx2
gcc -m32 -march=pentium-m -Os -fomit-frame-pointer avx2
gcc -m32 -march=pentium3 -O2 -fomit-frame-pointer avx2
gcc -m32 -march=pentium3 -O3 -fomit-frame-pointer avx2
gcc -m32 -march=pentium3 -O -fomit-frame-pointer avx2
gcc -m32 -march=pentium3 -Os -fomit-frame-pointer avx2
gcc -m32 -march=pentium4 -O2 -fomit-frame-pointer avx2
gcc -m32 -march=pentium4 -O3 -fomit-frame-pointer avx2
gcc -m32 -march=pentium4 -O -fomit-frame-pointer avx2
gcc -m32 -march=pentium4 -Os -fomit-frame-pointer avx2
gcc -m32 -march=prescott -O2 -fomit-frame-pointer avx2
gcc -m32 -march=prescott -O3 -fomit-frame-pointer avx2
gcc -m32 -march=prescott -O -fomit-frame-pointer avx2
gcc -m32 -march=prescott -Os -fomit-frame-pointer avx2

Compiler output

Implementation: crypto_kem/ntruhps2048509/avx2
Compiler: gcc -m32 -march=barcelona -O2 -fomit-frame-pointer
poly_r2_inv.c: poly_r2_inv.c: In function 'poly_R2_tobytes':
poly_r2_inv.c: poly_r2_inv.c:21:14: warning: implicit declaration of function '_pext_u64'; did you mean '_pext_u32'? [-Wimplicit-function-declaration]
poly_r2_inv.c: out[i] = _pext_u64(coeff_pointer[2*i], 0x1000100010001);
poly_r2_inv.c: ^~~~~~~~~
poly_r2_inv.c: _pext_u32
poly_r2_inv.c: poly_r2_inv.c: In function 'poly_R2_frombytes':
poly_r2_inv.c: poly_r2_inv.c:39:26: warning: implicit declaration of function '_pdep_u64'; did you mean '_pdep_u32'? [-Wimplicit-function-declaration]
poly_r2_inv.c: coeff_pointer[2*i] = _pdep_u64(in[i], 0x1000100010001);
poly_r2_inv.c: ^~~~~~~~~
poly_r2_inv.c: _pdep_u32
poly_s3_inv.c: poly_s3_inv.c: In function 'vec256_divx_1':
poly_s3_inv.c: poly_s3_inv.c:246:29: warning: implicit declaration of function '_mm_cvtsi128_si64'; did you mean '_mm_cvtsi128_si32'? [-Wimplicit-function-declaration]
poly_s3_inv.c: unsigned long long low0 = _mm_cvtsi128_si64(_mm256_castsi256_si128(f0));
poly_s3_inv.c: ^~~~~~~~~~~~~~~~~
poly_s3_inv.c: _mm_cvtsi128_si32
poly_s3_inv.c: poly_s3_inv.c: In function '__poly_S3_inv':
poly_s3_inv.c: poly_s3_inv.c:325:9: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
poly_s3_inv.c: F0[0] = _mm256_set_epi32(4294967295,4294967295,4294967295,4294967295,4294967295,4294967295,4294967295,4294967295);
poly_s3_inv.c: ~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
poly_s3_inv.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:43:0,
poly_s3_inv.c: from poly_s3_inv.c:12:
poly_s3_inv.c: poly_s3_inv.c: In function 'vec256_frombits':
poly_s3_inv.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx2intrin.h:597:1: error: inlining failed in call to always_inline '_mm256_shuffle_epi32': target specific option mismatch
poly_s3_inv.c: _mm256_shuffle_epi32 (__m256i __A, const int __mask)
poly_s3_inv.c: ^~~~~~~~~~~~~~~~~~~~
poly_s3_inv.c: poly_s3_inv.c:78:7: note: called from here
poly_s3_inv.c: h = _mm256_shuffle_epi32(h,0xd8);
poly_s3_inv.c: ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
poly_s3_inv.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:43:0,
poly_s3_inv.c: from poly_s3_inv.c:12:
poly_s3_inv.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx2intrin.h:1068:1: error: inlining failed in call to always_inline '_mm256_permute4x64_epi64': target specific option mismatch
poly_s3_inv.c: _mm256_permute4x64_epi64 (__m256i __X, const int __M)
poly_s3_inv.c: ^~~~~~~~~~~~~~~~~~~~~~~~
poly_s3_inv.c: poly_s3_inv.c:77:7: note: called from here
poly_s3_inv.c: h = _mm256_permute4x64_epi64(h,0xd8);
poly_s3_inv.c: ...
poly_r2_inv.c: poly_r2_inv.c: In function 'poly_R2_tobytes':
poly_r2_inv.c: poly_r2_inv.c:21:14: warning: implicit declaration of function '_pext_u64'; did you mean '_pext_u32'? [-Wimplicit-function-declaration]
poly_r2_inv.c: out[i] = _pext_u64(coeff_pointer[2*i], 0x1000100010001);
poly_r2_inv.c: ^~~~~~~~~
poly_r2_inv.c: _pext_u32
poly_r2_inv.c: poly_r2_inv.c: In function 'poly_R2_frombytes':
poly_r2_inv.c: poly_r2_inv.c:39:26: warning: implicit declaration of function '_pdep_u64'; did you mean '_pdep_u32'? [-Wimplicit-function-declaration]
poly_r2_inv.c: coeff_pointer[2*i] = _pdep_u64(in[i], 0x1000100010001);
poly_r2_inv.c: ^~~~~~~~~
poly_r2_inv.c: _pdep_u32
poly_s3_inv.c: poly_s3_inv.c: In function 'vec256_divx_1':
poly_s3_inv.c: poly_s3_inv.c:246:29: warning: implicit declaration of function '_mm_cvtsi128_si64'; did you mean '_mm_cvtsi128_si32'? [-Wimplicit-function-declaration]
poly_s3_inv.c: unsigned long long low0 = _mm_cvtsi128_si64(_mm256_castsi256_si128(f0));
poly_s3_inv.c: ^~~~~~~~~~~~~~~~~
poly_s3_inv.c: _mm_cvtsi128_si32
poly_s3_inv.c: poly_s3_inv.c: In function '__poly_S3_inv':
poly_s3_inv.c: poly_s3_inv.c:325:9: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
poly_s3_inv.c: F0[0] = _mm256_set_epi32(4294967295,4294967295,4294967295,4294967295,4294967295,4294967295,4294967295,4294967295);
poly_s3_inv.c: ~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
poly_s3_inv.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:43:0,
poly_s3_inv.c: from poly_s3_inv.c:12:
poly_s3_inv.c: poly_s3_inv.c: In function 'vec256_frombits':
poly_s3_inv.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx2intrin.h:597:1: error: inlining failed in call to always_inline '_mm256_shuffle_epi32': target specific option mismatch
poly_s3_inv.c: _mm256_shuffle_epi32 (__m256i __A, const int __mask)
poly_s3_inv.c: ^~~~~~~~~~~~~~~~~~~~
poly_s3_inv.c: poly_s3_inv.c:78:7: note: called from here
poly_s3_inv.c: h = _mm256_shuffle_epi32(h,0xd8);
poly_s3_inv.c: ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
poly_s3_inv.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:43:0,
poly_s3_inv.c: from poly_s3_inv.c:12:
poly_s3_inv.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx2intrin.h:1068:1: error: inlining failed in call to always_inline '_mm256_permute4x64_epi64': target specific option mismatch
poly_s3_inv.c: _mm256_permute4x64_epi64 (__m256i __X, const int __M)
poly_s3_inv.c: ^~~~~~~~~~~~~~~~~~~~~~~~
poly_s3_inv.c: poly_s3_inv.c:77:7: note: called from here
poly_s3_inv.c: h = _mm256_permute4x64_epi64(h,0xd8);
poly_s3_inv.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
Compiler Implementations
gcc -m32 -march=barcelona -O2 -fomit-frame-pointer avx2
gcc -m32 -march=barcelona -O3 -fomit-frame-pointer avx2
gcc -m32 -march=barcelona -O -fomit-frame-pointer avx2
gcc -m32 -march=barcelona -Os -fomit-frame-pointer avx2

Compiler output

Implementation: crypto_kem/ntruhps2048509/avx2
Compiler: gcc -m32 -march=core-avx-i -O2 -fomit-frame-pointer
poly_r2_inv.c: poly_r2_inv.c: In function 'poly_R2_tobytes':
poly_r2_inv.c: poly_r2_inv.c:21:14: warning: implicit declaration of function '_pext_u64'; did you mean '_pext_u32'? [-Wimplicit-function-declaration]
poly_r2_inv.c: out[i] = _pext_u64(coeff_pointer[2*i], 0x1000100010001);
poly_r2_inv.c: ^~~~~~~~~
poly_r2_inv.c: _pext_u32
poly_r2_inv.c: poly_r2_inv.c: In function 'poly_R2_frombytes':
poly_r2_inv.c: poly_r2_inv.c:39:26: warning: implicit declaration of function '_pdep_u64'; did you mean '_pdep_u32'? [-Wimplicit-function-declaration]
poly_r2_inv.c: coeff_pointer[2*i] = _pdep_u64(in[i], 0x1000100010001);
poly_r2_inv.c: ^~~~~~~~~
poly_r2_inv.c: _pdep_u32
poly_s3_inv.c: poly_s3_inv.c: In function 'vec256_divx_1':
poly_s3_inv.c: poly_s3_inv.c:246:29: warning: implicit declaration of function '_mm_cvtsi128_si64'; did you mean '_mm_cvtsi128_si32'? [-Wimplicit-function-declaration]
poly_s3_inv.c: unsigned long long low0 = _mm_cvtsi128_si64(_mm256_castsi256_si128(f0));
poly_s3_inv.c: ^~~~~~~~~~~~~~~~~
poly_s3_inv.c: _mm_cvtsi128_si32
poly_s3_inv.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:43:0,
poly_s3_inv.c: from poly_s3_inv.c:12:
poly_s3_inv.c: poly_s3_inv.c: In function 'vec256_frombits':
poly_s3_inv.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx2intrin.h:597:1: error: inlining failed in call to always_inline '_mm256_shuffle_epi32': target specific option mismatch
poly_s3_inv.c: _mm256_shuffle_epi32 (__m256i __A, const int __mask)
poly_s3_inv.c: ^~~~~~~~~~~~~~~~~~~~
poly_s3_inv.c: poly_s3_inv.c:78:7: note: called from here
poly_s3_inv.c: h = _mm256_shuffle_epi32(h,0xd8);
poly_s3_inv.c: ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
poly_s3_inv.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:43:0,
poly_s3_inv.c: from poly_s3_inv.c:12:
poly_s3_inv.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx2intrin.h:1068:1: error: inlining failed in call to always_inline '_mm256_permute4x64_epi64': target specific option mismatch
poly_s3_inv.c: _mm256_permute4x64_epi64 (__m256i __X, const int __M)
poly_s3_inv.c: ^~~~~~~~~~~~~~~~~~~~~~~~
poly_s3_inv.c: poly_s3_inv.c:77:7: note: called from here
poly_s3_inv.c: h = _mm256_permute4x64_epi64(h,0xd8);
poly_s3_inv.c: ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
poly_s3_inv.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/7/include/immintrin.h:43:0,
poly_s3_inv.c: from poly_s3_inv.c:12:
poly_s3_inv.c: /usr/lib/gcc/x86_64-linux-gnu/7/include/avx2intrin.h:588:1: error: inlining failed in call to always_inline '_mm256_shuffle_epi8': target specific option mismatch
poly_s3_inv.c: ...

Number of similar (compiler,implementation) pairs: 8, namely:
Compiler Implementations
gcc -m32 -march=core-avx-i -O2 -fomit-frame-pointer avx2
gcc -m32 -march=core-avx-i -O3 -fomit-frame-pointer avx2
gcc -m32 -march=core-avx-i -O -fomit-frame-pointer avx2
gcc -m32 -march=core-avx-i -Os -fomit-frame-pointer avx2
gcc -m32 -march=corei7-avx -O2 -fomit-frame-pointer avx2
gcc -m32 -march=corei7-avx -O3 -fomit-frame-pointer avx2
gcc -m32 -march=corei7-avx -O -fomit-frame-pointer avx2
gcc -m32 -march=corei7-avx -Os -fomit-frame-pointer avx2

Compiler output

Implementation: crypto_kem/ntruhps2048509/avx2
Compiler: gcc -m32 -march=core-avx2 -O2 -fomit-frame-pointer
poly_r2_inv.c: poly_r2_inv.c: In function 'poly_R2_tobytes':
poly_r2_inv.c: poly_r2_inv.c:21:14: warning: implicit declaration of function '_pext_u64'; did you mean '_pext_u32'? [-Wimplicit-function-declaration]
poly_r2_inv.c: out[i] = _pext_u64(coeff_pointer[2*i], 0x1000100010001);
poly_r2_inv.c: ^~~~~~~~~
poly_r2_inv.c: _pext_u32
poly_r2_inv.c: poly_r2_inv.c: In function 'poly_R2_frombytes':
poly_r2_inv.c: poly_r2_inv.c:39:26: warning: implicit declaration of function '_pdep_u64'; did you mean '_pdep_u32'? [-Wimplicit-function-declaration]
poly_r2_inv.c: coeff_pointer[2*i] = _pdep_u64(in[i], 0x1000100010001);
poly_r2_inv.c: ^~~~~~~~~
poly_r2_inv.c: _pdep_u32
poly_s3_inv.c: poly_s3_inv.c: In function 'vec256_divx_1':
poly_s3_inv.c: poly_s3_inv.c:246:29: warning: implicit declaration of function '_mm_cvtsi128_si64'; did you mean '_mm_cvtsi128_si32'? [-Wimplicit-function-declaration]
poly_s3_inv.c: unsigned long long low0 = _mm_cvtsi128_si64(_mm256_castsi256_si128(f0));
poly_s3_inv.c: ^~~~~~~~~~~~~~~~~
poly_s3_inv.c: _mm_cvtsi128_si32
poly_r2_mul.S: poly_r2_mul.S: Assembler messages:
poly_r2_mul.S: poly_r2_mul.S:111: Error: bad register name `%rsi)'
poly_r2_mul.S: poly_r2_mul.S:112: Error: bad register name `%rsi)'
poly_r2_mul.S: poly_r2_mul.S:113: Error: bad register name `%rdx)'
poly_r2_mul.S: poly_r2_mul.S:114: Error: bad register name `%rdx)'
poly_r2_mul.S: poly_r2_mul.S:117: Error: bad register name `%xmm11'
poly_r2_mul.S: poly_r2_mul.S:118: Error: bad register name `%xmm12'
poly_r2_mul.S: poly_r2_mul.S:119: Error: bad register name `%xmm11'
poly_r2_mul.S: poly_r2_mul.S:120: Error: bad register name `%xmm11'
poly_r2_mul.S: poly_r2_mul.S:121: Error: bad register name `%xmm11'
poly_r2_mul.S: poly_r2_mul.S:122: Error: bad register name `%xmm14'
poly_r2_mul.S: poly_r2_mul.S:123: Error: bad register name `%xmm11'
poly_r2_mul.S: poly_r2_mul.S:124: Error: bad register name `%ymm14'
poly_r2_mul.S: poly_r2_mul.S:125: Error: bad register name `%xmm15'
poly_r2_mul.S: poly_r2_mul.S:126: Error: bad register name `%rip)'
poly_r2_mul.S: poly_r2_mul.S:127: Error: bad register name `%rip)'
poly_r2_mul.S: poly_r2_mul.S:128: Error: bad register name `%rip)'
poly_r2_mul.S: poly_r2_mul.S:129: Error: bad register name `%ymm14'
poly_r2_mul.S: poly_r2_mul.S:130: Error: bad register name `%ymm14'
poly_r2_mul.S: poly_r2_mul.S:131: Error: bad register name `%xmm11'
poly_r2_mul.S: poly_r2_mul.S:132: Error: bad register name `%xmm12'
poly_r2_mul.S: poly_r2_mul.S:133: Error: bad register name `%xmm11'
poly_r2_mul.S: poly_r2_mul.S:134: Error: bad register name `%xmm11'
poly_r2_mul.S: poly_r2_mul.S:135: Error: bad register name `%xmm11'
poly_r2_mul.S: poly_r2_mul.S:136: Error: bad register name `%xmm13'
poly_r2_mul.S: ...

Number of similar (compiler,implementation) pairs: 8, namely:
Compiler Implementations
gcc -m32 -march=core-avx2 -O2 -fomit-frame-pointer avx2
gcc -m32 -march=core-avx2 -O3 -fomit-frame-pointer avx2
gcc -m32 -march=core-avx2 -O -fomit-frame-pointer avx2
gcc -m32 -march=core-avx2 -Os -fomit-frame-pointer avx2
gcc -m32 -march=native -mtune=native -O2 -fomit-frame-pointer avx2
gcc -m32 -march=native -mtune=native -O3 -fomit-frame-pointer avx2
gcc -m32 -march=native -mtune=native -O -fomit-frame-pointer avx2
gcc -m32 -march=native -mtune=native -Os -fomit-frame-pointer avx2