Implementation notes: amd64, oki, crypto_sign/lattisigns512

Computer: oki
Architecture: amd64
CPU ID: GenuineIntel-00050654-bfebfbff
SUPERCOP version: 20181123
Operation: crypto_sign
Primitive: lattisigns512
Time | Object size | Test size | Implementation | Compiler | Benchmark date | SUPERCOP version
475650 | ? ? ? | ? ? ? | avx | gcc_-m64_-march=core-avx2_-O3_-fomit-frame-pointer | 20190114 | 20181123
480598 | ? ? ? | ? ? ? | avx | gcc_-m64_-march=native_-mtune=native_-O3_-fomit-frame-pointer | 20190114 | 20181123
493520 | ? ? ? | ? ? ? | avx | gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv | 20190114 | 20181123
496710 | ? ? ? | ? ? ? | avx | gcc_-m64_-march=core-avx2_-O2_-fomit-frame-pointer | 20190114 | 20181123
515378 | ? ? ? | ? ? ? | avx | gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv | 20190114 | 20181123
525060 | ? ? ? | ? ? ? | avx | gcc_-m64_-march=core-avx2_-Os_-fomit-frame-pointer | 20190114 | 20181123
529060 | ? ? ? | ? ? ? | avx | gcc_-m64_-march=native_-mtune=native_-Os_-fomit-frame-pointer | 20190114 | 20181123
536764 | ? ? ? | ? ? ? | avx | gcc_-m64_-march=native_-mtune=native_-O2_-fomit-frame-pointer | 20190114 | 20181123
539080 | ? ? ? | ? ? ? | avx | clang_-O3_-fwrapv_-mavx_-fomit-frame-pointer_-Qunused-arguments | 20190114 | 20181123
539762 | ? ? ? | ? ? ? | avx | clang_-O3_-fwrapv_-mavx_-maes_-mpclmul_-fomit-frame-pointer_-Qunused-arguments | 20190114 | 20181123
554614 | ? ? ? | ? ? ? | avx | gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv | 20190114 | 20181123
557578 | ? ? ? | ? ? ? | avx | gcc_-m64_-march=core-avx2_-O_-fomit-frame-pointer | 20190114 | 20181123
559744 | ? ? ? | ? ? ? | avx | gcc_-m64_-march=core-avx-i_-O2_-fomit-frame-pointer | 20190114 | 20181123
561132 | ? ? ? | ? ? ? | avx | gcc_-m64_-march=core-avx-i_-O3_-fomit-frame-pointer | 20190114 | 20181123
561268 | ? ? ? | ? ? ? | avx | gcc_-m64_-march=corei7-avx_-O3_-fomit-frame-pointer | 20190114 | 20181123
563972 | ? ? ? | ? ? ? | avx | clang_-O3_-fwrapv_-march=x86-64_-mcpu=core-avx2_-mavx2_-maes_-mpclmul_-fomit-frame-pointer_-Qunused-arguments | 20190114 | 20181123
565734 | ? ? ? | ? ? ? | avx | gcc_-m64_-march=corei7-avx_-O2_-fomit-frame-pointer | 20190114 | 20181123
570790 | ? ? ? | ? ? ? | avx | clang_-O3_-fwrapv_-mavx2_-fomit-frame-pointer_-Qunused-arguments | 20190114 | 20181123
572042 | ? ? ? | ? ? ? | avx | gcc_-m64_-march=corei7-avx_-Os_-fomit-frame-pointer | 20190114 | 20181123
572502 | ? ? ? | ? ? ? | avx | gcc_-m64_-march=core-avx-i_-Os_-fomit-frame-pointer | 20190114 | 20181123
572718 | ? ? ? | ? ? ? | avx | gcc_-m64_-march=core-avx-i_-O_-fomit-frame-pointer | 20190114 | 20181123
572886 | ? ? ? | ? ? ? | avx | gcc_-m64_-march=corei7-avx_-O_-fomit-frame-pointer | 20190114 | 20181123
583612 | ? ? ? | ? ? ? | avx | gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv | 20190114 | 20181123
591794 | ? ? ? | ? ? ? | avx | gcc_-m64_-march=native_-mtune=native_-O_-fomit-frame-pointer | 20190114 | 20181123

Compiler output

Implementation: avx
Security model: unknown
Compiler: cc
ntt_transform.c: ntt_transform.c: In function 'ntt_transform':
ntt_transform.c: ntt_transform.c:24:3: error: unknown type name '__m256d'
ntt_transform.c: __m256d vt,vo0,vo10,vo11,vo20,vo21,vo22,vo23,vc,vp,vpinv,neg2,neg4;
ntt_transform.c: ^
ntt_transform.c: ntt_transform.c:25:3: error: unknown type name '__m256d'
ntt_transform.c: __m256d vx0,vx1,vx2,vx3,vx4,vx5,vx6,vx7;
ntt_transform.c: ^

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler  Implementations
cc avx
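
The `unknown type name '__m256d'` errors above mean the AVX vector types were not declared when `ntt_transform.c` was compiled with plain `cc`. A minimal sketch of one common way to keep such a file compilable in that situation is to guard the intrinsics with the `__AVX__` macro, which GCC and clang predefine when AVX code generation is enabled, and to fall back to scalar code otherwise. The helper `scale_doubles` is hypothetical and is not part of lattisigns512; it only illustrates the guard.

```c
#include <stddef.h>

#if defined(__AVX__)
#include <immintrin.h>  /* only pulled in when AVX code generation is enabled */
#endif

/* Hypothetical helper (not from lattisigns512): out[i] = in[i] * c. */
void scale_doubles(double *out, const double *in, double c, size_t n)
{
    size_t i = 0;
#if defined(__AVX__)
    /* Vector path: four doubles per iteration, unaligned loads and stores. */
    __m256d vc = _mm256_set1_pd(c);
    for (; i + 4 <= n; i += 4) {
        __m256d vx = _mm256_loadu_pd(in + i);
        _mm256_storeu_pd(out + i, _mm256_mul_pd(vx, vc));
    }
#endif
    /* Scalar path: handles the tail, or everything when AVX is disabled. */
    for (; i < n; i++)
        out[i] = in[i] * c;
}
```

With this layout the same source builds under `cc` with no flags and under the `-mavx` or `-march=core-avx2` lines from the timing table; only the latter take the vector path.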

Compiler output

Implementation: avx
Security model: unknown
Compiler: clang -O3 -fomit-frame-pointer -Qunused-arguments
ntt_transform.c: ntt_transform.c:27:11: error: always_inline function '_mm256_set_pd' requires target feature 'sse4.2', but would be inlined into function 'ntt_transform' that is compiled without support for 'sse4.2'
ntt_transform.c: vpinv = _mm256_set_pd(PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE);
ntt_transform.c: ^
ntt_transform.c: ntt_transform.c:28:11: error: always_inline function '_mm256_set_pd' requires target feature 'sse4.2', but would be inlined into function 'ntt_transform' that is compiled without support for 'sse4.2'
ntt_transform.c: vp = _mm256_set_pd(8383489., 8383489., 8383489., 8383489.);
ntt_transform.c: ^
ntt_transform.c: ntt_transform.c:32:10: error: always_inline function '_mm256_load_pd' requires target feature 'sse4.2', but would be inlined into function 'ntt_transform' that is compiled without support for 'sse4.2'
ntt_transform.c: vo10 = _mm256_load_pd(o+pos);
ntt_transform.c: ^
ntt_transform.c: ntt_transform.c:33:10: error: always_inline function '_mm256_load_pd' requires target feature 'sse4.2', but would be inlined into function 'ntt_transform' that is compiled without support for 'sse4.2'
ntt_transform.c: vo20 = _mm256_load_pd(o+pos+4);
ntt_transform.c: ^
ntt_transform.c: ntt_transform.c:34:10: error: always_inline function '_mm256_load_pd' requires target feature 'sse4.2', but would be inlined into function 'ntt_transform' that is compiled without support for 'sse4.2'
ntt_transform.c: neg2 = _mm256_load_pd(_neg2);
ntt_transform.c: ^
ntt_transform.c: ntt_transform.c:35:10: error: always_inline function '_mm256_load_pd' requires target feature 'sse4.2', but would be inlined into function 'ntt_transform' that is compiled without support for 'sse4.2'
ntt_transform.c: neg4 = _mm256_load_pd(_neg4);
ntt_transform.c: ^
ntt_transform.c: ntt_transform.c:41:11: error: always_inline function '_mm256_load_pd' requires target feature 'sse4.2', but would be inlined into function 'ntt_transform' that is compiled without support for 'sse4.2'
ntt_transform.c: vx0 = _mm256_load_pd(out+s);
ntt_transform.c: ^
ntt_transform.c: ntt_transform.c:42:10: error: always_inline function '_mm256_mul_pd' requires target feature 'sse4.2', but would be inlined into function 'ntt_transform' that is compiled without support for 'sse4.2'
ntt_transform.c: vt = _mm256_mul_pd(vx0,neg2);
ntt_transform.c: ^
ntt_transform.c: ntt_transform.c:43:11: error: always_inline function '_mm256_hadd_pd' requires target feature 'sse4.2', but would be inlined into function 'ntt_transform' that is compiled without support for 'sse4.2'
ntt_transform.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
Compiler  Implementations
clang -O3 -fomit-frame-pointer -Qunused-arguments avx
clang -mcpu=cortex-a8 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments avx
clang -mcpu=cortex-a9 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments avx
clang -mcpu=native -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments avx

Compiler output

Implementation: avx
Security model: unknown
Compiler: clang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments
try.c: fatal error: error in backend: Cannot select: 0x185f0a0: v4i64 = X86ISD::VTRUNC 0x185eed0
try.c: 0x185eed0: v16i32 = vselect 0x1841a90, 0x17f6a90, 0x185eda0
try.c: 0x1841a90: v4i1 = X86ISD::PCMPGTM 0x1848990, 0x17e8d30
try.c: 0x1848990: v4i64 = X86ISD::VBROADCAST 0x17e90c0
try.c: 0x17e90c0: i64,ch = load<LD8[%uglygep72]> 0x1742dd0, 0x17dc140, undef:i64
try.c: 0x17dc140: i64 = add 0x1849870, 0x17f2280
try.c: 0x1849870: i64 = X86ISD::Wrapper TargetGlobalAddress:i64<[64 x i64]* @cycles> 0
try.c: 0x183f0f0: i64 = TargetGlobalAddress<[64 x i64]* @cycles> 0
try.c: 0x17f2280: i64 = shl 0x183e8a0, Constant:i8<3>
try.c: 0x183e8a0: i64,ch = CopyFromReg 0x1742dd0, Register:i64 %vreg50
try.c: 0x1849d30: i64 = Register %vreg50
try.c: 0x17f3880: i8 = Constant<3>
try.c: 0x1847000: i64 = undef
try.c: 0x17e8d30: v4i64,ch = CopyFromReg 0x1742dd0, Register:v4i64 %vreg13
try.c: 0x17faeb0: v4i64 = Register %vreg13
try.c: 0x17f6a90: v16i32 = X86ISD::VBROADCAST 0x183eb00
try.c: 0x183eb00: i32,ch = load<LD4[ConstantPool]> 0x1742dd0, 0x185ea10, undef:i64
try.c: 0x185ea10: i64 = X86ISD::Wrapper TargetConstantPool:i64<i32 1> 0
try.c: 0x17f81c0: i64 = TargetConstantPool<i32 1> 0
try.c: 0x1847000: i64 = undef
try.c: 0x185eda0: v16i32 = BUILD_VECTOR Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>
try.c: 0x185ec70: i32 = Constant<0>
try.c: 0x185ec70: i32 = Constant<0>
try.c: 0x185ec70: i32 = Constant<0>
try.c: 0x185ec70: i32 = Constant<0>
try.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler  Implementations
clang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments avx

Compiler output

Implementation: avx
Security model: unknown
Compiler: clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments
try.c: fatal error: error in backend: Cannot select: 0x231c410: v4i64 = X86ISD::VTRUNC 0x2320270
try.c: 0x2320270: v16i32 = vselect 0x2325f00, 0x22b0820, 0x2320140
try.c: 0x2325f00: v4i1 = X86ISD::PCMPGTM 0x2306d10, 0x22a2710
try.c: 0x2306d10: v4i64 = X86ISD::VBROADCAST 0x22a2aa0
try.c: 0x22a2aa0: i64,ch = load<LD8[%uglygep72]> 0x2202db0, 0x22f7230, undef:i64
try.c: 0x22f7230: i64 = add 0x2307bf0, 0x22833f0
try.c: 0x2307bf0: i64 = X86ISD::Wrapper TargetGlobalAddress:i64<[64 x i64]* @cycles> 0
try.c: 0x22e82f0: i64 = TargetGlobalAddress<[64 x i64]* @cycles> 0
try.c: 0x22833f0: i64 = shl 0x22e7aa0, Constant:i8<3>
try.c: 0x22e7aa0: i64,ch = CopyFromReg 0x2202db0, Register:i64 %vreg50
try.c: 0x23080b0: i64 = Register %vreg50
try.c: 0x22849f0: i8 = Constant<3>
try.c: 0x2305380: i64 = undef
try.c: 0x22a2710: v4i64,ch = CopyFromReg 0x2202db0, Register:v4i64 %vreg13
try.c: 0x22b2710: v4i64 = Register %vreg13
try.c: 0x22b0820: v16i32 = X86ISD::VBROADCAST 0x22e7d00
try.c: 0x22e7d00: i32,ch = load<LD4[ConstantPool]> 0x2202db0, 0x231fdb0, undef:i64
try.c: 0x231fdb0: i64 = X86ISD::Wrapper TargetConstantPool:i64<i32 1> 0
try.c: 0x22ca800: i64 = TargetConstantPool<i32 1> 0
try.c: 0x2305380: i64 = undef
try.c: 0x2320140: v16i32 = BUILD_VECTOR Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>
try.c: 0x2320010: i32 = Constant<0>
try.c: 0x2320010: i32 = Constant<0>
try.c: 0x2320010: i32 = Constant<0>
try.c: 0x2320010: i32 = Constant<0>
try.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler  Implementations
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments avx

Compiler output

Implementation: avx
Security model: unknown
Compiler: gcc
ntt_transform.c: ntt_transform.c: In function 'ntt_transform':
ntt_transform.c: ntt_transform.c:27:9: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
ntt_transform.c: vpinv = _mm256_set_pd(PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE);
ntt_transform.c: ~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
ntt_transform.c: In file included from /usr/lib64/gcc/x86_64-suse-linux/7/include/immintrin.h:41:0,
ntt_transform.c: from ntt_transform.c:9:
ntt_transform.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/avxintrin.h:859:1: error: inlining failed in call to always_inline '_mm256_load_pd': target specific option mismatch
ntt_transform.c: _mm256_load_pd (double const *__P)
ntt_transform.c: ^~~~~~~~~~~~~~
ntt_transform.c: ntt_transform.c:35:8: note: called from here
ntt_transform.c: neg4 = _mm256_load_pd(_neg4);
ntt_transform.c: ~~~~~^~~~~~~~~~~~~~~~~~~~~~~
ntt_transform.c: In file included from /usr/lib64/gcc/x86_64-suse-linux/7/include/immintrin.h:41:0,
ntt_transform.c: from ntt_transform.c:9:
ntt_transform.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/avxintrin.h:859:1: error: inlining failed in call to always_inline '_mm256_load_pd': target specific option mismatch
ntt_transform.c: _mm256_load_pd (double const *__P)
ntt_transform.c: ^~~~~~~~~~~~~~
ntt_transform.c: ntt_transform.c:34:8: note: called from here
ntt_transform.c: neg2 = _mm256_load_pd(_neg2);
ntt_transform.c: ~~~~~^~~~~~~~~~~~~~~~~~~~~~~
ntt_transform.c: In file included from /usr/lib64/gcc/x86_64-suse-linux/7/include/immintrin.h:41:0,
ntt_transform.c: from ntt_transform.c:9:
ntt_transform.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/avxintrin.h:859:1: error: inlining failed in call to always_inline '_mm256_load_pd': target specific option mismatch
ntt_transform.c: _mm256_load_pd (double const *__P)
ntt_transform.c: ^~~~~~~~~~~~~~
ntt_transform.c: ...

Number of similar (compiler,implementation) pairs: 86, namely:
Compiler  Implementations
gcc avx
gcc -O2 -fomit-frame-pointer avx
gcc -O3 -fomit-frame-pointer avx
gcc -O -fomit-frame-pointer avx
gcc -Os -fomit-frame-pointer avx
gcc -fno-schedule-insns -O2 -fomit-frame-pointer avx
gcc -fno-schedule-insns -O3 -fomit-frame-pointer avx
gcc -fno-schedule-insns -O -fomit-frame-pointer avx
gcc -fno-schedule-insns -Os -fomit-frame-pointer avx
gcc -funroll-loops avx
gcc -funroll-loops -O2 -fomit-frame-pointer avx
gcc -funroll-loops -O3 -fomit-frame-pointer avx
gcc -funroll-loops -O -fomit-frame-pointer avx
gcc -funroll-loops -Os -fomit-frame-pointer avx
gcc -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer avx
gcc -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer avx
gcc -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer avx
gcc -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer avx
gcc -funroll-loops -m64 -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m64 -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m64 -O -fomit-frame-pointer avx
gcc -funroll-loops -m64 -Os -fomit-frame-pointer avx
gcc -funroll-loops -m64 -march=barcelona -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m64 -march=barcelona -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m64 -march=barcelona -O -fomit-frame-pointer avx
gcc -funroll-loops -m64 -march=barcelona -Os -fomit-frame-pointer avx
gcc -funroll-loops -m64 -march=k8 -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m64 -march=k8 -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m64 -march=k8 -O -fomit-frame-pointer avx
gcc -funroll-loops -m64 -march=k8 -Os -fomit-frame-pointer avx
gcc -funroll-loops -m64 -march=nocona -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m64 -march=nocona -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m64 -march=nocona -O -fomit-frame-pointer avx
gcc -funroll-loops -m64 -march=nocona -Os -fomit-frame-pointer avx
gcc -funroll-loops -march=barcelona -O2 -fomit-frame-pointer avx
gcc -funroll-loops -march=barcelona -O3 -fomit-frame-pointer avx
gcc -funroll-loops -march=barcelona -O -fomit-frame-pointer avx
gcc -funroll-loops -march=barcelona -Os -fomit-frame-pointer avx
gcc -funroll-loops -march=k8 -O2 -fomit-frame-pointer avx
gcc -funroll-loops -march=k8 -O3 -fomit-frame-pointer avx
gcc -funroll-loops -march=k8 -O -fomit-frame-pointer avx
gcc -funroll-loops -march=k8 -Os -fomit-frame-pointer avx
gcc -funroll-loops -march=nocona -O2 -fomit-frame-pointer avx
gcc -funroll-loops -march=nocona -O3 -fomit-frame-pointer avx
gcc -funroll-loops -march=nocona -O -fomit-frame-pointer avx
gcc -funroll-loops -march=nocona -Os -fomit-frame-pointer avx
gcc -m64 -O2 -fomit-frame-pointer avx
gcc -m64 -O3 -fomit-frame-pointer avx
gcc -m64 -O -fomit-frame-pointer avx
gcc -m64 -Os -fomit-frame-pointer avx
gcc -m64 -march=core2 -O2 -fomit-frame-pointer avx
gcc -m64 -march=core2 -O3 -fomit-frame-pointer avx
gcc -m64 -march=core2 -O -fomit-frame-pointer avx
gcc -m64 -march=core2 -Os -fomit-frame-pointer avx
gcc -m64 -march=core2 -msse4.1 -O2 -fomit-frame-pointer avx
gcc -m64 -march=core2 -msse4.1 -O3 -fomit-frame-pointer avx
gcc -m64 -march=core2 -msse4.1 -O -fomit-frame-pointer avx
gcc -m64 -march=core2 -msse4.1 -Os -fomit-frame-pointer avx
gcc -m64 -march=core2 -msse4 -O2 -fomit-frame-pointer avx
gcc -m64 -march=core2 -msse4 -O3 -fomit-frame-pointer avx
gcc -m64 -march=core2 -msse4 -O -fomit-frame-pointer avx
gcc -m64 -march=core2 -msse4 -Os -fomit-frame-pointer avx
gcc -m64 -march=corei7 -O2 -fomit-frame-pointer avx
gcc -m64 -march=corei7 -O3 -fomit-frame-pointer avx
gcc -m64 -march=corei7 -O -fomit-frame-pointer avx
gcc -m64 -march=corei7 -Os -fomit-frame-pointer avx
gcc -m64 -march=k8 -O2 -fomit-frame-pointer avx
gcc -m64 -march=k8 -O3 -fomit-frame-pointer avx
gcc -m64 -march=k8 -O -fomit-frame-pointer avx
gcc -m64 -march=k8 -Os -fomit-frame-pointer avx
gcc -m64 -march=nocona -O2 -fomit-frame-pointer avx
gcc -m64 -march=nocona -O3 -fomit-frame-pointer avx
gcc -m64 -march=nocona -O -fomit-frame-pointer avx
gcc -m64 -march=nocona -Os -fomit-frame-pointer avx
gcc -march=barcelona -O2 -fomit-frame-pointer avx
gcc -march=barcelona -O3 -fomit-frame-pointer avx
gcc -march=barcelona -O -fomit-frame-pointer avx
gcc -march=barcelona -Os -fomit-frame-pointer avx
gcc -march=k8 -O2 -fomit-frame-pointer avx
gcc -march=k8 -O3 -fomit-frame-pointer avx
gcc -march=k8 -O -fomit-frame-pointer avx
gcc -march=k8 -Os -fomit-frame-pointer avx
gcc -march=nocona -O2 -fomit-frame-pointer avx
gcc -march=nocona -O3 -fomit-frame-pointer avx
gcc -march=nocona -O -fomit-frame-pointer avx
gcc -march=nocona -Os -fomit-frame-pointer avx
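
The `target specific option mismatch` errors above arise when an `always_inline` AVX intrinsic such as `_mm256_load_pd` is called from a function that is compiled without AVX enabled. As a sketch only, and assuming GCC 4.9 or later (or a reasonably recent clang) on x86, one way to avoid the mismatch without passing `-mavx` for the whole file is a per-function `target("avx")` attribute combined with a runtime CPU check. The names `add_avx`, `add_scalar` and `add_doubles` are hypothetical and do not come from the lattisigns512 sources.

```c
#include <stddef.h>

#if defined(__x86_64__) || defined(__i386__)
#include <immintrin.h>

/* Compiled with AVX enabled regardless of the command-line -march setting. */
__attribute__((target("avx")))
static void add_avx(double *out, const double *a, const double *b, size_t n)
{
    size_t i = 0;
    for (; i + 4 <= n; i += 4) {
        __m256d va = _mm256_loadu_pd(a + i);
        __m256d vb = _mm256_loadu_pd(b + i);
        _mm256_storeu_pd(out + i, _mm256_add_pd(va, vb));
    }
    for (; i < n; i++)
        out[i] = a[i] + b[i];
}
#endif

/* Portable fallback used when AVX is not available. */
static void add_scalar(double *out, const double *a, const double *b, size_t n)
{
    for (size_t i = 0; i < n; i++)
        out[i] = a[i] + b[i];
}

/* Hypothetical entry point: picks the AVX path only if the CPU reports AVX. */
void add_doubles(double *out, const double *a, const double *b, size_t n)
{
#if defined(__x86_64__) || defined(__i386__)
    if (__builtin_cpu_supports("avx")) {
        add_avx(out, a, b, n);
        return;
    }
#endif
    add_scalar(out, a, b, n);
}
```

Within SUPERCOP these failures are usually harmless, since the framework simply records which (compiler, implementation) pairs do not build; the sketch matters mainly if the same code has to compile under baseline flags elsewhere.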

Compiler output

Implementation: avx
Security model: unknown
Compiler: gcc -m64 -march=barcelona -O2 -fomit-frame-pointer
ntt_transform.c: ntt_transform.c: In function 'ntt_transform':
ntt_transform.c: ntt_transform.c:27:9: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
ntt_transform.c: vpinv = _mm256_set_pd(PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE);
ntt_transform.c: ~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
ntt_transform.c: In file included from /usr/lib64/gcc/x86_64-suse-linux/7/include/immintrin.h:41:0,
ntt_transform.c: from ntt_transform.c:9:
ntt_transform.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/avxintrin.h:859:1: error: inlining failed in call to always_inline '_mm256_load_pd': target specific option mismatch
ntt_transform.c: _mm256_load_pd (double const *__P)
ntt_transform.c: ^~~~~~~~~~~~~~
ntt_transform.c: ntt_transform.c:35:8: note: called from here
ntt_transform.c: neg4 = _mm256_load_pd(_neg4);
ntt_transform.c: ~~~~~^~~~~~~~~~~~~~~~~~~~~~~
ntt_transform.c: In file included from /usr/lib64/gcc/x86_64-suse-linux/7/include/immintrin.h:41:0,
ntt_transform.c: from ntt_transform.c:9:
ntt_transform.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/avxintrin.h:859:1: error: inlining failed in call to always_inline '_mm256_load_pd': target specific option mismatch
ntt_transform.c: _mm256_load_pd (double const *__P)
ntt_transform.c: ^~~~~~~~~~~~~~
ntt_transform.c: ntt_transform.c:34:8: note: called from here
ntt_transform.c: neg2 = _mm256_load_pd(_neg2);
ntt_transform.c: ~~~~~^~~~~~~~~~~~~~~~~~~~~~~
ntt_transform.c: In file included from /usr/lib64/gcc/x86_64-suse-linux/7/include/immintrin.h:41:0,
ntt_transform.c: from ntt_transform.c:9:
ntt_transform.c: /usr/lib64/gcc/x86_64-suse-linux/7/include/avxintrin.h:859:1: error: inlining failed in call to always_inline '_mm256_load_pd': target specific option mismatch
ntt_transform.c: _mm256_load_pd (double const *__P)
ntt_transform.c: ^~~~~~~~~~~~~~
ntt_transform.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
Compiler  Implementations
gcc -m64 -march=barcelona -O2 -fomit-frame-pointer avx
gcc -m64 -march=barcelona -O3 -fomit-frame-pointer avx
gcc -m64 -march=barcelona -O -fomit-frame-pointer avx
gcc -m64 -march=barcelona -Os -fomit-frame-pointer avx