Implementation notes: amd64, cel02, crypto_sign/rainbow5cclassic963664

Computer: cel02
Architecture: amd64
CPU ID: GenuineIntel-00050657-bfebfbff
SUPERCOP version: 20201130
Operation: crypto_sign
Primitive: rainbow5cclassic963664
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
2141064103666 8 0142945 904 1656T:avx2clang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020121120201130
2418346167675 8 0188054 936 1720T:ssse3gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020121120201130
276731097888 8 0138150 936 1688T:ssse3gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020121120201130
291771296687 8 0136190 936 1688T:ssse3gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020121120201130
319483284480 8 0124489 904 1656T:ssse3clang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020121120201130
341631453149 8 099950 928 1656T:ssse3gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020121120201130
820953894458 0 0146630 928 1688T:amd64gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020121120201130
8537954126547 0 0178583 904 1656T:amd64clang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020121120201130
10277788116801 0 0167823 904 1656T:refclang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020121120201130
1274754866953 0 0117665 896 1656T:amd64clang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020121120201130
12955834168221 0 0223054 928 1720T:amd64gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020121120201130
1392006495629 0 0147470 928 1688T:amd64gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020121120201130
21081148128061 0 0182934 928 1720T:refgcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020121120201130
2206416227959 0 079174 920 1656T:refgcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020121120201130
2232198259608 0 0111438 928 1688T:refgcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020121120201130
2262362654288 0 0104985 896 1656T:refclang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020121120201130
2367319858874 0 0111078 928 1688T:refgcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020121120201130
2751082231589 0 082974 920 1656T:amd64gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020121120201130

Compiler output

Implementation: T:amd64
Security model: timingleaks
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
try.c: fatal error: error in backend: Cannot select: 0x55b6f2acf400: v4i64 = X86ISD::VTRUNC 0x55b6f2acf2d0
try.c: 0x55b6f2acf2d0: v16i32 = vselect 0x55b6f2ad2f20, 0x55b6f2a62700, 0x55b6f2acf1a0
try.c: 0x55b6f2ad2f20: v4i1 = X86ISD::PCMPGTM 0x55b6f2ab5a90, 0x55b6f2ab1620
try.c: 0x55b6f2ab5a90: v4i64 = X86ISD::VBROADCAST 0x55b6f2a6b0f0
try.c: 0x55b6f2a6b0f0: i64,ch = load<LD8[%lsr.iv6971]> 0x55b6f29c6960, 0x55b6f2a9fb00, undef:i64
try.c: 0x55b6f2a9fb00: i64,ch = CopyFromReg 0x55b6f29c6960, Register:i64 %vreg50
try.c: 0x55b6f2ab1880: i64 = Register %vreg50
try.c: 0x55b6f2a6c5c0: i64 = undef
try.c: 0x55b6f2ab1620: v4i64,ch = CopyFromReg 0x55b6f29c6960, Register:v4i64 %vreg13
try.c: 0x55b6f2ab62e0: v4i64 = Register %vreg13
try.c: 0x55b6f2a62700: v16i32 = X86ISD::VBROADCAST 0x55b6f2ab5cf0
try.c: 0x55b6f2ab5cf0: i32,ch = load<LD4[ConstantPool]> 0x55b6f29c6960, 0x55b6f2a5d2a0, undef:i64
try.c: 0x55b6f2a5d2a0: i64 = X86ISD::WrapperRIP TargetConstantPool:i64<i32 1> 0
try.c: 0x55b6f2aa0d00: i64 = TargetConstantPool<i32 1> 0
try.c: 0x55b6f2a6c5c0: i64 = undef
try.c: 0x55b6f2acf1a0: v16i32 = BUILD_VECTOR Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>
try.c: 0x55b6f2acf070: i32 = Constant<0>
try.c: 0x55b6f2acf070: i32 = Constant<0>
try.c: 0x55b6f2acf070: i32 = Constant<0>
try.c: 0x55b6f2acf070: i32 = Constant<0>
try.c: 0x55b6f2acf070: i32 = Constant<0>
try.c: 0x55b6f2acf070: i32 = Constant<0>
try.c: 0x55b6f2acf070: i32 = Constant<0>
try.c: 0x55b6f2acf070: i32 = Constant<0>
try.c: 0x55b6f2acf070: i32 = Constant<0>
try.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:amd64

Compiler output

Implementation: T:amd64
Security model: timingleaks
Compiler: clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
try.c: fatal error: error in backend: Cannot select: 0x556047e60640: v4i64 = X86ISD::VTRUNC 0x556047e60510
try.c: 0x556047e60510: v16i32 = vselect 0x556047e42b30, 0x556047dd7460, 0x556047e603e0
try.c: 0x556047e42b30: v4i1 = X86ISD::PCMPGTM 0x556047e3bf60, 0x556047e37af0
try.c: 0x556047e3bf60: v4i64 = X86ISD::VBROADCAST 0x556047dd7920
try.c: 0x556047dd7920: i64,ch = load<LD8[%lsr.iv6971]> 0x556047d35a40, 0x556047de7e60, undef:i64
try.c: 0x556047de7e60: i64,ch = CopyFromReg 0x556047d35a40, Register:i64 %vreg50
try.c: 0x556047e37d50: i64 = Register %vreg50
try.c: 0x556047dcb700: i64 = undef
try.c: 0x556047e37af0: v4i64,ch = CopyFromReg 0x556047d35a40, Register:v4i64 %vreg13
try.c: 0x556047e3c7b0: v4i64 = Register %vreg13
try.c: 0x556047dd7460: v16i32 = X86ISD::VBROADCAST 0x556047e3c1c0
try.c: 0x556047e3c1c0: i32,ch = load<LD4[ConstantPool]> 0x556047d35a40, 0x556047de6430, undef:i64
try.c: 0x556047de6430: i64 = X86ISD::WrapperRIP TargetConstantPool:i64<i32 1> 0
try.c: 0x556047dcc080: i64 = TargetConstantPool<i32 1> 0
try.c: 0x556047dcb700: i64 = undef
try.c: 0x556047e603e0: v16i32 = BUILD_VECTOR Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>
try.c: 0x556047e602b0: i32 = Constant<0>
try.c: 0x556047e602b0: i32 = Constant<0>
try.c: 0x556047e602b0: i32 = Constant<0>
try.c: 0x556047e602b0: i32 = Constant<0>
try.c: 0x556047e602b0: i32 = Constant<0>
try.c: 0x556047e602b0: i32 = Constant<0>
try.c: 0x556047e602b0: i32 = Constant<0>
try.c: 0x556047e602b0: i32 = Constant<0>
try.c: 0x556047e602b0: i32 = Constant<0>
try.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:amd64

Compiler output

Implementation: T:amd64
Security model: timingleaks
Compiler: clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
try.c: fatal error: error in backend: Cannot select: 0x5572dae4a8d0: v4i64 = X86ISD::VTRUNC 0x5572dae4a7a0
try.c: 0x5572dae4a7a0: v16i32 = vselect 0x5572dae460f0, 0x5572dae0fac0, 0x5572dae4a670
try.c: 0x5572dae460f0: v4i1 = X86ISD::PCMPGTM 0x5572dae438d0, 0x5572dae3f460
try.c: 0x5572dae438d0: v4i64 = X86ISD::VBROADCAST 0x5572dade68a0
try.c: 0x5572dade68a0: i64,ch = load<LD8[%lsr.iv6971]> 0x5572dad54920, 0x5572dae3a2c0, undef:i64
try.c: 0x5572dae3a2c0: i64,ch = CopyFromReg 0x5572dad54920, Register:i64 %vreg50
try.c: 0x5572dae3f6c0: i64 = Register %vreg50
try.c: 0x5572dade7d70: i64 = undef
try.c: 0x5572dae3f460: v4i64,ch = CopyFromReg 0x5572dad54920, Register:v4i64 %vreg13
try.c: 0x5572dae44120: v4i64 = Register %vreg13
try.c: 0x5572dae0fac0: v16i32 = X86ISD::VBROADCAST 0x5572dae43b30
try.c: 0x5572dae43b30: i32,ch = load<LD4[ConstantPool]> 0x5572dad54920, 0x5572dadeafc0, undef:i64
try.c: 0x5572dadeafc0: i64 = X86ISD::WrapperRIP TargetConstantPool:i64<i32 1> 0
try.c: 0x5572dae2b070: i64 = TargetConstantPool<i32 1> 0
try.c: 0x5572dade7d70: i64 = undef
try.c: 0x5572dae4a670: v16i32 = BUILD_VECTOR Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>
try.c: 0x5572dae4a540: i32 = Constant<0>
try.c: 0x5572dae4a540: i32 = Constant<0>
try.c: 0x5572dae4a540: i32 = Constant<0>
try.c: 0x5572dae4a540: i32 = Constant<0>
try.c: 0x5572dae4a540: i32 = Constant<0>
try.c: 0x5572dae4a540: i32 = Constant<0>
try.c: 0x5572dae4a540: i32 = Constant<0>
try.c: 0x5572dae4a540: i32 = Constant<0>
try.c: 0x5572dae4a540: i32 = Constant<0>
try.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:amd64

Compiler output

Implementation: T:avx2
Security model: timingleaks
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
try.c: fatal error: error in backend: Cannot select: 0x555b1bd2f3b0: v4i64 = X86ISD::VTRUNC 0x555b1bd2f280
try.c: 0x555b1bd2f280: v16i32 = vselect 0x555b1bd29d80, 0x555b1bcd0e80, 0x555b1bd2f150
try.c: 0x555b1bd29d80: v4i1 = X86ISD::PCMPGTM 0x555b1bd24940, 0x555b1bd204d0
try.c: 0x555b1bd24940: v4i64 = X86ISD::VBROADCAST 0x555b1bccb460
try.c: 0x555b1bccb460: i64,ch = load<LD8[%lsr.iv6971]> 0x555b1bc35900, 0x555b1bd0a890, undef:i64
try.c: 0x555b1bd0a890: i64,ch = CopyFromReg 0x555b1bc35900, Register:i64 %vreg50
try.c: 0x555b1bd20730: i64 = Register %vreg50
try.c: 0x555b1bccc930: i64 = undef
try.c: 0x555b1bd204d0: v4i64,ch = CopyFromReg 0x555b1bc35900, Register:v4i64 %vreg13
try.c: 0x555b1bd25190: v4i64 = Register %vreg13
try.c: 0x555b1bcd0e80: v16i32 = X86ISD::VBROADCAST 0x555b1bd24ba0
try.c: 0x555b1bd24ba0: i32,ch = load<LD4[ConstantPool]> 0x555b1bc35900, 0x555b1bccaa40, undef:i64
try.c: 0x555b1bccaa40: i64 = X86ISD::WrapperRIP TargetConstantPool:i64<i32 1> 0
try.c: 0x555b1bd13140: i64 = TargetConstantPool<i32 1> 0
try.c: 0x555b1bccc930: i64 = undef
try.c: 0x555b1bd2f150: v16i32 = BUILD_VECTOR Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>
try.c: 0x555b1bd2f020: i32 = Constant<0>
try.c: 0x555b1bd2f020: i32 = Constant<0>
try.c: 0x555b1bd2f020: i32 = Constant<0>
try.c: 0x555b1bd2f020: i32 = Constant<0>
try.c: 0x555b1bd2f020: i32 = Constant<0>
try.c: 0x555b1bd2f020: i32 = Constant<0>
try.c: 0x555b1bd2f020: i32 = Constant<0>
try.c: 0x555b1bd2f020: i32 = Constant<0>
try.c: 0x555b1bd2f020: i32 = Constant<0>
try.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:avx2

Compiler output

Implementation: T:avx2
Security model: timingleaks
Compiler: clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
try.c: fatal error: error in backend: Cannot select: 0x56238af0d880: v4i64 = X86ISD::VTRUNC 0x56238af0d750
try.c: 0x56238af0d750: v16i32 = vselect 0x56238af08250, 0x56238ae8fb20, 0x56238af0d620
try.c: 0x56238af08250: v4i1 = X86ISD::PCMPGTM 0x56238aef40c0, 0x56238aef0650
try.c: 0x56238aef40c0: v4i64 = X86ISD::VBROADCAST 0x56238ae8ffe0
try.c: 0x56238ae8ffe0: i64,ch = load<LD8[%lsr.iv6971]> 0x56238adeea40, 0x56238ae8a1c0, undef:i64
try.c: 0x56238ae8a1c0: i64,ch = CopyFromReg 0x56238adeea40, Register:i64 %vreg50
try.c: 0x56238aef08b0: i64 = Register %vreg50
try.c: 0x56238ae8dce0: i64 = undef
try.c: 0x56238aef0650: v4i64,ch = CopyFromReg 0x56238adeea40, Register:v4i64 %vreg13
try.c: 0x56238aef4910: v4i64 = Register %vreg13
try.c: 0x56238ae8fb20: v16i32 = X86ISD::VBROADCAST 0x56238aef4320
try.c: 0x56238aef4320: i32,ch = load<LD4[ConstantPool]> 0x56238adeea40, 0x56238ae924c0, undef:i64
try.c: 0x56238ae924c0: i64 = X86ISD::WrapperRIP TargetConstantPool:i64<i32 1> 0
try.c: 0x56238ae8e660: i64 = TargetConstantPool<i32 1> 0
try.c: 0x56238ae8dce0: i64 = undef
try.c: 0x56238af0d620: v16i32 = BUILD_VECTOR Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>
try.c: 0x56238af0d4f0: i32 = Constant<0>
try.c: 0x56238af0d4f0: i32 = Constant<0>
try.c: 0x56238af0d4f0: i32 = Constant<0>
try.c: 0x56238af0d4f0: i32 = Constant<0>
try.c: 0x56238af0d4f0: i32 = Constant<0>
try.c: 0x56238af0d4f0: i32 = Constant<0>
try.c: 0x56238af0d4f0: i32 = Constant<0>
try.c: 0x56238af0d4f0: i32 = Constant<0>
try.c: 0x56238af0d4f0: i32 = Constant<0>
try.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:avx2

Compiler output

Implementation: T:avx2
Security model: timingleaks
Compiler: clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
try.c: fatal error: error in backend: Cannot select: 0x55cf06df0280: v4i64 = X86ISD::VTRUNC 0x55cf06df0150
try.c: 0x55cf06df0150: v16i32 = vselect 0x55cf06de0270, 0x55cf06d79b20, 0x55cf06df0020
try.c: 0x55cf06de0270: v4i1 = X86ISD::PCMPGTM 0x55cf06dd48f0, 0x55cf06dd0480
try.c: 0x55cf06dd48f0: v4i64 = X86ISD::VBROADCAST 0x55cf06d75ca0
try.c: 0x55cf06d75ca0: i64,ch = load<LD8[%lsr.iv6971]> 0x55cf06ce59d0, 0x55cf06dbf7a0, undef:i64
try.c: 0x55cf06dbf7a0: i64,ch = CopyFromReg 0x55cf06ce59d0, Register:i64 %vreg50
try.c: 0x55cf06dd06e0: i64 = Register %vreg50
try.c: 0x55cf06d78190: i64 = undef
try.c: 0x55cf06dd0480: v4i64,ch = CopyFromReg 0x55cf06ce59d0, Register:v4i64 %vreg13
try.c: 0x55cf06dd5140: v4i64 = Register %vreg13
try.c: 0x55cf06d79b20: v16i32 = X86ISD::VBROADCAST 0x55cf06dd4b50
try.c: 0x55cf06dd4b50: i32,ch = load<LD4[ConstantPool]> 0x55cf06ce59d0, 0x55cf06d75280, undef:i64
try.c: 0x55cf06d75280: i64 = X86ISD::WrapperRIP TargetConstantPool:i64<i32 1> 0
try.c: 0x55cf06dbde80: i64 = TargetConstantPool<i32 1> 0
try.c: 0x55cf06d78190: i64 = undef
try.c: 0x55cf06df0020: v16i32 = BUILD_VECTOR Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>
try.c: 0x55cf06defef0: i32 = Constant<0>
try.c: 0x55cf06defef0: i32 = Constant<0>
try.c: 0x55cf06defef0: i32 = Constant<0>
try.c: 0x55cf06defef0: i32 = Constant<0>
try.c: 0x55cf06defef0: i32 = Constant<0>
try.c: 0x55cf06defef0: i32 = Constant<0>
try.c: 0x55cf06defef0: i32 = Constant<0>
try.c: 0x55cf06defef0: i32 = Constant<0>
try.c: 0x55cf06defef0: i32 = Constant<0>
try.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:avx2

Compiler output

Implementation: T:avx2
Security model: timingleaks
Compiler: clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
blas_comm.c: In file included from blas_comm.c:6:
blas_comm.c: In file included from ./blas.h:25:
blas_comm.c: ./blas_avx2.h:88:17: error: always_inline function '_mm256_loadu_si256' requires target feature 'sse4.2', but would be inlined into function 'gf256v_add_avx2' that is compiled without support for 'sse4.2'
blas_comm.c: __m256i inp = _mm256_loadu_si256( (__m256i*) (a+i*32) );
blas_comm.c: ^
blas_comm.c: ./blas_avx2.h:89:17: error: always_inline function '_mm256_loadu_si256' requires target feature 'sse4.2', but would be inlined into function 'gf256v_add_avx2' that is compiled without support for 'sse4.2'
blas_comm.c: __m256i out = _mm256_loadu_si256( (__m256i*) (accu_b+i*32) );
blas_comm.c: ^
blas_comm.c: ./blas_avx2.h:91:3: error: always_inline function '_mm256_storeu_si256' requires target feature 'sse4.2', but would be inlined into function 'gf256v_add_avx2' that is compiled without support for 'sse4.2'
blas_comm.c: _mm256_storeu_si256( (__m256i*) (accu_b+i*32) , out );
blas_comm.c: ^
blas_comm.c: fatal error: error in backend: Do not know how to split this operator's operand!
blas_comm.c:
blas_comm.c: clang: error: clang frontend command failed with exit code 70 (use -v to see invocation)
blas_comm.c: clang version 3.8.1-24 (tags/RELEASE_381/final)
blas_comm.c: Target: x86_64-pc-linux-gnu
blas_comm.c: Thread model: posix
blas_comm.c: InstalledDir: /usr/bin
blas_comm.c: clang: note: diagnostic msg: PLEASE submit a bug report to http://llvm.org/bugs/ and include the crash backtrace, preprocessed source, and associated run script.
blas_comm.c: clang: note: diagnostic msg:
blas_comm.c: ********************
blas_comm.c:
blas_comm.c: PLEASE ATTACH THE FOLLOWING FILES TO THE BUG REPORT:
blas_comm.c: Preprocessed source(s) and associated run script(s) are located at:
blas_comm.c: clang: note: diagnostic msg: /tmp/blas_comm-9bb96d.c
blas_comm.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:avx2

Compiler output

Implementation: T:avx2
Security model: timingleaks
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE
blas_matrix_avx2.c: blas_matrix_avx2.c: In function '_gf256mat_gauss_elim_avx2_32x':
blas_matrix_avx2.c: blas_matrix_avx2.c:479:19: warning: implicit declaration of function '_mm256_setr_m128i' [-Wimplicit-function-declaration]
blas_matrix_avx2.c: __m256i mask = _mm256_setr_m128i( add_mask , add_mask );
blas_matrix_avx2.c: ^~~~~~~~~~~~~~~~~
blas_matrix_avx2.c: blas_matrix_avx2.c:479:19: error: incompatible types when initializing type '__m256i {aka __vector(4) long long int}' using type 'int'
blas_matrix_avx2.c: blas_matrix_avx2.c:495:17: error: incompatible types when initializing type '__m256i {aka __vector(4) long long int}' using type 'int'
blas_matrix_avx2.c: __m256i pi2 = _mm256_setr_m128i( pi1 , pi1 );
blas_matrix_avx2.c: ^~~~~~~~~~~~~~~~~

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx2
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx2
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx2
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx2

Compiler output

Implementation: T:ref
Security model: timingleaks
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
try.c: fatal error: error in backend: Cannot select: 0x55a71db49140: v4i64 = X86ISD::VTRUNC 0x55a71db49010
try.c: 0x55a71db49010: v16i32 = vselect 0x55a71db40050, 0x55a71dafe070, 0x55a71db48ee0
try.c: 0x55a71db40050: v4i1 = X86ISD::PCMPGTM 0x55a71db42b10, 0x55a71db3e6a0
try.c: 0x55a71db42b10: v4i64 = X86ISD::VBROADCAST 0x55a71dafb210
try.c: 0x55a71dafb210: i64,ch = load<LD8[%lsr.iv6971]> 0x55a71da53950, 0x55a71db31fa0, undef:i64
try.c: 0x55a71db31fa0: i64,ch = CopyFromReg 0x55a71da53950, Register:i64 %vreg50
try.c: 0x55a71db3e900: i64 = Register %vreg50
try.c: 0x55a71dafc6e0: i64 = undef
try.c: 0x55a71db3e6a0: v4i64,ch = CopyFromReg 0x55a71da53950, Register:v4i64 %vreg13
try.c: 0x55a71db43360: v4i64 = Register %vreg13
try.c: 0x55a71dafe070: v16i32 = X86ISD::VBROADCAST 0x55a71db42d70
try.c: 0x55a71db42d70: i32,ch = load<LD4[ConstantPool]> 0x55a71da53950, 0x55a71daead00, undef:i64
try.c: 0x55a71daead00: i64 = X86ISD::WrapperRIP TargetConstantPool:i64<i32 1> 0
try.c: 0x55a71db23250: i64 = TargetConstantPool<i32 1> 0
try.c: 0x55a71dafc6e0: i64 = undef
try.c: 0x55a71db48ee0: v16i32 = BUILD_VECTOR Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>
try.c: 0x55a71db48db0: i32 = Constant<0>
try.c: 0x55a71db48db0: i32 = Constant<0>
try.c: 0x55a71db48db0: i32 = Constant<0>
try.c: 0x55a71db48db0: i32 = Constant<0>
try.c: 0x55a71db48db0: i32 = Constant<0>
try.c: 0x55a71db48db0: i32 = Constant<0>
try.c: 0x55a71db48db0: i32 = Constant<0>
try.c: 0x55a71db48db0: i32 = Constant<0>
try.c: 0x55a71db48db0: i32 = Constant<0>
try.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:ref

Compiler output

Implementation: T:ref
Security model: timingleaks
Compiler: clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
try.c: fatal error: error in backend: Cannot select: 0x5583a0bd7690: v4i64 = X86ISD::VTRUNC 0x5583a0bd7560
try.c: 0x5583a0bd7560: v16i32 = vselect 0x5583a0bc3af0, 0x5583a0b7ed60, 0x5583a0bd7430
try.c: 0x5583a0bc3af0: v4i1 = X86ISD::PCMPGTM 0x5583a0bbcc80, 0x5583a0bb9820
try.c: 0x5583a0bbcc80: v4i64 = X86ISD::VBROADCAST 0x5583a0b7f220
try.c: 0x5583a0b7f220: i64,ch = load<LD8[%lsr.iv6971]> 0x5583a0ab7a30, 0x5583a0b55a00, undef:i64
try.c: 0x5583a0b55a00: i64,ch = CopyFromReg 0x5583a0ab7a30, Register:i64 %vreg50
try.c: 0x5583a0bb9a80: i64 = Register %vreg50
try.c: 0x5583a0b5a4d0: i64 = undef
try.c: 0x5583a0bb9820: v4i64,ch = CopyFromReg 0x5583a0ab7a30, Register:v4i64 %vreg13
try.c: 0x5583a0bbd4d0: v4i64 = Register %vreg13
try.c: 0x5583a0b7ed60: v16i32 = X86ISD::VBROADCAST 0x5583a0bbcee0
try.c: 0x5583a0bbcee0: i32,ch = load<LD4[ConstantPool]> 0x5583a0ab7a30, 0x5583a0b537c0, undef:i64
try.c: 0x5583a0b537c0: i64 = X86ISD::WrapperRIP TargetConstantPool:i64<i32 1> 0
try.c: 0x5583a0b5ae50: i64 = TargetConstantPool<i32 1> 0
try.c: 0x5583a0b5a4d0: i64 = undef
try.c: 0x5583a0bd7430: v16i32 = BUILD_VECTOR Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>
try.c: 0x5583a0bd7300: i32 = Constant<0>
try.c: 0x5583a0bd7300: i32 = Constant<0>
try.c: 0x5583a0bd7300: i32 = Constant<0>
try.c: 0x5583a0bd7300: i32 = Constant<0>
try.c: 0x5583a0bd7300: i32 = Constant<0>
try.c: 0x5583a0bd7300: i32 = Constant<0>
try.c: 0x5583a0bd7300: i32 = Constant<0>
try.c: 0x5583a0bd7300: i32 = Constant<0>
try.c: 0x5583a0bd7300: i32 = Constant<0>
try.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:ref

Compiler output

Implementation: T:ref
Security model: timingleaks
Compiler: clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
try.c: fatal error: error in backend: Cannot select: 0x56353f94df20: v4i64 = X86ISD::VTRUNC 0x56353f94ddf0
try.c: 0x56353f94ddf0: v16i32 = vselect 0x56353f969410, 0x56353f8f09f0, 0x56353f94dcc0
try.c: 0x56353f969410: v4i1 = X86ISD::PCMPGTM 0x56353f948ce0, 0x56353f944870
try.c: 0x56353f948ce0: v4i64 = X86ISD::VBROADCAST 0x56353f9024e0
try.c: 0x56353f9024e0: i64,ch = load<LD8[%lsr.iv6971]> 0x56353f859940, 0x56353f934100, undef:i64
try.c: 0x56353f934100: i64,ch = CopyFromReg 0x56353f859940, Register:i64 %vreg50
try.c: 0x56353f944ad0: i64 = Register %vreg50
try.c: 0x56353f9039b0: i64 = undef
try.c: 0x56353f944870: v4i64,ch = CopyFromReg 0x56353f859940, Register:v4i64 %vreg13
try.c: 0x56353f949530: v4i64 = Register %vreg13
try.c: 0x56353f8f09f0: v16i32 = X86ISD::VBROADCAST 0x56353f948f40
try.c: 0x56353f948f40: i32,ch = load<LD4[ConstantPool]> 0x56353f859940, 0x56353f8e58c0, undef:i64
try.c: 0x56353f8e58c0: i64 = X86ISD::WrapperRIP TargetConstantPool:i64<i32 1> 0
try.c: 0x56353f937a50: i64 = TargetConstantPool<i32 1> 0
try.c: 0x56353f9039b0: i64 = undef
try.c: 0x56353f94dcc0: v16i32 = BUILD_VECTOR Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>
try.c: 0x56353f94db90: i32 = Constant<0>
try.c: 0x56353f94db90: i32 = Constant<0>
try.c: 0x56353f94db90: i32 = Constant<0>
try.c: 0x56353f94db90: i32 = Constant<0>
try.c: 0x56353f94db90: i32 = Constant<0>
try.c: 0x56353f94db90: i32 = Constant<0>
try.c: 0x56353f94db90: i32 = Constant<0>
try.c: 0x56353f94db90: i32 = Constant<0>
try.c: 0x56353f94db90: i32 = Constant<0>
try.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:ref

Compiler output

Implementation: T:ssse3
Security model: timingleaks
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
try.c: fatal error: error in backend: Cannot select: 0x5634f59a2f20: v4i64 = X86ISD::VTRUNC 0x5634f59a2df0
try.c: 0x5634f59a2df0: v16i32 = vselect 0x5634f59890c0, 0x5634f592f500, 0x5634f59a2cc0
try.c: 0x5634f59890c0: v4i1 = X86ISD::PCMPGTM 0x5634f598b770, 0x5634f5987300
try.c: 0x5634f598b770: v4i64 = X86ISD::VBROADCAST 0x5634f5956f40
try.c: 0x5634f5956f40: i64,ch = load<LD8[%lsr.iv6971]> 0x5634f589c960, 0x5634f5949630, undef:i64
try.c: 0x5634f5949630: i64,ch = CopyFromReg 0x5634f589c960, Register:i64 %vreg50
try.c: 0x5634f5987560: i64 = Register %vreg50
try.c: 0x5634f5958410: i64 = undef
try.c: 0x5634f5987300: v4i64,ch = CopyFromReg 0x5634f589c960, Register:v4i64 %vreg13
try.c: 0x5634f598bfc0: v4i64 = Register %vreg13
try.c: 0x5634f592f500: v16i32 = X86ISD::VBROADCAST 0x5634f598b9d0
try.c: 0x5634f598b9d0: i32,ch = load<LD4[ConstantPool]> 0x5634f589c960, 0x5634f59362e0, undef:i64
try.c: 0x5634f59362e0: i64 = X86ISD::WrapperRIP TargetConstantPool:i64<i32 1> 0
try.c: 0x5634f5945c70: i64 = TargetConstantPool<i32 1> 0
try.c: 0x5634f5958410: i64 = undef
try.c: 0x5634f59a2cc0: v16i32 = BUILD_VECTOR Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>
try.c: 0x5634f59a2b90: i32 = Constant<0>
try.c: 0x5634f59a2b90: i32 = Constant<0>
try.c: 0x5634f59a2b90: i32 = Constant<0>
try.c: 0x5634f59a2b90: i32 = Constant<0>
try.c: 0x5634f59a2b90: i32 = Constant<0>
try.c: 0x5634f59a2b90: i32 = Constant<0>
try.c: 0x5634f59a2b90: i32 = Constant<0>
try.c: 0x5634f59a2b90: i32 = Constant<0>
try.c: 0x5634f59a2b90: i32 = Constant<0>
try.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:ssse3

Compiler output

Implementation: T:ssse3
Security model: timingleaks
Compiler: clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
try.c: fatal error: error in backend: Cannot select: 0x561124f1b140: v4i64 = X86ISD::VTRUNC 0x561124f1b010
try.c: 0x561124f1b010: v16i32 = vselect 0x561124f17b20, 0x561124e95180, 0x561124f1aee0
try.c: 0x561124f17b20: v4i1 = X86ISD::PCMPGTM 0x561124f00780, 0x561124efc310
try.c: 0x561124f00780: v4i64 = X86ISD::VBROADCAST 0x561124e95640
try.c: 0x561124e95640: i64,ch = load<LD8[%lsr.iv6971]> 0x561124df9a40, 0x561124e9f360, undef:i64
try.c: 0x561124e9f360: i64,ch = CopyFromReg 0x561124df9a40, Register:i64 %vreg50
try.c: 0x561124efc570: i64 = Register %vreg50
try.c: 0x561124eaed50: i64 = undef
try.c: 0x561124efc310: v4i64,ch = CopyFromReg 0x561124df9a40, Register:v4i64 %vreg13
try.c: 0x561124f00fd0: v4i64 = Register %vreg13
try.c: 0x561124e95180: v16i32 = X86ISD::VBROADCAST 0x561124f009e0
try.c: 0x561124f009e0: i32,ch = load<LD4[ConstantPool]> 0x561124df9a40, 0x561124e9bd70, undef:i64
try.c: 0x561124e9bd70: i64 = X86ISD::WrapperRIP TargetConstantPool:i64<i32 1> 0
try.c: 0x561124eaf6d0: i64 = TargetConstantPool<i32 1> 0
try.c: 0x561124eaed50: i64 = undef
try.c: 0x561124f1aee0: v16i32 = BUILD_VECTOR Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>
try.c: 0x561124f1adb0: i32 = Constant<0>
try.c: 0x561124f1adb0: i32 = Constant<0>
try.c: 0x561124f1adb0: i32 = Constant<0>
try.c: 0x561124f1adb0: i32 = Constant<0>
try.c: 0x561124f1adb0: i32 = Constant<0>
try.c: 0x561124f1adb0: i32 = Constant<0>
try.c: 0x561124f1adb0: i32 = Constant<0>
try.c: 0x561124f1adb0: i32 = Constant<0>
try.c: 0x561124f1adb0: i32 = Constant<0>
try.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:ssse3

Compiler output

Implementation: T:ssse3
Security model: timingleaks
Compiler: clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
try.c: fatal error: error in backend: Cannot select: 0x55bbbd7811a0: v4i64 = X86ISD::VTRUNC 0x55bbbd781070
try.c: 0x55bbbd781070: v16i32 = vselect 0x55bbbd788c50, 0x55bbbd722350, 0x55bbbd780f40
try.c: 0x55bbbd788c50: v4i1 = X86ISD::PCMPGTM 0x55bbbd766820, 0x55bbbd7623b0
try.c: 0x55bbbd766820: v4i64 = X86ISD::VBROADCAST 0x55bbbd706aa0
try.c: 0x55bbbd706aa0: i64,ch = load<LD8[%lsr.iv6971]> 0x55bbbd677950, 0x55bbbd752150, undef:i64
try.c: 0x55bbbd752150: i64,ch = CopyFromReg 0x55bbbd677950, Register:i64 %vreg50
try.c: 0x55bbbd762610: i64 = Register %vreg50
try.c: 0x55bbbd7209c0: i64 = undef
try.c: 0x55bbbd7623b0: v4i64,ch = CopyFromReg 0x55bbbd677950, Register:v4i64 %vreg13
try.c: 0x55bbbd767070: v4i64 = Register %vreg13
try.c: 0x55bbbd722350: v16i32 = X86ISD::VBROADCAST 0x55bbbd766a80
try.c: 0x55bbbd766a80: i32,ch = load<LD4[ConstantPool]> 0x55bbbd677950, 0x55bbbd706080, undef:i64
try.c: 0x55bbbd706080: i64 = X86ISD::WrapperRIP TargetConstantPool:i64<i32 1> 0
try.c: 0x55bbbd6e9780: i64 = TargetConstantPool<i32 1> 0
try.c: 0x55bbbd7209c0: i64 = undef
try.c: 0x55bbbd780f40: v16i32 = BUILD_VECTOR Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>
try.c: 0x55bbbd780e10: i32 = Constant<0>
try.c: 0x55bbbd780e10: i32 = Constant<0>
try.c: 0x55bbbd780e10: i32 = Constant<0>
try.c: 0x55bbbd780e10: i32 = Constant<0>
try.c: 0x55bbbd780e10: i32 = Constant<0>
try.c: 0x55bbbd780e10: i32 = Constant<0>
try.c: 0x55bbbd780e10: i32 = Constant<0>
try.c: 0x55bbbd780e10: i32 = Constant<0>
try.c: 0x55bbbd780e10: i32 = Constant<0>
try.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:ssse3

Compiler output

Implementation: T:ssse3
Security model: timingleaks
Compiler: clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
blas_matrix_ref.c: In file included from blas_matrix_ref.c:6:
blas_matrix_ref.c: In file included from ./blas.h:25:
blas_matrix_ref.c: In file included from ./blas_sse.h:16:
blas_matrix_ref.c: ./gf16_sse.h:34:9: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'linear_transform_8x8_128b' that is compiled without support for 'ssse3'
blas_matrix_ref.c: return _mm_shuffle_epi8(tab_l,v&mask_f)^_mm_shuffle_epi8(tab_h,_mm_srli_epi16(v,4)&mask_f);
blas_matrix_ref.c: ^
blas_matrix_ref.c: ./gf16_sse.h:34:42: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'linear_transform_8x8_128b' that is compiled without support for 'ssse3'
blas_matrix_ref.c: return _mm_shuffle_epi8(tab_l,v&mask_f)^_mm_shuffle_epi8(tab_h,_mm_srli_epi16(v,4)&mask_f);
blas_matrix_ref.c: ^
blas_matrix_ref.c: fatal error: error in backend: Cannot select: 0x557fcfa22360: v16i8 = X86ISD::PSHUFB 0x557fcfa65cd0, 0x557fcfa38760
blas_matrix_ref.c: 0x557fcfa65cd0: v16i8,ch = CopyFromReg 0x557fcf95b5b0, Register:v16i8 %vreg15
blas_matrix_ref.c: 0x557fcfa37b80: v16i8 = Register %vreg15
blas_matrix_ref.c: 0x557fcfa38760: v16i8 = bitcast 0x557fcfa5ceb0
blas_matrix_ref.c: 0x557fcfa5ceb0: v2i64 = and 0x557fcfa61560, 0x557fcfa5d830
blas_matrix_ref.c: 0x557fcfa61560: v2i64 = bitcast 0x557fcfa60be0
blas_matrix_ref.c: 0x557fcfa60be0: v8i16 = X86ISD::VSRLI 0x557fcfa66190, Constant:i8<4>
blas_matrix_ref.c: 0x557fcfa66190: v8i16 = bitcast 0x557fcfa610a0
blas_matrix_ref.c: 0x557fcfa610a0: v2i64,ch = load<LD16[%uglygep1617](align=1)(tbaa=<0x557fcf5e0628>)> 0x557fcf95b5b0, 0x557fcfa5c9f0, undef:i64
blas_matrix_ref.c: 0x557fcfa5c9f0: i64 = add 0x557fcfa60d10, 0x557fcfa617c0
blas_matrix_ref.c: 0x557fcfa60d10: i64,ch = CopyFromReg 0x557fcf95b5b0, Register:i64 %vreg12
blas_matrix_ref.c: 0x557fcfa61a20: i64 = Register %vreg12
blas_matrix_ref.c: 0x557fcfa617c0: i64,ch = CopyFromReg 0x557fcf95b5b0, Register:i64 %vreg17
blas_matrix_ref.c: 0x557fcfa61300: i64 = Register %vreg17
blas_matrix_ref.c: 0x557fcfa662c0: i64 = undef
blas_matrix_ref.c: 0x557fcfa38630: i8 = Constant<4>
blas_matrix_ref.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:ssse3