Implementation notes: amd64, cel02, crypto_sign/rainbow1acompres363232

Computer: cel02
Architecture: amd64
CPU ID: GenuineIntel-00050657-bfebfbff
SUPERCOP version: 20201130
Operation: crypto_sign
Primitive: rainbow1acompres363232
Time | Object size | Test size | Implementation | Compiler | Benchmark date | SUPERCOP version
12270740 | 92491 8 0 | 151107 928 1688 | T:ssse3 | gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20201211 | 20201130
13825630 | 138183 0 0 | 211091 920 1720 | T:amd64 | gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20201211 | 20201130
19756424 | 143689 8 0 | 182451 928 1720 | T:ssse3 | gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20201211 | 20201130
20086678 | 92456 8 0 | 150746 896 1656 | T:ssse3 | clang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20201211 | 20201130
22661728 | 51810 8 0 | 116875 920 1656 | T:ssse3 | gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20201211 | 20201130
23321636 | 109462 8 0 | 167026 896 1656 | T:avx2 | clang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20201211 | 20201130
24211630 | 91353 8 0 | 149179 928 1688 | T:ssse3 | gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20201211 | 20201130
24932178 | 89006 0 0 | 159179 920 1688 | T:amd64 | gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20201211 | 20201130
25195960 | 60655 0 0 | 129714 888 1656 | T:amd64 | clang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20201211 | 20201130
25466538 | 103378 0 0 | 173928 896 1656 | T:amd64 | clang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20201211 | 20201130
28674644 | 50978 0 0 | 120018 888 1656 | T:ref | clang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20201211 | 20201130
28750072 | 87050 0 0 | 157563 920 1688 | T:amd64 | gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20201211 | 20201130
29678346 | 106856 0 0 | 180083 920 1720 | T:ref | gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20201211 | 20201130
29741176 | 95315 0 0 | 164880 896 1656 | T:ref | clang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE | 20201211 | 20201130
30100742 | 54950 0 0 | 125499 920 1688 | T:ref | gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20201211 | 20201130
30994816 | 55237 0 0 | 125403 920 1688 | T:ref | gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20201211 | 20201130
32559734 | 29916 0 0 | 99571 912 1656 | T:amd64 | gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20201211 | 20201130
37624606 | 26925 0 0 | 96411 912 1656 | T:ref | gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE | 20201211 | 20201130

Compiler output

Implementation: T:amd64
Security model: timingleaks
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
try.c: fatal error: error in backend: Cannot select: 0x55f20f47d3b0: v4i64 = X86ISD::VTRUNC 0x55f20f47d280
try.c: 0x55f20f47d280: v16i32 = vselect 0x55f20f479d90, 0x55f20f430800, 0x55f20f47d150
try.c: 0x55f20f479d90: v4i1 = X86ISD::PCMPGTM 0x55f20f4759f0, 0x55f20f471580
try.c: 0x55f20f4759f0: v4i64 = X86ISD::VBROADCAST 0x55f20f43abd0
try.c: 0x55f20f43abd0: i64,ch = load<LD8[%lsr.iv6971]> 0x55f20f386960, 0x55f20f45fe80, undef:i64
try.c: 0x55f20f45fe80: i64,ch = CopyFromReg 0x55f20f386960, Register:i64 %vreg50
try.c: 0x55f20f4717e0: i64 = Register %vreg50
try.c: 0x55f20f43c0a0: i64 = undef
try.c: 0x55f20f471580: v4i64,ch = CopyFromReg 0x55f20f386960, Register:v4i64 %vreg13
try.c: 0x55f20f476240: v4i64 = Register %vreg13
try.c: 0x55f20f430800: v16i32 = X86ISD::VBROADCAST 0x55f20f475c50
try.c: 0x55f20f475c50: i32,ch = load<LD4[ConstantPool]> 0x55f20f386960, 0x55f20f41e530, undef:i64
try.c: 0x55f20f41e530: i64 = X86ISD::WrapperRIP TargetConstantPool:i64<i32 1> 0
try.c: 0x55f20f3f1d30: i64 = TargetConstantPool<i32 1> 0
try.c: 0x55f20f43c0a0: i64 = undef
try.c: 0x55f20f47d150: v16i32 = BUILD_VECTOR Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>
try.c: 0x55f20f47d020: i32 = Constant<0>
try.c: 0x55f20f47d020: i32 = Constant<0>
try.c: 0x55f20f47d020: i32 = Constant<0>
try.c: 0x55f20f47d020: i32 = Constant<0>
try.c: 0x55f20f47d020: i32 = Constant<0>
try.c: 0x55f20f47d020: i32 = Constant<0>
try.c: 0x55f20f47d020: i32 = Constant<0>
try.c: 0x55f20f47d020: i32 = Constant<0>
try.c: 0x55f20f47d020: i32 = Constant<0>
try.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler | Implementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE | T:amd64

Compiler output

Implementation: T:amd64
Security model: timingleaks
Compiler: clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
try.c: fatal error: error in backend: Cannot select: 0x5632256cc800: v4i64 = X86ISD::VTRUNC 0x5632256cc6d0
try.c: 0x5632256cc6d0: v16i32 = vselect 0x5632256ce4d0, 0x563225650440, 0x5632256cc5a0
try.c: 0x5632256ce4d0: v4i1 = X86ISD::PCMPGTM 0x5632256b3860, 0x5632256af9f0
try.c: 0x5632256b3860: v4i64 = X86ISD::VBROADCAST 0x563225650900
try.c: 0x563225650900: i64,ch = load<LD8[%lsr.iv6971]> 0x5632255ada50, 0x563225674b00, undef:i64
try.c: 0x563225674b00: i64,ch = CopyFromReg 0x5632255ada50, Register:i64 %vreg50
try.c: 0x5632256afc50: i64 = Register %vreg50
try.c: 0x563225647bb0: i64 = undef
try.c: 0x5632256af9f0: v4i64,ch = CopyFromReg 0x5632255ada50, Register:v4i64 %vreg13
try.c: 0x5632256b40b0: v4i64 = Register %vreg13
try.c: 0x563225650440: v16i32 = X86ISD::VBROADCAST 0x5632256b3ac0
try.c: 0x5632256b3ac0: i32,ch = load<LD4[ConstantPool]> 0x5632255ada50, 0x5632256728c0, undef:i64
try.c: 0x5632256728c0: i64 = X86ISD::WrapperRIP TargetConstantPool:i64<i32 1> 0
try.c: 0x563225648530: i64 = TargetConstantPool<i32 1> 0
try.c: 0x563225647bb0: i64 = undef
try.c: 0x5632256cc5a0: v16i32 = BUILD_VECTOR Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>
try.c: 0x5632256cc470: i32 = Constant<0>
try.c: 0x5632256cc470: i32 = Constant<0>
try.c: 0x5632256cc470: i32 = Constant<0>
try.c: 0x5632256cc470: i32 = Constant<0>
try.c: 0x5632256cc470: i32 = Constant<0>
try.c: 0x5632256cc470: i32 = Constant<0>
try.c: 0x5632256cc470: i32 = Constant<0>
try.c: 0x5632256cc470: i32 = Constant<0>
try.c: 0x5632256cc470: i32 = Constant<0>
try.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler | Implementations
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE | T:amd64

Compiler output

Implementation: T:amd64
Security model: timingleaks
Compiler: clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
try.c: fatal error: error in backend: Cannot select: 0x556f2449b070: v4i64 = X86ISD::VTRUNC 0x556f2449af40
try.c: 0x556f2449af40: v16i32 = vselect 0x556f24487500, 0x556f24428b50, 0x556f2449ae10
try.c: 0x556f24487500: v4i1 = X86ISD::PCMPGTM 0x556f244808a0, 0x556f2447c430
try.c: 0x556f244808a0: v4i64 = X86ISD::VBROADCAST 0x556f2443b740
try.c: 0x556f2443b740: i64,ch = load<LD8[%lsr.iv6971]> 0x556f24391950, 0x556f24477290, undef:i64
try.c: 0x556f24477290: i64,ch = CopyFromReg 0x556f24391950, Register:i64 %vreg50
try.c: 0x556f2447c690: i64 = Register %vreg50
try.c: 0x556f244271c0: i64 = undef
try.c: 0x556f2447c430: v4i64,ch = CopyFromReg 0x556f24391950, Register:v4i64 %vreg13
try.c: 0x556f244810f0: v4i64 = Register %vreg13
try.c: 0x556f24428b50: v16i32 = X86ISD::VBROADCAST 0x556f24480b00
try.c: 0x556f24480b00: i32,ch = load<LD4[ConstantPool]> 0x556f24391950, 0x556f2443ad20, undef:i64
try.c: 0x556f2443ad20: i64 = X86ISD::WrapperRIP TargetConstantPool:i64<i32 1> 0
try.c: 0x556f244677b0: i64 = TargetConstantPool<i32 1> 0
try.c: 0x556f244271c0: i64 = undef
try.c: 0x556f2449ae10: v16i32 = BUILD_VECTOR Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>
try.c: 0x556f2449ace0: i32 = Constant<0>
try.c: 0x556f2449ace0: i32 = Constant<0>
try.c: 0x556f2449ace0: i32 = Constant<0>
try.c: 0x556f2449ace0: i32 = Constant<0>
try.c: 0x556f2449ace0: i32 = Constant<0>
try.c: 0x556f2449ace0: i32 = Constant<0>
try.c: 0x556f2449ace0: i32 = Constant<0>
try.c: 0x556f2449ace0: i32 = Constant<0>
try.c: 0x556f2449ace0: i32 = Constant<0>
try.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler | Implementations
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE | T:amd64

Compiler output

Implementation: T:avx2
Security model: timingleaks
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
try.c: fatal error: error in backend: Cannot select: 0x55ef4abb4180: v4i64 = X86ISD::VTRUNC 0x55ef4abb4050
try.c: 0x55ef4abb4050: v16i32 = vselect 0x55ef4abca6f0, 0x55ef4ab4f8f0, 0x55ef4abb3f20
try.c: 0x55ef4abca6f0: v4i1 = X86ISD::PCMPGTM 0x55ef4abafb60, 0x55ef4abab6f0
try.c: 0x55ef4abafb60: v4i64 = X86ISD::VBROADCAST 0x55ef4ab5ab50
try.c: 0x55ef4ab5ab50: i64,ch = load<LD8[%lsr.iv6971]> 0x55ef4aac0950, 0x55ef4ab9b2b0, undef:i64
try.c: 0x55ef4ab9b2b0: i64,ch = CopyFromReg 0x55ef4aac0950, Register:i64 %vreg50
try.c: 0x55ef4abab950: i64 = Register %vreg50
try.c: 0x55ef4ab5c020: i64 = undef
try.c: 0x55ef4abab6f0: v4i64,ch = CopyFromReg 0x55ef4aac0950, Register:v4i64 %vreg13
try.c: 0x55ef4abb03b0: v4i64 = Register %vreg13
try.c: 0x55ef4ab4f8f0: v16i32 = X86ISD::VBROADCAST 0x55ef4abafdc0
try.c: 0x55ef4abafdc0: i32,ch = load<LD4[ConstantPool]> 0x55ef4aac0950, 0x55ef4ab59990, undef:i64
try.c: 0x55ef4ab59990: i64 = X86ISD::WrapperRIP TargetConstantPool:i64<i32 1> 0
try.c: 0x55ef4ab53b90: i64 = TargetConstantPool<i32 1> 0
try.c: 0x55ef4ab5c020: i64 = undef
try.c: 0x55ef4abb3f20: v16i32 = BUILD_VECTOR Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>
try.c: 0x55ef4abb3df0: i32 = Constant<0>
try.c: 0x55ef4abb3df0: i32 = Constant<0>
try.c: 0x55ef4abb3df0: i32 = Constant<0>
try.c: 0x55ef4abb3df0: i32 = Constant<0>
try.c: 0x55ef4abb3df0: i32 = Constant<0>
try.c: 0x55ef4abb3df0: i32 = Constant<0>
try.c: 0x55ef4abb3df0: i32 = Constant<0>
try.c: 0x55ef4abb3df0: i32 = Constant<0>
try.c: 0x55ef4abb3df0: i32 = Constant<0>
try.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler | Implementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE | T:avx2

Compiler output

Implementation: T:avx2
Security model: timingleaks
Compiler: clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
try.c: fatal error: error in backend: Cannot select: 0x56504e9e8660: v4i64 = X86ISD::VTRUNC 0x56504e9e8530
try.c: 0x56504e9e8530: v16i32 = vselect 0x56504e9e3030, 0x56504e978040, 0x56504e9e8400
try.c: 0x56504e9e3030: v4i1 = X86ISD::PCMPGTM 0x56504e9e0010, 0x56504e9dbba0
try.c: 0x56504e9e0010: v4i64 = X86ISD::VBROADCAST 0x56504e978500
try.c: 0x56504e978500: i64,ch = load<LD8[%lsr.iv6971]> 0x56504e8d9a30, 0x56504e97b6d0, undef:i64
try.c: 0x56504e97b6d0: i64,ch = CopyFromReg 0x56504e8d9a30, Register:i64 %vreg50
try.c: 0x56504e9dbe00: i64 = Register %vreg50
try.c: 0x56504e96ea90: i64 = undef
try.c: 0x56504e9dbba0: v4i64,ch = CopyFromReg 0x56504e8d9a30, Register:v4i64 %vreg13
try.c: 0x56504e9e0860: v4i64 = Register %vreg13
try.c: 0x56504e978040: v16i32 = X86ISD::VBROADCAST 0x56504e9e0270
try.c: 0x56504e9e0270: i32,ch = load<LD4[ConstantPool]> 0x56504e8d9a30, 0x56504e982bf0, undef:i64
try.c: 0x56504e982bf0: i64 = X86ISD::WrapperRIP TargetConstantPool:i64<i32 1> 0
try.c: 0x56504e96f410: i64 = TargetConstantPool<i32 1> 0
try.c: 0x56504e96ea90: i64 = undef
try.c: 0x56504e9e8400: v16i32 = BUILD_VECTOR Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>
try.c: 0x56504e9e82d0: i32 = Constant<0>
try.c: 0x56504e9e82d0: i32 = Constant<0>
try.c: 0x56504e9e82d0: i32 = Constant<0>
try.c: 0x56504e9e82d0: i32 = Constant<0>
try.c: 0x56504e9e82d0: i32 = Constant<0>
try.c: 0x56504e9e82d0: i32 = Constant<0>
try.c: 0x56504e9e82d0: i32 = Constant<0>
try.c: 0x56504e9e82d0: i32 = Constant<0>
try.c: 0x56504e9e82d0: i32 = Constant<0>
try.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler | Implementations
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE | T:avx2

Compiler output

Implementation: T:avx2
Security model: timingleaks
Compiler: clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
try.c: fatal error: error in backend: Cannot select: 0x5556af2a6020: v4i64 = X86ISD::VTRUNC 0x5556af2a5ef0
try.c: 0x5556af2a5ef0: v16i32 = vselect 0x5556af297e20, 0x5556af247ac0, 0x5556af2a5dc0
try.c: 0x5556af297e20: v4i1 = X86ISD::PCMPGTM 0x5556af29f9f0, 0x5556af29b580
try.c: 0x5556af29f9f0: v4i64 = X86ISD::VBROADCAST 0x5556af240a60
try.c: 0x5556af240a60: i64,ch = load<LD8[%lsr.iv6971]> 0x5556af1b0920, 0x5556af2896a0, undef:i64
try.c: 0x5556af2896a0: i64,ch = CopyFromReg 0x5556af1b0920, Register:i64 %vreg50
try.c: 0x5556af29b7e0: i64 = Register %vreg50
try.c: 0x5556af246130: i64 = undef
try.c: 0x5556af29b580: v4i64,ch = CopyFromReg 0x5556af1b0920, Register:v4i64 %vreg13
try.c: 0x5556af2a0240: v4i64 = Register %vreg13
try.c: 0x5556af247ac0: v16i32 = X86ISD::VBROADCAST 0x5556af29fc50
try.c: 0x5556af29fc50: i32,ch = load<LD4[ConstantPool]> 0x5556af1b0920, 0x5556af240040, undef:i64
try.c: 0x5556af240040: i64 = X86ISD::WrapperRIP TargetConstantPool:i64<i32 1> 0
try.c: 0x5556af259000: i64 = TargetConstantPool<i32 1> 0
try.c: 0x5556af246130: i64 = undef
try.c: 0x5556af2a5dc0: v16i32 = BUILD_VECTOR Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>
try.c: 0x5556af2a5c90: i32 = Constant<0>
try.c: 0x5556af2a5c90: i32 = Constant<0>
try.c: 0x5556af2a5c90: i32 = Constant<0>
try.c: 0x5556af2a5c90: i32 = Constant<0>
try.c: 0x5556af2a5c90: i32 = Constant<0>
try.c: 0x5556af2a5c90: i32 = Constant<0>
try.c: 0x5556af2a5c90: i32 = Constant<0>
try.c: 0x5556af2a5c90: i32 = Constant<0>
try.c: 0x5556af2a5c90: i32 = Constant<0>
try.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler | Implementations
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE | T:avx2

Compiler output

Implementation: T:avx2
Security model: timingleaks
Compiler: clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
blas_comm.c: In file included from blas_comm.c:6:
blas_comm.c: In file included from ./blas.h:25:
blas_comm.c: ./blas_avx2.h:88:17: error: always_inline function '_mm256_loadu_si256' requires target feature 'sse4.2', but would be inlined into function 'gf256v_add_avx2' that is compiled without support for 'sse4.2'
blas_comm.c: __m256i inp = _mm256_loadu_si256( (__m256i*) (a+i*32) );
blas_comm.c: ^
blas_comm.c: ./blas_avx2.h:89:17: error: always_inline function '_mm256_loadu_si256' requires target feature 'sse4.2', but would be inlined into function 'gf256v_add_avx2' that is compiled without support for 'sse4.2'
blas_comm.c: __m256i out = _mm256_loadu_si256( (__m256i*) (accu_b+i*32) );
blas_comm.c: ^
blas_comm.c: ./blas_avx2.h:91:3: error: always_inline function '_mm256_storeu_si256' requires target feature 'sse4.2', but would be inlined into function 'gf256v_add_avx2' that is compiled without support for 'sse4.2'
blas_comm.c: _mm256_storeu_si256( (__m256i*) (accu_b+i*32) , out );
blas_comm.c: ^
blas_comm.c: fatal error: error in backend: Do not know how to split this operator's operand!
blas_comm.c:
blas_comm.c: clang: error: clang frontend command failed with exit code 70 (use -v to see invocation)
blas_comm.c: clang version 3.8.1-24 (tags/RELEASE_381/final)
blas_comm.c: Target: x86_64-pc-linux-gnu
blas_comm.c: Thread model: posix
blas_comm.c: InstalledDir: /usr/bin
blas_comm.c: clang: note: diagnostic msg: PLEASE submit a bug report to http://llvm.org/bugs/ and include the crash backtrace, preprocessed source, and associated run script.
blas_comm.c: clang: note: diagnostic msg:
blas_comm.c: ********************
blas_comm.c:
blas_comm.c: PLEASE ATTACH THE FOLLOWING FILES TO THE BUG REPORT:
blas_comm.c: Preprocessed source(s) and associated run script(s) are located at:
blas_comm.c: clang: note: diagnostic msg: /tmp/blas_comm-597657.c
blas_comm.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler | Implementations
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE | T:avx2
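
The "always_inline function ... requires target feature" errors above indicate that, under -mcpu=native, this clang compiled gf256v_add_avx2 without the vector extensions enabled. One common way to make such code independent of the command line is a per-function target attribute combined with a runtime CPU check. The following is only a sketch under that assumption: xor_bytes, xor_bytes_avx2 and xor_bytes_scalar are hypothetical names, not functions from the benchmarked implementation, and whether clang 3.8.1 would then get past its backend error is untested.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <immintrin.h>

/* Portable path: addition in GF(256) is a bytewise XOR. */
static void xor_bytes_scalar(uint8_t *accu_b, const uint8_t *a, size_t n)
{
    for (size_t i = 0; i < n; i++) accu_b[i] ^= a[i];
}

/* AVX2 path. The target attribute enables AVX2 for this one function, so the
   always_inline intrinsics may be inlined even if the command line enables
   no vector extension at all. */
__attribute__((target("avx2")))
static void xor_bytes_avx2(uint8_t *accu_b, const uint8_t *a, size_t n)
{
    size_t i = 0;
    for (; i + 32 <= n; i += 32) {
        __m256i x = _mm256_loadu_si256((const __m256i *)(a + i));
        __m256i y = _mm256_loadu_si256((const __m256i *)(accu_b + i));
        _mm256_storeu_si256((__m256i *)(accu_b + i), _mm256_xor_si256(x, y));
    }
    for (; i < n; i++) accu_b[i] ^= a[i];   /* tail shorter than 32 bytes */
}

/* Hypothetical dispatcher: takes the AVX2 path only if the running CPU has it. */
void xor_bytes(uint8_t *accu_b, const uint8_t *a, size_t n)
{
    assert(accu_b != NULL && a != NULL);
    if (__builtin_cpu_supports("avx2")) xor_bytes_avx2(accu_b, a, n);
    else xor_bytes_scalar(accu_b, a, n);
}
```

Both paths compute the same result, so a caller only sees xor_bytes; the attribute merely decides which instructions the compiler is allowed to emit inside the AVX2 variant.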

Compiler output

Implementation: T:avx2
Security model: timingleaks
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE
blas_matrix_avx2.c: blas_matrix_avx2.c: In function '_gf256mat_gauss_elim_avx2_32x':
blas_matrix_avx2.c: blas_matrix_avx2.c:479:19: warning: implicit declaration of function '_mm256_setr_m128i' [-Wimplicit-function-declaration]
blas_matrix_avx2.c: __m256i mask = _mm256_setr_m128i( add_mask , add_mask );
blas_matrix_avx2.c: ^~~~~~~~~~~~~~~~~
blas_matrix_avx2.c: blas_matrix_avx2.c:479:19: error: incompatible types when initializing type '__m256i {aka __vector(4) long long int}' using type 'int'
blas_matrix_avx2.c: blas_matrix_avx2.c:495:17: error: incompatible types when initializing type '__m256i {aka __vector(4) long long int}' using type 'int'
blas_matrix_avx2.c: __m256i pi2 = _mm256_setr_m128i( pi1 , pi1 );
blas_matrix_avx2.c: ^~~~~~~~~~~~~~~~~

Number of similar (compiler,implementation) pairs: 4, namely:
Compiler | Implementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE | T:avx2
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE | T:avx2
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE | T:avx2
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE | T:avx2
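
The implicit-declaration warning above suggests that the immintrin.h shipped with this gcc does not declare _mm256_setr_m128i; the intrinsic appears to be absent from gcc releases before 8. A possible workaround, sketched here as an assumption and not taken from the benchmarked source, is to build the 256-bit value from AVX intrinsics that older headers do declare. The names setr_m128i_compat, dup128_avx and check_dup128 are hypothetical.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>
#include <immintrin.h>

/* Hypothetical stand-in for _mm256_setr_m128i(lo, hi): lo fills bits 0..127,
   hi fills bits 128..255. */
__attribute__((target("avx")))
static __m256i setr_m128i_compat(__m128i lo, __m128i hi)
{
    return _mm256_insertf128_si256(_mm256_castsi128_si256(lo), hi, 1);
}

/* Duplicate a 16-byte block into both halves of a 32-byte buffer, mirroring
   the calls of the form _mm256_setr_m128i(x, x) quoted in the log. */
__attribute__((target("avx")))
static void dup128_avx(uint8_t out[32], const uint8_t in[16])
{
    __m128i x = _mm_loadu_si128((const __m128i *)in);
    _mm256_storeu_si256((__m256i *)out, setr_m128i_compat(x, x));
}

/* Returns 1 if both halves of the result equal the input, 0 otherwise.
   On a CPU without AVX the check is skipped and 1 is returned. */
int check_dup128(void)
{
    uint8_t in[16], out[32];
    assert(sizeof(__m256i) == 32);
    for (int i = 0; i < 16; i++) in[i] = (uint8_t)(i * 7 + 1);
    if (!__builtin_cpu_supports("avx")) return 1;
    dup128_avx(out, in);
    return memcmp(out, in, 16) == 0 && memcmp(out + 16, in, 16) == 0;
}
```

In a real build the helper would more likely sit behind a compiler-version check so that newer compilers keep using the native intrinsic; that guard is left out here to keep the sketch short.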

Compiler output

Implementation: T:ref
Security model: timingleaks
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
try.c: fatal error: error in backend: Cannot select: 0x5594171b9190: v4i64 = X86ISD::VTRUNC 0x5594171b9060
try.c: 0x5594171b9060: v16i32 = vselect 0x5594171bc660, 0x55941713f490, 0x5594171b8f30
try.c: 0x5594171bc660: v4i1 = X86ISD::PCMPGTM 0x55941719e9c0, 0x55941719a550
try.c: 0x55941719e9c0: v4i64 = X86ISD::VBROADCAST 0x559417142310
try.c: 0x559417142310: i64,ch = load<LD8[%lsr.iv6971]> 0x5594170af940, 0x559417188290, undef:i64
try.c: 0x559417188290: i64,ch = CopyFromReg 0x5594170af940, Register:i64 %vreg50
try.c: 0x55941719a7b0: i64 = Register %vreg50
try.c: 0x55941713db00: i64 = undef
try.c: 0x55941719a550: v4i64,ch = CopyFromReg 0x5594170af940, Register:v4i64 %vreg13
try.c: 0x55941719f210: v4i64 = Register %vreg13
try.c: 0x55941713f490: v16i32 = X86ISD::VBROADCAST 0x55941719ec20
try.c: 0x55941719ec20: i32,ch = load<LD4[ConstantPool]> 0x5594170af940, 0x5594171418f0, undef:i64
try.c: 0x5594171418f0: i64 = X86ISD::WrapperRIP TargetConstantPool:i64<i32 1> 0
try.c: 0x5594171894e0: i64 = TargetConstantPool<i32 1> 0
try.c: 0x55941713db00: i64 = undef
try.c: 0x5594171b8f30: v16i32 = BUILD_VECTOR Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>
try.c: 0x5594171b8e00: i32 = Constant<0>
try.c: 0x5594171b8e00: i32 = Constant<0>
try.c: 0x5594171b8e00: i32 = Constant<0>
try.c: 0x5594171b8e00: i32 = Constant<0>
try.c: 0x5594171b8e00: i32 = Constant<0>
try.c: 0x5594171b8e00: i32 = Constant<0>
try.c: 0x5594171b8e00: i32 = Constant<0>
try.c: 0x5594171b8e00: i32 = Constant<0>
try.c: 0x5594171b8e00: i32 = Constant<0>
try.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler | Implementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE | T:ref

Compiler output

Implementation: T:ref
Security model: timingleaks
Compiler: clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
try.c: fatal error: error in backend: Cannot select: 0x56529d3318a0: v4i64 = X86ISD::VTRUNC 0x56529d331770
try.c: 0x56529d331770: v16i32 = vselect 0x56529d342420, 0x56529d2dad50, 0x56529d331640
try.c: 0x56529d342420: v4i1 = X86ISD::PCMPGTM 0x56529d32c270, 0x56529d327e00
try.c: 0x56529d32c270: v4i64 = X86ISD::VBROADCAST 0x56529d2db210
try.c: 0x56529d2db210: i64,ch = load<LD8[%lsr.iv6971]> 0x56529d225a30, 0x56529d2cd240, undef:i64
try.c: 0x56529d2cd240: i64,ch = CopyFromReg 0x56529d225a30, Register:i64 %vreg50
try.c: 0x56529d328060: i64 = Register %vreg50
try.c: 0x56529d2c7480: i64 = undef
try.c: 0x56529d327e00: v4i64,ch = CopyFromReg 0x56529d225a30, Register:v4i64 %vreg13
try.c: 0x56529d32cac0: v4i64 = Register %vreg13
try.c: 0x56529d2dad50: v16i32 = X86ISD::VBROADCAST 0x56529d32c4d0
try.c: 0x56529d32c4d0: i32,ch = load<LD4[ConstantPool]> 0x56529d225a30, 0x56529d2cb810, undef:i64
try.c: 0x56529d2cb810: i64 = X86ISD::WrapperRIP TargetConstantPool:i64<i32 1> 0
try.c: 0x56529d2c7e00: i64 = TargetConstantPool<i32 1> 0
try.c: 0x56529d2c7480: i64 = undef
try.c: 0x56529d331640: v16i32 = BUILD_VECTOR Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>
try.c: 0x56529d331510: i32 = Constant<0>
try.c: 0x56529d331510: i32 = Constant<0>
try.c: 0x56529d331510: i32 = Constant<0>
try.c: 0x56529d331510: i32 = Constant<0>
try.c: 0x56529d331510: i32 = Constant<0>
try.c: 0x56529d331510: i32 = Constant<0>
try.c: 0x56529d331510: i32 = Constant<0>
try.c: 0x56529d331510: i32 = Constant<0>
try.c: 0x56529d331510: i32 = Constant<0>
try.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler | Implementations
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE | T:ref

Compiler output

Implementation: T:ref
Security model: timingleaks
Compiler: clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
try.c: fatal error: error in backend: Cannot select: 0x55939e1bd7a0: v4i64 = X86ISD::VTRUNC 0x55939e1bd670
try.c: 0x55939e1bd670: v16i32 = vselect 0x55939e1b0f20, 0x55939e167a70, 0x55939e1bd540
try.c: 0x55939e1b0f20: v4i1 = X86ISD::PCMPGTM 0x55939e1b8af0, 0x55939e1b4680
try.c: 0x55939e1b8af0: v4i64 = X86ISD::VBROADCAST 0x55939e15f890
try.c: 0x55939e15f890: i64,ch = load<LD8[%lsr.iv6971]> 0x55939e0c99b0, 0x55939e1a41c0, undef:i64
try.c: 0x55939e1a41c0: i64,ch = CopyFromReg 0x55939e0c99b0, Register:i64 %vreg50
try.c: 0x55939e1b48e0: i64 = Register %vreg50
try.c: 0x55939e1660e0: i64 = undef
try.c: 0x55939e1b4680: v4i64,ch = CopyFromReg 0x55939e0c99b0, Register:v4i64 %vreg13
try.c: 0x55939e1b9340: v4i64 = Register %vreg13
try.c: 0x55939e167a70: v16i32 = X86ISD::VBROADCAST 0x55939e1b8d50
try.c: 0x55939e1b8d50: i32,ch = load<LD4[ConstantPool]> 0x55939e0c99b0, 0x55939e15ee70, undef:i64
try.c: 0x55939e15ee70: i64 = X86ISD::WrapperRIP TargetConstantPool:i64<i32 1> 0
try.c: 0x55939e1a28c0: i64 = TargetConstantPool<i32 1> 0
try.c: 0x55939e1660e0: i64 = undef
try.c: 0x55939e1bd540: v16i32 = BUILD_VECTOR Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>
try.c: 0x55939e1bd410: i32 = Constant<0>
try.c: 0x55939e1bd410: i32 = Constant<0>
try.c: 0x55939e1bd410: i32 = Constant<0>
try.c: 0x55939e1bd410: i32 = Constant<0>
try.c: 0x55939e1bd410: i32 = Constant<0>
try.c: 0x55939e1bd410: i32 = Constant<0>
try.c: 0x55939e1bd410: i32 = Constant<0>
try.c: 0x55939e1bd410: i32 = Constant<0>
try.c: 0x55939e1bd410: i32 = Constant<0>
try.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler | Implementations
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE | T:ref

Compiler output

Implementation: T:ssse3
Security model: timingleaks
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
try.c: fatal error: error in backend: Cannot select: 0x56102735d4e0: v4i64 = X86ISD::VTRUNC 0x56102735d3b0
try.c: 0x56102735d3b0: v16i32 = vselect 0x56102734bf80, 0x5610272f9fe0, 0x56102735d280
try.c: 0x56102734bf80: v4i1 = X86ISD::PCMPGTM 0x561027353b50, 0x56102734f6e0
try.c: 0x561027353b50: v4i64 = X86ISD::VBROADCAST 0x5610272f7180
try.c: 0x5610272f7180: i64,ch = load<LD8[%lsr.iv6971]> 0x561027264960, 0x561027346750, undef:i64
try.c: 0x561027346750: i64,ch = CopyFromReg 0x561027264960, Register:i64 %vreg50
try.c: 0x56102734f940: i64 = Register %vreg50
try.c: 0x5610272f8650: i64 = undef
try.c: 0x56102734f6e0: v4i64,ch = CopyFromReg 0x561027264960, Register:v4i64 %vreg13
try.c: 0x5610273543a0: v4i64 = Register %vreg13
try.c: 0x5610272f9fe0: v16i32 = X86ISD::VBROADCAST 0x561027353db0
try.c: 0x561027353db0: i32,ch = load<LD4[ConstantPool]> 0x561027264960, 0x5610272fcc70, undef:i64
try.c: 0x5610272fcc70: i64 = X86ISD::WrapperRIP TargetConstantPool:i64<i32 1> 0
try.c: 0x56102733d4e0: i64 = TargetConstantPool<i32 1> 0
try.c: 0x5610272f8650: i64 = undef
try.c: 0x56102735d280: v16i32 = BUILD_VECTOR Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>
try.c: 0x56102735d150: i32 = Constant<0>
try.c: 0x56102735d150: i32 = Constant<0>
try.c: 0x56102735d150: i32 = Constant<0>
try.c: 0x56102735d150: i32 = Constant<0>
try.c: 0x56102735d150: i32 = Constant<0>
try.c: 0x56102735d150: i32 = Constant<0>
try.c: 0x56102735d150: i32 = Constant<0>
try.c: 0x56102735d150: i32 = Constant<0>
try.c: 0x56102735d150: i32 = Constant<0>
try.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler | Implementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE | T:ssse3

Compiler output

Implementation: T:ssse3
Security model: timingleaks
Compiler: clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
try.c: fatal error: error in backend: Cannot select: 0x55f697e3a4d0: v4i64 = X86ISD::VTRUNC 0x55f697e3a3a0
try.c: 0x55f697e3a3a0: v16i32 = vselect 0x55f697e340f0, 0x55f697dc1740, 0x55f697e3a270
try.c: 0x55f697e340f0: v4i1 = X86ISD::PCMPGTM 0x55f697e150c0, 0x55f697e10c50
try.c: 0x55f697e150c0: v4i64 = X86ISD::VBROADCAST 0x55f697dc1c00
try.c: 0x55f697dc1c00: i64,ch = load<LD8[%lsr.iv6971]> 0x55f697d0ea40, 0x55f697dc4ab0, undef:i64
try.c: 0x55f697dc4ab0: i64,ch = CopyFromReg 0x55f697d0ea40, Register:i64 %vreg50
try.c: 0x55f697e10eb0: i64 = Register %vreg50
try.c: 0x55f697da1b50: i64 = undef
try.c: 0x55f697e10c50: v4i64,ch = CopyFromReg 0x55f697d0ea40, Register:v4i64 %vreg13
try.c: 0x55f697e15910: v4i64 = Register %vreg13
try.c: 0x55f697dc1740: v16i32 = X86ISD::VBROADCAST 0x55f697e15320
try.c: 0x55f697e15320: i32,ch = load<LD4[ConstantPool]> 0x55f697d0ea40, 0x55f697db1ea0, undef:i64
try.c: 0x55f697db1ea0: i64 = X86ISD::WrapperRIP TargetConstantPool:i64<i32 1> 0
try.c: 0x55f697da24d0: i64 = TargetConstantPool<i32 1> 0
try.c: 0x55f697da1b50: i64 = undef
try.c: 0x55f697e3a270: v16i32 = BUILD_VECTOR Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>
try.c: 0x55f697e3a140: i32 = Constant<0>
try.c: 0x55f697e3a140: i32 = Constant<0>
try.c: 0x55f697e3a140: i32 = Constant<0>
try.c: 0x55f697e3a140: i32 = Constant<0>
try.c: 0x55f697e3a140: i32 = Constant<0>
try.c: 0x55f697e3a140: i32 = Constant<0>
try.c: 0x55f697e3a140: i32 = Constant<0>
try.c: 0x55f697e3a140: i32 = Constant<0>
try.c: 0x55f697e3a140: i32 = Constant<0>
try.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler | Implementations
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE | T:ssse3

Compiler output

Implementation: T:ssse3
Security model: timingleaks
Compiler: clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
try.c: fatal error: error in backend: Cannot select: 0x559a26133d30: v4i64 = X86ISD::VTRUNC 0x559a26133c00
try.c: 0x559a26133c00: v16i32 = vselect 0x559a26149290, 0x559a260d2970, 0x559a26133ad0
try.c: 0x559a26149290: v4i1 = X86ISD::PCMPGTM 0x559a2612e700, 0x559a2612a290
try.c: 0x559a2612e700: v4i64 = X86ISD::VBROADCAST 0x559a260dc160
try.c: 0x559a260dc160: i64,ch = load<LD8[%lsr.iv6971]> 0x559a2603f950, 0x559a261250f0, undef:i64
try.c: 0x559a261250f0: i64,ch = CopyFromReg 0x559a2603f950, Register:i64 %vreg50
try.c: 0x559a2612a4f0: i64 = Register %vreg50
try.c: 0x559a260dd630: i64 = undef
try.c: 0x559a2612a290: v4i64,ch = CopyFromReg 0x559a2603f950, Register:v4i64 %vreg13
try.c: 0x559a2612ef50: v4i64 = Register %vreg13
try.c: 0x559a260d2970: v16i32 = X86ISD::VBROADCAST 0x559a2612e960
try.c: 0x559a2612e960: i32,ch = load<LD4[ConstantPool]> 0x559a2603f950, 0x559a260cff70, undef:i64
try.c: 0x559a260cff70: i64 = X86ISD::WrapperRIP TargetConstantPool:i64<i32 1> 0
try.c: 0x559a260db0b0: i64 = TargetConstantPool<i32 1> 0
try.c: 0x559a260dd630: i64 = undef
try.c: 0x559a26133ad0: v16i32 = BUILD_VECTOR Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>
try.c: 0x559a261339a0: i32 = Constant<0>
try.c: 0x559a261339a0: i32 = Constant<0>
try.c: 0x559a261339a0: i32 = Constant<0>
try.c: 0x559a261339a0: i32 = Constant<0>
try.c: 0x559a261339a0: i32 = Constant<0>
try.c: 0x559a261339a0: i32 = Constant<0>
try.c: 0x559a261339a0: i32 = Constant<0>
try.c: 0x559a261339a0: i32 = Constant<0>
try.c: 0x559a261339a0: i32 = Constant<0>
try.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler | Implementations
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE | T:ssse3

Compiler output

Implementation: T:ssse3
Security model: timingleaks
Compiler: clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
blas_matrix_ref.c: In file included from blas_matrix_ref.c:6:
blas_matrix_ref.c: In file included from ./blas.h:25:
blas_matrix_ref.c: In file included from ./blas_sse.h:16:
blas_matrix_ref.c: ./gf16_sse.h:34:9: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'linear_transform_8x8_128b' that is compiled without support for 'ssse3'
blas_matrix_ref.c: return _mm_shuffle_epi8(tab_l,v&mask_f)^_mm_shuffle_epi8(tab_h,_mm_srli_epi16(v,4)&mask_f);
blas_matrix_ref.c: ^
blas_matrix_ref.c: ./gf16_sse.h:34:42: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'linear_transform_8x8_128b' that is compiled without support for 'ssse3'
blas_matrix_ref.c: return _mm_shuffle_epi8(tab_l,v&mask_f)^_mm_shuffle_epi8(tab_h,_mm_srli_epi16(v,4)&mask_f);
blas_matrix_ref.c: ^
blas_matrix_ref.c: fatal error: error in backend: Cannot select: 0x5586fa1d4ba0: v16i8 = X86ISD::PSHUFB 0x5586fa200bf0, 0x5586fa1911b0
blas_matrix_ref.c: 0x5586fa200bf0: v16i8,ch = CopyFromReg 0x5586fa0cd5a0, Register:v16i8 %vreg15
blas_matrix_ref.c: 0x5586fa1905d0: v16i8 = Register %vreg15
blas_matrix_ref.c: 0x5586fa1911b0: v16i8 = bitcast 0x5586fa208a20
blas_matrix_ref.c: 0x5586fa208a20: v2i64 = and 0x5586fa1f94b0, 0x5586fa2093a0
blas_matrix_ref.c: 0x5586fa1f94b0: v2i64 = bitcast 0x5586fa1f8b30
blas_matrix_ref.c: 0x5586fa1f8b30: v8i16 = X86ISD::VSRLI 0x5586fa2010b0, Constant:i8<4>
blas_matrix_ref.c: 0x5586fa2010b0: v8i16 = bitcast 0x5586fa1f8ff0
blas_matrix_ref.c: 0x5586fa1f8ff0: v2i64,ch = load<LD16[%uglygep1617](align=1)(tbaa=<0x5586f9d52628>)> 0x5586fa0cd5a0, 0x5586fa208560, undef:i64
blas_matrix_ref.c: 0x5586fa208560: i64 = add 0x5586fa1f8c60, 0x5586fa1f9710
blas_matrix_ref.c: 0x5586fa1f8c60: i64,ch = CopyFromReg 0x5586fa0cd5a0, Register:i64 %vreg12
blas_matrix_ref.c: 0x5586fa1f9970: i64 = Register %vreg12
blas_matrix_ref.c: 0x5586fa1f9710: i64,ch = CopyFromReg 0x5586fa0cd5a0, Register:i64 %vreg17
blas_matrix_ref.c: 0x5586fa1f9250: i64 = Register %vreg17
blas_matrix_ref.c: 0x5586fa2011e0: i64 = undef
blas_matrix_ref.c: 0x5586fa191080: i8 = Constant<4>
blas_matrix_ref.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler | Implementations
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE | T:ssse3
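
The errors above report that linear_transform_8x8_128b was compiled without support for ssse3 although it calls _mm_shuffle_epi8. One way a source file can protect itself against such a flag combination is to test the predefined macro __SSSE3__, which gcc and clang define only when SSSE3 code generation is enabled, and to fall back to scalar code otherwise. The sketch below illustrates that guard on a 16-entry nibble table lookup; nibble_lookup16 and built_with_ssse3 are hypothetical names and not part of the benchmarked implementation.

```c
#include <assert.h>
#include <stdint.h>
#if defined(__SSSE3__)
#include <tmmintrin.h>   /* _mm_shuffle_epi8 */
#endif

/* Look up the low nibble of each of the 16 bytes in v in a 16-entry table.
   PSHUFB is used only when the compiler reports SSSE3 as enabled. */
void nibble_lookup16(uint8_t out[16], const uint8_t tab[16], const uint8_t v[16])
{
    assert(out && tab && v);
#if defined(__SSSE3__)
    __m128i t  = _mm_loadu_si128((const __m128i *)tab);
    __m128i x  = _mm_loadu_si128((const __m128i *)v);
    __m128i lo = _mm_and_si128(x, _mm_set1_epi8(0x0f));  /* keep index bits only */
    _mm_storeu_si128((__m128i *)out, _mm_shuffle_epi8(t, lo));
#else
    for (int i = 0; i < 16; i++) out[i] = tab[v[i] & 0x0f];
#endif
}

/* 1 if this translation unit was built with SSSE3 enabled, 0 if the scalar
   fallback was compiled instead. */
int built_with_ssse3(void)
{
#if defined(__SSSE3__)
    return 1;
#else
    return 0;
#endif
}
```

A guard of this kind trades the hard compile error for a silent scalar fallback, which may or may not be desirable for an implementation that is explicitly labelled ssse3.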