Implementation notes: amd64, cel02, crypto_hash/fugue384

Computer: cel02
Architecture: amd64
CPU ID: GenuineIntel-00050657-bfebfbff
SUPERCOP version: 20201130
Operation: crypto_hash
Primitive: fugue384
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
3532234330 0 042924 792 736T:ccalik/aesniclang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020121120201130
3949433794 0 048037 824 808T:ccalik/aesnigcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020121120201130
4044628245 0 039013 824 776T:ccalik/aesnigcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020121120201130
4220825477 0 035200 800 776T:ccalik/aesnigcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020121120201130
4274239500 0 050581 824 776T:ccalik/vpermgcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020121120201130
4340431124 0 042205 824 776T:ccalik/aesnigcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020121120201130
5438642283 0 056525 824 808T:ccalik/vpermgcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020121120201130
6111234030 0 043744 800 776T:ccalik/vpermgcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020121120201130
6154036847 0 047621 824 776T:ccalik/vpermgcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2020121120201130
6258845023 0 052980 792 736T:ccalik/vpermclang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2020121120201130

Compiler output

Implementation: T:ccalik/aesni
Security model: timingleaks
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
try.c: fatal error: error in backend: Cannot select: 0x556c46e63c40: v4i64 = X86ISD::VTRUNC 0x556c46e63b10
try.c: 0x556c46e63b10: v16i32 = vselect 0x556c46e60620, 0x556c46e03920, 0x556c46e639e0
try.c: 0x556c46e60620: v4i1 = X86ISD::PCMPGTM 0x556c46e5b9d0, 0x556c46e57560
try.c: 0x556c46e5b9d0: v4i64 = X86ISD::VBROADCAST 0x556c46dffb10
try.c: 0x556c46dffb10: i64,ch = load<LD8[%lsr.iv6971]> 0x556c46d6c950, 0x556c46e45d50, undef:i64
try.c: 0x556c46e45d50: i64,ch = CopyFromReg 0x556c46d6c950, Register:i64 %vreg50
try.c: 0x556c46e577c0: i64 = Register %vreg50
try.c: 0x556c46e01f90: i64 = undef
try.c: 0x556c46e57560: v4i64,ch = CopyFromReg 0x556c46d6c950, Register:v4i64 %vreg13
try.c: 0x556c46e5c220: v4i64 = Register %vreg13
try.c: 0x556c46e03920: v16i32 = X86ISD::VBROADCAST 0x556c46e5bc30
try.c: 0x556c46e5bc30: i32,ch = load<LD4[ConstantPool]> 0x556c46d6c950, 0x556c46dff0f0, undef:i64
try.c: 0x556c46dff0f0: i64 = X86ISD::WrapperRIP TargetConstantPool:i64<i32 1> 0
try.c: 0x556c46de4e40: i64 = TargetConstantPool<i32 1> 0
try.c: 0x556c46e01f90: i64 = undef
try.c: 0x556c46e639e0: v16i32 = BUILD_VECTOR Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>
try.c: 0x556c46e638b0: i32 = Constant<0>
try.c: 0x556c46e638b0: i32 = Constant<0>
try.c: 0x556c46e638b0: i32 = Constant<0>
try.c: 0x556c46e638b0: i32 = Constant<0>
try.c: 0x556c46e638b0: i32 = Constant<0>
try.c: 0x556c46e638b0: i32 = Constant<0>
try.c: 0x556c46e638b0: i32 = Constant<0>
try.c: 0x556c46e638b0: i32 = Constant<0>
try.c: 0x556c46e638b0: i32 = Constant<0>
try.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:ccalik/aesni

Compiler output

Implementation: T:ccalik/aesni
Security model: timingleaks
Compiler: clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
try.c: fatal error: error in backend: Cannot select: 0x563e148fa350: v4i64 = X86ISD::VTRUNC 0x563e148fa220
try.c: 0x563e148fa220: v16i32 = vselect 0x563e148f4d20, 0x563e1487d330, 0x563e148fa0f0
try.c: 0x563e148f4d20: v4i1 = X86ISD::PCMPGTM 0x563e148da340, 0x563e148d68d0
try.c: 0x563e148da340: v4i64 = X86ISD::VBROADCAST 0x563e1487d7f0
try.c: 0x563e1487d7f0: i64,ch = load<LD8[%lsr.iv6971]> 0x563e147d4a30, 0x563e14878060, undef:i64
try.c: 0x563e14878060: i64,ch = CopyFromReg 0x563e147d4a30, Register:i64 %vreg50
try.c: 0x563e148d6b30: i64 = Register %vreg50
try.c: 0x563e1489bed0: i64 = undef
try.c: 0x563e148d68d0: v4i64,ch = CopyFromReg 0x563e147d4a30, Register:v4i64 %vreg13
try.c: 0x563e148dab90: v4i64 = Register %vreg13
try.c: 0x563e1487d330: v16i32 = X86ISD::VBROADCAST 0x563e148da5a0
try.c: 0x563e148da5a0: i32,ch = load<LD4[ConstantPool]> 0x563e147d4a30, 0x563e14875a10, undef:i64
try.c: 0x563e14875a10: i64 = X86ISD::WrapperRIP TargetConstantPool:i64<i32 1> 0
try.c: 0x563e1489c850: i64 = TargetConstantPool<i32 1> 0
try.c: 0x563e1489bed0: i64 = undef
try.c: 0x563e148fa0f0: v16i32 = BUILD_VECTOR Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>
try.c: 0x563e148f9fc0: i32 = Constant<0>
try.c: 0x563e148f9fc0: i32 = Constant<0>
try.c: 0x563e148f9fc0: i32 = Constant<0>
try.c: 0x563e148f9fc0: i32 = Constant<0>
try.c: 0x563e148f9fc0: i32 = Constant<0>
try.c: 0x563e148f9fc0: i32 = Constant<0>
try.c: 0x563e148f9fc0: i32 = Constant<0>
try.c: 0x563e148f9fc0: i32 = Constant<0>
try.c: 0x563e148f9fc0: i32 = Constant<0>
try.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:ccalik/aesni

Compiler output

Implementation: T:ccalik/aesni
Security model: timingleaks
Compiler: clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
try.c: fatal error: error in backend: Cannot select: 0x55649f2a4fe0: v4i64 = X86ISD::VTRUNC 0x55649f2a4eb0
try.c: 0x55649f2a4eb0: v16i32 = vselect 0x55649f2b7530, 0x55649f23e410, 0x55649f2a4d80
try.c: 0x55649f2b7530: v4i1 = X86ISD::PCMPGTM 0x55649f29d9a0, 0x55649f299530
try.c: 0x55649f29d9a0: v4i64 = X86ISD::VBROADCAST 0x55649f264290
try.c: 0x55649f264290: i64,ch = load<LD8[%lsr.iv6971]> 0x55649f1ae950, 0x55649f2887b0, undef:i64
try.c: 0x55649f2887b0: i64,ch = CopyFromReg 0x55649f1ae950, Register:i64 %vreg50
try.c: 0x55649f299790: i64 = Register %vreg50
try.c: 0x55649f23ca80: i64 = undef
try.c: 0x55649f299530: v4i64,ch = CopyFromReg 0x55649f1ae950, Register:v4i64 %vreg13
try.c: 0x55649f29e1f0: v4i64 = Register %vreg13
try.c: 0x55649f23e410: v16i32 = X86ISD::VBROADCAST 0x55649f29dc00
try.c: 0x55649f29dc00: i32,ch = load<LD4[ConstantPool]> 0x55649f1ae950, 0x55649f263870, undef:i64
try.c: 0x55649f263870: i64 = X86ISD::WrapperRIP TargetConstantPool:i64<i32 1> 0
try.c: 0x55649f25f670: i64 = TargetConstantPool<i32 1> 0
try.c: 0x55649f23ca80: i64 = undef
try.c: 0x55649f2a4d80: v16i32 = BUILD_VECTOR Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>
try.c: 0x55649f2a4c50: i32 = Constant<0>
try.c: 0x55649f2a4c50: i32 = Constant<0>
try.c: 0x55649f2a4c50: i32 = Constant<0>
try.c: 0x55649f2a4c50: i32 = Constant<0>
try.c: 0x55649f2a4c50: i32 = Constant<0>
try.c: 0x55649f2a4c50: i32 = Constant<0>
try.c: 0x55649f2a4c50: i32 = Constant<0>
try.c: 0x55649f2a4c50: i32 = Constant<0>
try.c: 0x55649f2a4c50: i32 = Constant<0>
try.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:ccalik/aesni

Compiler output

Implementation: T:ccalik/aesni
Security model: timingleaks
Compiler: clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
hash.c: hash.c:449:4: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'Compress256' that is compiled without support for 'ssse3'
hash.c: SUBROUND256_2(ctx->state[8], ctx->state[9], ctx->state[7], ctx->state[2], ctx->state[7], ctx->state[8], ctx->state[6], ctx->state[1]);
hash.c: ^
hash.c: hash.c:340:2: note: expanded from macro 'SUBROUND256_2'
hash.c: SUBSTITUTE(r0, _t1, _t2, _t3, _t0);\
hash.c: ^
hash.c: hash.c:249:10: note: expanded from macro 'SUBSTITUTE'
hash.c: _t2 = _mm_shuffle_epi8(r0, M128(_inv_shift_rows));\
hash.c: ^
hash.c: hash.c:449:4: error: always_inline function '_mm_aesenclast_si128' requires target feature 'aes', but would be inlined into function 'Compress256' that is compiled without support for 'aes'
hash.c: hash.c:340:2: note: expanded from macro 'SUBROUND256_2'
hash.c: SUBSTITUTE(r0, _t1, _t2, _t3, _t0);\
hash.c: ^
hash.c: hash.c:250:10: note: expanded from macro 'SUBSTITUTE'
hash.c: _t2 = _mm_aesenclast_si128(_t2, M128(_zero))
hash.c: ^
hash.c: hash.c:449:4: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'Compress256' that is compiled without support for 'ssse3'
hash.c: hash.c:341:2: note: expanded from macro 'SUBROUND256_2'
hash.c: SUPERMIX(_t2, _t3, _t0, _t1, r0);\
hash.c: ^
hash.c: hash.c:253:2: note: expanded from macro 'SUPERMIX'
hash.c: PRESUPERMIX(t0, t1, t2, t3, t4);\
hash.c: ^
hash.c: hash.c:245:25: note: expanded from macro 'PRESUPERMIX'
hash.c: s2 = _mm_xor_si128(s2, _mm_shuffle_epi8(M128(_mul2mask), t1));\
hash.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:ccalik/aesni

Compiler output

Implementation: T:ccalik/vperm
Security model: timingleaks
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
try.c: fatal error: error in backend: Cannot select: 0x56313745cc30: v4i64 = X86ISD::VTRUNC 0x56313745cb00
try.c: 0x56313745cb00: v16i32 = vselect 0x563137459610, 0x5631373f8c60, 0x56313745c9d0
try.c: 0x563137459610: v4i1 = X86ISD::PCMPGTM 0x563137454ad0, 0x563137450660
try.c: 0x563137454ad0: v4i64 = X86ISD::VBROADCAST 0x5631373fb460
try.c: 0x5631373fb460: i64,ch = load<LD8[%lsr.iv6971]> 0x563137365940, 0x56313743fb90, undef:i64
try.c: 0x56313743fb90: i64,ch = CopyFromReg 0x563137365940, Register:i64 %vreg50
try.c: 0x5631374508c0: i64 = Register %vreg50
try.c: 0x5631373f72d0: i64 = undef
try.c: 0x563137450660: v4i64,ch = CopyFromReg 0x563137365940, Register:v4i64 %vreg13
try.c: 0x563137455320: v4i64 = Register %vreg13
try.c: 0x5631373f8c60: v16i32 = X86ISD::VBROADCAST 0x563137454d30
try.c: 0x563137454d30: i32,ch = load<LD4[ConstantPool]> 0x563137365940, 0x5631373faa40, undef:i64
try.c: 0x5631373faa40: i64 = X86ISD::WrapperRIP TargetConstantPool:i64<i32 1> 0
try.c: 0x5631373f56d0: i64 = TargetConstantPool<i32 1> 0
try.c: 0x5631373f72d0: i64 = undef
try.c: 0x56313745c9d0: v16i32 = BUILD_VECTOR Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>
try.c: 0x56313745c8a0: i32 = Constant<0>
try.c: 0x56313745c8a0: i32 = Constant<0>
try.c: 0x56313745c8a0: i32 = Constant<0>
try.c: 0x56313745c8a0: i32 = Constant<0>
try.c: 0x56313745c8a0: i32 = Constant<0>
try.c: 0x56313745c8a0: i32 = Constant<0>
try.c: 0x56313745c8a0: i32 = Constant<0>
try.c: 0x56313745c8a0: i32 = Constant<0>
try.c: 0x56313745c8a0: i32 = Constant<0>
try.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:ccalik/vperm

Compiler output

Implementation: T:ccalik/vperm
Security model: timingleaks
Compiler: clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
try.c: fatal error: error in backend: Cannot select: 0x5608580ccc20: v4i64 = X86ISD::VTRUNC 0x5608580ccaf0
try.c: 0x5608580ccaf0: v16i32 = vselect 0x5608580c75f0, 0x560858061290, 0x5608580cc9c0
try.c: 0x5608580c75f0: v4i1 = X86ISD::PCMPGTM 0x5608580c55d0, 0x5608580c1160
try.c: 0x5608580c55d0: v4i64 = X86ISD::VBROADCAST 0x560858061750
try.c: 0x560858061750: i64,ch = load<LD8[%lsr.iv6971]> 0x560857fbea30, 0x560858059fa0, undef:i64
try.c: 0x560858059fa0: i64,ch = CopyFromReg 0x560857fbea30, Register:i64 %vreg50
try.c: 0x5608580c13c0: i64 = Register %vreg50
try.c: 0x5608580640c0: i64 = undef
try.c: 0x5608580c1160: v4i64,ch = CopyFromReg 0x560857fbea30, Register:v4i64 %vreg13
try.c: 0x5608580c5e20: v4i64 = Register %vreg13
try.c: 0x560858061290: v16i32 = X86ISD::VBROADCAST 0x5608580c5830
try.c: 0x5608580c5830: i32,ch = load<LD4[ConstantPool]> 0x560857fbea30, 0x560858057d60, undef:i64
try.c: 0x560858057d60: i64 = X86ISD::WrapperRIP TargetConstantPool:i64<i32 1> 0
try.c: 0x560858064a40: i64 = TargetConstantPool<i32 1> 0
try.c: 0x5608580640c0: i64 = undef
try.c: 0x5608580cc9c0: v16i32 = BUILD_VECTOR Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>
try.c: 0x5608580cc890: i32 = Constant<0>
try.c: 0x5608580cc890: i32 = Constant<0>
try.c: 0x5608580cc890: i32 = Constant<0>
try.c: 0x5608580cc890: i32 = Constant<0>
try.c: 0x5608580cc890: i32 = Constant<0>
try.c: 0x5608580cc890: i32 = Constant<0>
try.c: 0x5608580cc890: i32 = Constant<0>
try.c: 0x5608580cc890: i32 = Constant<0>
try.c: 0x5608580cc890: i32 = Constant<0>
try.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:ccalik/vperm

Compiler output

Implementation: T:ccalik/vperm
Security model: timingleaks
Compiler: clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
try.c: fatal error: error in backend: Cannot select: 0x564134f1d9a0: v4i64 = X86ISD::VTRUNC 0x564134f1d870
try.c: 0x564134f1d870: v16i32 = vselect 0x564134f15400, 0x564134ea2f40, 0x564134f1d740
try.c: 0x564134f15400: v4i1 = X86ISD::PCMPGTM 0x564134efba20, 0x564134ef75b0
try.c: 0x564134efba20: v4i64 = X86ISD::VBROADCAST 0x564134e9be90
try.c: 0x564134e9be90: i64,ch = load<LD8[%lsr.iv6971]> 0x564134e0c950, 0x564134ef2410, undef:i64
try.c: 0x564134ef2410: i64,ch = CopyFromReg 0x564134e0c950, Register:i64 %vreg50
try.c: 0x564134ef7810: i64 = Register %vreg50
try.c: 0x564134e9d360: i64 = undef
try.c: 0x564134ef75b0: v4i64,ch = CopyFromReg 0x564134e0c950, Register:v4i64 %vreg13
try.c: 0x564134efc270: v4i64 = Register %vreg13
try.c: 0x564134ea2f40: v16i32 = X86ISD::VBROADCAST 0x564134efbc80
try.c: 0x564134efbc80: i32,ch = load<LD4[ConstantPool]> 0x564134e0c950, 0x564134ea04d0, undef:i64
try.c: 0x564134ea04d0: i64 = X86ISD::WrapperRIP TargetConstantPool:i64<i32 1> 0
try.c: 0x564134eb7010: i64 = TargetConstantPool<i32 1> 0
try.c: 0x564134e9d360: i64 = undef
try.c: 0x564134f1d740: v16i32 = BUILD_VECTOR Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>, Constant:i32<0>
try.c: 0x564134f1d610: i32 = Constant<0>
try.c: 0x564134f1d610: i32 = Constant<0>
try.c: 0x564134f1d610: i32 = Constant<0>
try.c: 0x564134f1d610: i32 = Constant<0>
try.c: 0x564134f1d610: i32 = Constant<0>
try.c: 0x564134f1d610: i32 = Constant<0>
try.c: 0x564134f1d610: i32 = Constant<0>
try.c: 0x564134f1d610: i32 = Constant<0>
try.c: 0x564134f1d610: i32 = Constant<0>
try.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:ccalik/vperm

Compiler output

Implementation: T:ccalik/vperm
Security model: timingleaks
Compiler: clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
hash.c: hash.c:448:4: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'Compress256' that is compiled without support for 'ssse3'
hash.c: TIX256(pmsg, ctx->state[1], ctx->state[0], ctx->state[6], ctx->state[8], _t0, _t1, _t2);
hash.c: ^
hash.c: hash.c:169:2: note: expanded from macro 'TIX256'
hash.c: TRANSFORM(t1, _k_ipt, t2, t3);\
hash.c: ^
hash.c: ./vperm.h:52:7: note: expanded from macro 'TRANSFORM'
hash.c: t1 = _mm_shuffle_epi8(*((__m128i*)table + 1), t1);\
hash.c: ^
hash.c: hash.c:448:4: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'Compress256' that is compiled without support for 'ssse3'
hash.c: hash.c:169:2: note: expanded from macro 'TIX256'
hash.c: TRANSFORM(t1, _k_ipt, t2, t3);\
hash.c: ^
hash.c: ./vperm.h:53:7: note: expanded from macro 'TRANSFORM'
hash.c: x = _mm_shuffle_epi8(*((__m128i*)table + 0), x);\
hash.c: ^
hash.c: hash.c:449:4: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'Compress256' that is compiled without support for 'ssse3'
hash.c: SUBROUND256_2(ctx->state[8], ctx->state[9], ctx->state[7], ctx->state[2], ctx->state[7], ctx->state[8], ctx->state[6], ctx->state[1]);
hash.c: ^
hash.c: hash.c:340:2: note: expanded from macro 'SUBROUND256_2'
hash.c: SUBSTITUTE(r0, _t1, _t2, _t3, _t0);\
hash.c: ^
hash.c: hash.c:265:20: note: expanded from macro 'SUBSTITUTE'
hash.c: #define SUBSTITUTE SUBSTITUTE_VPERM_CORE
hash.c: ^
hash.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:ccalik/vperm