Implementation notes: amd64, hydra8, crypto_hash/fugue512

Computer: hydra8
Microarchitecture: amd64; Ivy Bridge+AES (306a9)
Architecture: amd64
CPU ID: GenuineIntel-000306a9-bfebfbff
SUPERCOP version: 20240716
Operation: crypto_hash
Primitive: fugue512
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
5180840062 0 050480 860 896T:ccalik/vpermclang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
5198840606 0 052968 860 896T:ccalik/vpermclang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
5316439595 0 048058 852 896T:ccalik/vpermclang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
5624640983 0 049728 860 896T:ccalik/vpermclang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
8417141532 0 053622 820 960T:sphlibgcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
8453440628 0 051456 860 896T:sphlibclang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
8475140721 0 055184 860 896T:sphlibclang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
8475343308 0 056662 820 960T:sphlibgcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
8481940081 0 053178 868 896T:sphlibclang_-mcpu=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
8487840025 0 051677 812 960T:sphlibgcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
8525940721 0 053240 860 896T:sphlibclang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
8619837761 0 048353 796 928T:sphlibgcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716
9592641825 0 052356 860 896T:sphlibclang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024071720240716

Compiler output


hash.c: hash.c:540:43: warning: unused variable 'uBlockCount' [-Wunused-variable]
hash.c:         unsigned int i, col[30], block[4], base, uBlockCount = 5;
hash.c:                                                  ^
hash.c: hash.c:541:10: warning: unused variable 't1' [-Wunused-variable]
hash.c:         __m128i t1, t2;
hash.c:                 ^
hash.c: hash.c:541:14: warning: unused variable 't2' [-Wunused-variable]
hash.c:         __m128i t1, t2;
hash.c:                     ^
hash.c: hash.c:746:10: warning: unused variable 't1' [-Wunused-variable]
hash.c:         __m128i t1, t2;
hash.c:                 ^
hash.c: hash.c:746:14: warning: unused variable 't2' [-Wunused-variable]
hash.c:         __m128i t1, t2;
hash.c:                     ^
hash.c: hash.c:915:10: warning: unused variable 't1' [-Wunused-variable]
hash.c:         __m128i t1, t2;
hash.c:                 ^
hash.c: hash.c:915:14: warning: unused variable 't2' [-Wunused-variable]
hash.c:         __m128i t1, t2;
hash.c:                     ^
hash.c: hash.c:1053:10: warning: unused variable 't1' [-Wunused-variable]
hash.c:         __m128i t1, t2;
hash.c:                 ^
hash.c: hash.c:1053:14: warning: unused variable 't2' [-Wunused-variable]
hash.c: ...

Number of similar (implementation,compiler) pairs: 4, namely:
ImplementationCompiler
T:ccalik/aesniclang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:ccalik/aesniclang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:ccalik/aesniclang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:ccalik/aesniclang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)

Compiler output


hash.c: hash.c:449:4: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'Compress256' that is compiled without support for 'ssse3'
hash.c:                         SUBROUND256_2(ctx->state[8], ctx->state[9], ctx->state[7], ctx->state[2], ctx->state[7], ctx->state[8], ctx->state[6], ctx->state[1]);
hash.c:                         ^
hash.c: hash.c:340:2: note: expanded from macro 'SUBROUND256_2'
hash.c:         SUBSTITUTE(r0, _t1, _t2, _t3, _t0);\
hash.c:         ^
hash.c: hash.c:249:10: note: expanded from macro 'SUBSTITUTE'
hash.c:                         _t2 = _mm_shuffle_epi8(r0, M128(_inv_shift_rows));\
hash.c:                               ^
hash.c: hash.c:449:4: error: always_inline function '_mm_aesenclast_si128' requires target feature 'aes', but would be inlined into function 'Compress256' that is compiled without support for 'aes'
hash.c: hash.c:340:2: note: expanded from macro 'SUBROUND256_2'
hash.c:         SUBSTITUTE(r0, _t1, _t2, _t3, _t0);\
hash.c:         ^
hash.c: hash.c:250:10: note: expanded from macro 'SUBSTITUTE'
hash.c:                         _t2 = _mm_aesenclast_si128(_t2, M128(_zero))
hash.c:                               ^
hash.c: hash.c:449:4: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'Compress256' that is compiled without support for 'ssse3'
hash.c: hash.c:341:2: note: expanded from macro 'SUBROUND256_2'
hash.c:         SUPERMIX(_t2, _t3, _t0, _t1, r0);\
hash.c:         ^
hash.c: hash.c:253:2: note: expanded from macro 'SUPERMIX'
hash.c:         PRESUPERMIX(t0, t1, t2, t3, t4);\
hash.c:         ^
hash.c: hash.c:245:25: note: expanded from macro 'PRESUPERMIX'
hash.c:         s2 = _mm_xor_si128(s2, _mm_shuffle_epi8(M128(_mul2mask), t1));\
hash.c: ...

Number of similar (implementation,compiler) pairs: 1, namely:
ImplementationCompiler
T:ccalik/aesniclang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)

Compiler output


hash.c: In file included from hash.c:21:
hash.c: vperm.h:57:9: warning: multi-line comment [-Wcomment]
hash.c:    57 |         //t2 = _mm_shuffle_epi8(table[0], x);\
hash.c:       |         ^
hash.c: hash.c:123:1: warning: multi-line comment [-Wcomment]
hash.c:   123 | //#define PACK_S0(s0, s1, t1)\
hash.c:       | ^
hash.c: hash.c:132:1: warning: multi-line comment [-Wcomment]
hash.c:   132 | //#define UNPACK_S0(s0, s1, t1)\
hash.c:       | ^
hash.c: hash.c:141:1: warning: multi-line comment [-Wcomment]
hash.c:   141 | //#define CMIX(s1, s2, r1, r2, t1, t2)\
hash.c:       | ^
hash.c: hash.c:330:1: warning: multi-line comment [-Wcomment]
hash.c:   330 | //#define SUBROUND256(r1, r2, r0, r5)\
hash.c:       | ^
hash.c: hash.c: In function 'Final256':
hash.c: hash.c:541:21: warning: unused variable 't2' [-Wunused-variable]
hash.c:   541 |         __m128i t1, t2;
hash.c:       |                     ^~
hash.c: hash.c:541:17: warning: unused variable 't1' [-Wunused-variable]
hash.c:   541 |         __m128i t1, t2;
hash.c:       |                 ^~
hash.c: hash.c:540:50: warning: unused variable 'uBlockCount' [-Wunused-variable]
hash.c:   540 |         unsigned int i, col[30], block[4], base, uBlockCount = 5;
hash.c: ...

Number of similar (implementation,compiler) pairs: 4, namely:
ImplementationCompiler
T:ccalik/aesnigcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:ccalik/aesnigcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:ccalik/aesnigcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:ccalik/aesnigcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)

Compiler output


hash.c: hash.c:539:43: warning: unused variable 'uBlockCount' [-Wunused-variable]
hash.c:         unsigned int i, col[30], block[4], base, uBlockCount = 5;
hash.c:                                                  ^
hash.c: hash.c:540:14: warning: unused variable 't2' [-Wunused-variable]
hash.c:         __m128i t1, t2;
hash.c:                     ^
hash.c: hash.c:745:14: warning: unused variable 't2' [-Wunused-variable]
hash.c:         __m128i t1, t2;
hash.c:                     ^
hash.c: hash.c:914:14: warning: unused variable 't2' [-Wunused-variable]
hash.c:         __m128i t1, t2;
hash.c:                     ^
hash.c: hash.c:1052:14: warning: unused variable 't2' [-Wunused-variable]
hash.c:         __m128i t1, t2;
hash.c:                     ^
hash.c: 5 warnings generated.

Number of similar (implementation,compiler) pairs: 4, namely:
ImplementationCompiler
T:ccalik/vpermclang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:ccalik/vpermclang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:ccalik/vpermclang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:ccalik/vpermclang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)

Compiler output


hash.c: hash.c:447:4: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'Compress256' that is compiled without support for 'ssse3'
hash.c:                         TIX256(pmsg, ctx->state[1], ctx->state[0], ctx->state[6], ctx->state[8], _t0, _t1, _t2);
hash.c:                         ^
hash.c: hash.c:168:2: note: expanded from macro 'TIX256'
hash.c:         TRANSFORM(t1, _k_ipt, t2, t3);\
hash.c:         ^
hash.c: ./vperm.h:52:7: note: expanded from macro 'TRANSFORM'
hash.c:         t1 = _mm_shuffle_epi8(*((__m128i*)table + 1), t1);\
hash.c:              ^
hash.c: hash.c:447:4: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'Compress256' that is compiled without support for 'ssse3'
hash.c: hash.c:168:2: note: expanded from macro 'TIX256'
hash.c:         TRANSFORM(t1, _k_ipt, t2, t3);\
hash.c:         ^
hash.c: ./vperm.h:53:7: note: expanded from macro 'TRANSFORM'
hash.c:         x  = _mm_shuffle_epi8(*((__m128i*)table + 0), x);\
hash.c:              ^
hash.c: hash.c:448:4: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'Compress256' that is compiled without support for 'ssse3'
hash.c:                         SUBROUND256_2(ctx->state[8], ctx->state[9], ctx->state[7], ctx->state[2], ctx->state[7], ctx->state[8], ctx->state[6], ctx->state[1]);
hash.c:                         ^
hash.c: hash.c:339:2: note: expanded from macro 'SUBROUND256_2'
hash.c:         SUBSTITUTE(r0, _t1, _t2, _t3, _t0);\
hash.c:         ^
hash.c: hash.c:264:20: note: expanded from macro 'SUBSTITUTE'
hash.c: #define SUBSTITUTE SUBSTITUTE_VPERM_CORE
hash.c:                    ^
hash.c: ...

Number of similar (implementation,compiler) pairs: 1, namely:
ImplementationCompiler
T:ccalik/vpermclang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)

Compiler output


hash.c: In file included from hash.c:21:
hash.c: vperm.h:57:9: warning: multi-line comment [-Wcomment]
hash.c:    57 |         //t2 = _mm_shuffle_epi8(table[0], x);\
hash.c:       |         ^
hash.c: hash.c:122:1: warning: multi-line comment [-Wcomment]
hash.c:   122 | //#define PACK_S0(s0, s1, t1)\
hash.c:       | ^
hash.c: hash.c:131:1: warning: multi-line comment [-Wcomment]
hash.c:   131 | //#define UNPACK_S0(s0, s1, t1)\
hash.c:       | ^
hash.c: hash.c:140:1: warning: multi-line comment [-Wcomment]
hash.c:   140 | //#define CMIX(s1, s2, r1, r2, t1, t2)\
hash.c:       | ^
hash.c: hash.c:329:1: warning: multi-line comment [-Wcomment]
hash.c:   329 | //#define SUBROUND256(r1, r2, r0, r5)\
hash.c:       | ^
hash.c: hash.c: In function 'Final256':
hash.c: hash.c:540:21: warning: unused variable 't2' [-Wunused-variable]
hash.c:   540 |         __m128i t1, t2;
hash.c:       |                     ^~
hash.c: hash.c:539:50: warning: unused variable 'uBlockCount' [-Wunused-variable]
hash.c:   539 |         unsigned int i, col[30], block[4], base, uBlockCount = 5;
hash.c:       |                                                  ^~~~~~~~~~~
hash.c: hash.c: In function 'Final384':
hash.c: hash.c:745:21: warning: unused variable 't2' [-Wunused-variable]
hash.c: ...

Number of similar (implementation,compiler) pairs: 4, namely:
ImplementationCompiler
T:ccalik/vpermgcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:ccalik/vpermgcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:ccalik/vpermgcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:ccalik/vpermgcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)

Namespace violations


hash.o Compress256 T
hash.o Compress384 T
hash.o Compress512 T
hash.o Final T
hash.o Final256 T
hash.o Final384 T
hash.o Final512 T
hash.o Hash T
hash.o Init T
hash.o Update T
hash.o _IV256 R
hash.o _IV384 R
hash.o _IV512 R
hash.o _aes_shift_rows R
hash.o _inv_shift_rows R
hash.o _k_aesmix1 R
hash.o _k_aesmix2 R
hash.o _k_aesmix3 R
hash.o _k_aesmix4 R
hash.o _k_h0e R
hash.o _k_h15 R
hash.o _k_h4e R
hash.o _k_h5b R
hash.o _k_h63 R
hash.o _k_hc6 R
hash.o _k_inv R
hash.o _k_ipt R
hash.o _k_n R
hash.o _k_opt R
hash.o _k_s0F R
hash.o _k_sb1 R
hash.o _k_sb2 R
hash.o _k_sb3 R
hash.o _k_sb4 R
hash.o _k_sb5 R
hash.o _k_sb7 R
hash.o _k_sbo R
hash.o _lsbmask2 R
hash.o _maskd3n R
hash.o _mul2mask R
hash.o _mul4mask R
hash.o _shift_four_mask R
hash.o _shift_one_mask R
hash.o _shift_seven_mask R
hash.o _supermix1a R
hash.o _supermix1b R
hash.o _supermix1c R
hash.o _supermix1d R
hash.o _supermix2a R
hash.o _supermix4a R
hash.o _supermix4b R
hash.o _supermix4c R
hash.o _supermix7a R
hash.o _supermix7b R
hash.o _zero R

Number of similar (implementation,compiler) pairs: 4, namely:
ImplementationCompiler
T:ccalik/vpermclang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:ccalik/vpermclang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:ccalik/vpermclang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:ccalik/vpermclang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)

Namespace violations


fugue.o sph_fugue224 T
fugue.o sph_fugue224_addbits_and_close T
fugue.o sph_fugue224_close T
fugue.o sph_fugue224_init T
fugue.o sph_fugue256 T
fugue.o sph_fugue256_addbits_and_close T
fugue.o sph_fugue256_close T
fugue.o sph_fugue256_init T
fugue.o sph_fugue384 T
fugue.o sph_fugue384_addbits_and_close T
fugue.o sph_fugue384_close T
fugue.o sph_fugue384_init T
fugue.o sph_fugue512 T
fugue.o sph_fugue512_addbits_and_close T
fugue.o sph_fugue512_close T
fugue.o sph_fugue512_init T

Number of similar (implementation,compiler) pairs: 9, namely:
ImplementationCompiler
T:sphlibclang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:sphlibclang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:sphlibclang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:sphlibclang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:sphlibclang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:sphlibgcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:sphlibgcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:sphlibgcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:sphlibgcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)