Implementation notes: amd64, cubi10, crypto_hash/fugue512
Computer: cubi10
Microarchitecture: amd64; Comet Lake (806ec)
Architecture: amd64
CPU ID: GenuineIntel-000806ec-bfebfbff
SUPERCOP version: 20240625
Operation: crypto_hash
Primitive: fugue512
Time | Object size | Test size | Implementation | Compiler | Benchmark date | SUPERCOP version |
45784 | 30826 0 0 | 39846 804 888 | T:ccalik/aesni | clang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240627 | 20240625 |
45939 | 32165 0 0 | 44224 812 920 | T:ccalik/aesni | clang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240627 | 20240625 |
46141 | 32937 0 0 | 46024 780 952 | T:ccalik/aesni | gcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240627 | 20240625 |
46268 | 32505 0 0 | 43616 780 952 | T:ccalik/aesni | gcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240627 | 20240625 |
46883 | 31717 0 0 | 43680 812 920 | T:ccalik/aesni | clang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240627 | 20240625 |
48528 | 31670 0 0 | 40942 804 888 | T:ccalik/aesni | clang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240627 | 20240625 |
49074 | 29586 0 0 | 40240 780 952 | T:ccalik/aesni | gcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240627 | 20240625 |
52300 | 27305 0 0 | 36931 756 920 | T:ccalik/aesni | gcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240627 | 20240625 |
54667 | 39216 0 0 | 47614 804 888 | T:ccalik/vperm | clang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240627 | 20240625 |
54933 | 40173 0 0 | 51752 812 920 | T:ccalik/vperm | clang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240627 | 20240625 |
55522 | 42349 0 0 | 53440 780 952 | T:ccalik/vperm | gcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240627 | 20240625 |
55572 | 43963 0 0 | 57024 780 952 | T:ccalik/vperm | gcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240627 | 20240625 |
56225 | 39741 0 0 | 51224 812 920 | T:ccalik/vperm | clang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240627 | 20240625 |
57931 | 39510 0 0 | 48302 804 888 | T:ccalik/vperm | clang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240627 | 20240625 |
58404 | 38699 0 0 | 49336 780 952 | T:ccalik/vperm | gcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240627 | 20240625 |
62032 | 36052 0 0 | 45659 756 920 | T:ccalik/vperm | gcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240627 | 20240625 |
76136 | 41117 0 0 | 52394 788 952 | T:sphlib | gcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240627 | 20240625 |
76320 | 39224 0 0 | 49969 780 952 | T:sphlib | gcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240627 | 20240625 |
76812 | 42349 0 0 | 55586 788 952 | T:sphlib | gcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240627 | 20240625 |
78263 | 36714 0 0 | 46509 764 920 | T:sphlib | gcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240627 | 20240625 |
80314 | 39521 0 0 | 53232 812 920 | T:sphlib | clang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240627 | 20240625 |
80408 | 39521 0 0 | 53136 812 920 | T:sphlib | clang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240627 | 20240625 |
81420 | 39953 0 0 | 52958 820 888 | T:sphlib | clang_-mcpu=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240627 | 20240625 |
82085 | 40963 0 0 | 51630 804 888 | T:sphlib | clang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240627 | 20240625 |
90981 | 41523 0 0 | 51956 812 888 | T:sphlib | clang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240627 | 20240625 |
Compiler output
hash.c: hash.c:540:43: warning: unused variable 'uBlockCount' [-Wunused-variable]
hash.c: unsigned int i, col[30], block[4], base, uBlockCount = 5;
hash.c: ^
hash.c: hash.c:541:10: warning: unused variable 't1' [-Wunused-variable]
hash.c: __m128i t1, t2;
hash.c: ^
hash.c: hash.c:541:14: warning: unused variable 't2' [-Wunused-variable]
hash.c: __m128i t1, t2;
hash.c: ^
hash.c: hash.c:746:10: warning: unused variable 't1' [-Wunused-variable]
hash.c: __m128i t1, t2;
hash.c: ^
hash.c: hash.c:746:14: warning: unused variable 't2' [-Wunused-variable]
hash.c: __m128i t1, t2;
hash.c: ^
hash.c: hash.c:915:10: warning: unused variable 't1' [-Wunused-variable]
hash.c: __m128i t1, t2;
hash.c: ^
hash.c: hash.c:915:14: warning: unused variable 't2' [-Wunused-variable]
hash.c: __m128i t1, t2;
hash.c: ^
hash.c: hash.c:1053:10: warning: unused variable 't1' [-Wunused-variable]
hash.c: __m128i t1, t2;
hash.c: ^
hash.c: hash.c:1053:14: warning: unused variable 't2' [-Wunused-variable]
hash.c: ...
Number of similar (implementation,compiler) pairs: 4, namely:
Implementation | Compiler |
T:ccalik/aesni | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
T:ccalik/aesni | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
T:ccalik/aesni | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
T:ccalik/aesni | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
Compiler output
hash.c: hash.c:449:4: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'Compress256' that is compiled without support for 'ssse3'
hash.c: SUBROUND256_2(ctx->state[8], ctx->state[9], ctx->state[7], ctx->state[2], ctx->state[7], ctx->state[8], ctx->state[6], ctx->state[1]);
hash.c: ^
hash.c: hash.c:340:2: note: expanded from macro 'SUBROUND256_2'
hash.c: SUBSTITUTE(r0, _t1, _t2, _t3, _t0);\
hash.c: ^
hash.c: hash.c:249:10: note: expanded from macro 'SUBSTITUTE'
hash.c: _t2 = _mm_shuffle_epi8(r0, M128(_inv_shift_rows));\
hash.c: ^
hash.c: hash.c:449:4: error: always_inline function '_mm_aesenclast_si128' requires target feature 'aes', but would be inlined into function 'Compress256' that is compiled without support for 'aes'
hash.c: hash.c:340:2: note: expanded from macro 'SUBROUND256_2'
hash.c: SUBSTITUTE(r0, _t1, _t2, _t3, _t0);\
hash.c: ^
hash.c: hash.c:250:10: note: expanded from macro 'SUBSTITUTE'
hash.c: _t2 = _mm_aesenclast_si128(_t2, M128(_zero))
hash.c: ^
hash.c: hash.c:449:4: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'Compress256' that is compiled without support for 'ssse3'
hash.c: hash.c:341:2: note: expanded from macro 'SUBROUND256_2'
hash.c: SUPERMIX(_t2, _t3, _t0, _t1, r0);\
hash.c: ^
hash.c: hash.c:253:2: note: expanded from macro 'SUPERMIX'
hash.c: PRESUPERMIX(t0, t1, t2, t3, t4);\
hash.c: ^
hash.c: hash.c:245:25: note: expanded from macro 'PRESUPERMIX'
hash.c: s2 = _mm_xor_si128(s2, _mm_shuffle_epi8(M128(_mul2mask), t1));\
hash.c: ...
Number of similar (implementation,compiler) pairs: 1, namely:
Implementation | Compiler |
T:ccalik/aesni | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
Compiler output
hash.c: In file included from hash.c:21:
hash.c: vperm.h:57:2: warning: multi-line comment [-Wcomment]
hash.c: 57 | //t2 = _mm_shuffle_epi8(table[0], x);\
hash.c: | ^
hash.c: hash.c:123:1: warning: multi-line comment [-Wcomment]
hash.c: 123 | //#define PACK_S0(s0, s1, t1)\
hash.c: | ^
hash.c: hash.c:132:1: warning: multi-line comment [-Wcomment]
hash.c: 132 | //#define UNPACK_S0(s0, s1, t1)\
hash.c: | ^
hash.c: hash.c:141:1: warning: multi-line comment [-Wcomment]
hash.c: 141 | //#define CMIX(s1, s2, r1, r2, t1, t2)\
hash.c: | ^
hash.c: hash.c:330:1: warning: multi-line comment [-Wcomment]
hash.c: 330 | //#define SUBROUND256(r1, r2, r0, r5)\
hash.c: | ^
hash.c: hash.c: In function 'Final256':
hash.c: hash.c:541:14: warning: unused variable 't2' [-Wunused-variable]
hash.c: 541 | __m128i t1, t2;
hash.c: | ^~
hash.c: hash.c:541:10: warning: unused variable 't1' [-Wunused-variable]
hash.c: 541 | __m128i t1, t2;
hash.c: | ^~
hash.c: hash.c:540:43: warning: unused variable 'uBlockCount' [-Wunused-variable]
hash.c: 540 | unsigned int i, col[30], block[4], base, uBlockCount = 5;
hash.c: ...
Number of similar (implementation,compiler) pairs: 4, namely:
Implementation | Compiler |
T:ccalik/aesni | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |
T:ccalik/aesni | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |
T:ccalik/aesni | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |
T:ccalik/aesni | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |
Compiler output
hash.c: hash.c:539:43: warning: unused variable 'uBlockCount' [-Wunused-variable]
hash.c: unsigned int i, col[30], block[4], base, uBlockCount = 5;
hash.c: ^
hash.c: hash.c:540:14: warning: unused variable 't2' [-Wunused-variable]
hash.c: __m128i t1, t2;
hash.c: ^
hash.c: hash.c:745:14: warning: unused variable 't2' [-Wunused-variable]
hash.c: __m128i t1, t2;
hash.c: ^
hash.c: hash.c:914:14: warning: unused variable 't2' [-Wunused-variable]
hash.c: __m128i t1, t2;
hash.c: ^
hash.c: hash.c:1052:14: warning: unused variable 't2' [-Wunused-variable]
hash.c: __m128i t1, t2;
hash.c: ^
hash.c: 5 warnings generated.
Number of similar (implementation,compiler) pairs: 4, namely:
Implementation | Compiler |
T:ccalik/vperm | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
T:ccalik/vperm | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
T:ccalik/vperm | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
T:ccalik/vperm | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
Compiler output
hash.c: hash.c:447:4: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'Compress256' that is compiled without support for 'ssse3'
hash.c: TIX256(pmsg, ctx->state[1], ctx->state[0], ctx->state[6], ctx->state[8], _t0, _t1, _t2);
hash.c: ^
hash.c: hash.c:168:2: note: expanded from macro 'TIX256'
hash.c: TRANSFORM(t1, _k_ipt, t2, t3);\
hash.c: ^
hash.c: ./vperm.h:52:7: note: expanded from macro 'TRANSFORM'
hash.c: t1 = _mm_shuffle_epi8(*((__m128i*)table + 1), t1);\
hash.c: ^
hash.c: hash.c:447:4: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'Compress256' that is compiled without support for 'ssse3'
hash.c: hash.c:168:2: note: expanded from macro 'TIX256'
hash.c: TRANSFORM(t1, _k_ipt, t2, t3);\
hash.c: ^
hash.c: ./vperm.h:53:7: note: expanded from macro 'TRANSFORM'
hash.c: x = _mm_shuffle_epi8(*((__m128i*)table + 0), x);\
hash.c: ^
hash.c: hash.c:448:4: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'Compress256' that is compiled without support for 'ssse3'
hash.c: SUBROUND256_2(ctx->state[8], ctx->state[9], ctx->state[7], ctx->state[2], ctx->state[7], ctx->state[8], ctx->state[6], ctx->state[1]);
hash.c: ^
hash.c: hash.c:339:2: note: expanded from macro 'SUBROUND256_2'
hash.c: SUBSTITUTE(r0, _t1, _t2, _t3, _t0);\
hash.c: ^
hash.c: hash.c:264:20: note: expanded from macro 'SUBSTITUTE'
hash.c: #define SUBSTITUTE SUBSTITUTE_VPERM_CORE
hash.c: ^
hash.c: ...
Number of similar (implementation,compiler) pairs: 1, namely:
Implementation | Compiler |
T:ccalik/vperm | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
Compiler output
hash.c: In file included from hash.c:21:
hash.c: vperm.h:57:2: warning: multi-line comment [-Wcomment]
hash.c: 57 | //t2 = _mm_shuffle_epi8(table[0], x);\
hash.c: | ^
hash.c: hash.c:122:1: warning: multi-line comment [-Wcomment]
hash.c: 122 | //#define PACK_S0(s0, s1, t1)\
hash.c: | ^
hash.c: hash.c:131:1: warning: multi-line comment [-Wcomment]
hash.c: 131 | //#define UNPACK_S0(s0, s1, t1)\
hash.c: | ^
hash.c: hash.c:140:1: warning: multi-line comment [-Wcomment]
hash.c: 140 | //#define CMIX(s1, s2, r1, r2, t1, t2)\
hash.c: | ^
hash.c: hash.c:329:1: warning: multi-line comment [-Wcomment]
hash.c: 329 | //#define SUBROUND256(r1, r2, r0, r5)\
hash.c: | ^
hash.c: hash.c: In function 'Final256':
hash.c: hash.c:540:14: warning: unused variable 't2' [-Wunused-variable]
hash.c: 540 | __m128i t1, t2;
hash.c: | ^~
hash.c: hash.c:539:43: warning: unused variable 'uBlockCount' [-Wunused-variable]
hash.c: 539 | unsigned int i, col[30], block[4], base, uBlockCount = 5;
hash.c: | ^~~~~~~~~~~
hash.c: hash.c: In function 'Final384':
hash.c: hash.c:745:14: warning: unused variable 't2' [-Wunused-variable]
hash.c: ...
Number of similar (implementation,compiler) pairs: 4, namely:
Implementation | Compiler |
T:ccalik/vperm | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |
T:ccalik/vperm | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |
T:ccalik/vperm | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |
T:ccalik/vperm | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |