Implementation notes: amd64, hertz, crypto_aead/twine80n6t4clocv3

Computer: hertz
Microarchitecture: amd64; Zen 4 (a60f12)
Architecture: amd64
CPU ID: AuthenticAMD-00a60f12-178bfbff
SUPERCOP version: 20240425
Operation: crypto_aead
Primitive: twine80n6t4clocv3
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
1342744390 0 89617999 788 1960T:vpermgcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2023122020231217
1349195141 0 89620476 812 1992T:vpermgcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2023122020231217
1351605725 0 89623044 812 2056T:vpermgcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2023122020231217
8316688234 0 29225453 804 1480T:refgcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2023122020231217
8626607779 0 29222997 804 1416T:refgcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2023122020231217
8889506221 0 30421136 820 1416T:refclang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2024042920240425
146880310736 0 30431856 836 1416T:refclang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2024042920240425
152178110969 0 30432224 836 1416T:refclang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2024042920240425
18401796406 0 29219864 780 1384T:refgcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2023122020231217

Compiler output

Implementation: T:vperm
Security model: timingleaks
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
cloc.c: cloc.c:193:26: warning: implicit conversion from 'int' to 'char' changes value from 128 to -128 [-Wconstant-conversion]
cloc.c: 193 | state = XORDQW(state, SHR(state, 8));
cloc.c: | ~~~~~~~~~~~~~~^~~~~~~~~~~~~~
cloc.c: ./common.h:30:126: note: expanded from macro 'SHR'
cloc.c: 30 | #define SHR(x,n) _mm_shuffle_epi8((x), _mm_set_epi8(127+(n), 126+(n), 125+(n), 124+(n), 123+(n), 122+(n), 121+(n), 120+(n), 119+(n), 118+(n), 117+(n), 116+(n), 115+(n), 114+(n), 113+(n), 112+(n))) // shift to the right
cloc.c: | ~~~~~~~~~~~~ ^
cloc.c: ./common.h:18:43: note: expanded from macro 'XORDQW'
cloc.c: 18 | #define XORDQW(x, y) _mm_xor_si128((x), (y))
cloc.c: | ^
cloc.c: cloc.c:193:26: warning: implicit conversion from 'int' to 'char' changes value from 129 to -127 [-Wconstant-conversion]
cloc.c: 193 | state = XORDQW(state, SHR(state, 8));
cloc.c: | ~~~~~~~~~~~~~~^~~~~~~~~~~~~~
cloc.c: ./common.h:30:117: note: expanded from macro 'SHR'
cloc.c: 30 | #define SHR(x,n) _mm_shuffle_epi8((x), _mm_set_epi8(127+(n), 126+(n), 125+(n), 124+(n), 123+(n), 122+(n), 121+(n), 120+(n), 119+(n), 118+(n), 117+(n), 116+(n), 115+(n), 114+(n), 113+(n), 112+(n))) // shift to the right
cloc.c: | ~~~~~~~~~~~~ ^
cloc.c: ./common.h:18:43: note: expanded from macro 'XORDQW'
cloc.c: 18 | #define XORDQW(x, y) _mm_xor_si128((x), (y))
cloc.c: | ^
cloc.c: cloc.c:193:26: warning: implicit conversion from 'int' to 'char' changes value from 130 to -126 [-Wconstant-conversion]
cloc.c: 193 | state = XORDQW(state, SHR(state, 8));
cloc.c: | ~~~~~~~~~~~~~~^~~~~~~~~~~~~~
cloc.c: ./common.h:30:108: note: expanded from macro 'SHR'
cloc.c: 30 | #define SHR(x,n) _mm_shuffle_epi8((x), _mm_set_epi8(127+(n), 126+(n), 125+(n), 124+(n), 123+(n), 122+(n), 121+(n), 120+(n), 119+(n), 118+(n), 117+(n), 116+(n), 115+(n), 114+(n), 113+(n), 112+(n))) // shift to the right
cloc.c: | ~~~~~~~~~~~~ ^
cloc.c: ./common.h:18:43: note: expanded from macro 'XORDQW'
cloc.c: ...

Number of similar (compiler,implementation) pairs: 3, namely:
CompilerImplementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:vperm
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:vperm
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:vperm