Implementation notes: amd64, panther, crypto_aead/romulusn
Computer: panther
Microarchitecture: amd64; Tiger Lake (806c1)
Architecture: amd64
CPU ID: GenuineIntel-000806c1-00-bfebfbff
SUPERCOP version: 20240625
Operation: crypto_aead
Primitive: romulusn
Time | Object size | Test size | Implementation | Compiler | Benchmark date | SUPERCOP version |
180166 | 9283 0 0 | 22891 820 1016 | aadomn/x86 | clang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
183527 | 12240 0 0 | 29173 828 1048 | aadomn/x86 | clang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
183533 | 9849 0 0 | 23691 820 1016 | aadomn/x86 | clang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
184311 | 12080 0 0 | 28997 828 1048 | aadomn/x86 | clang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
193134 | 10401 0 0 | 23283 756 1048 | aadomn/x86 | gcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
198536 | 13275 0 0 | 29824 780 1080 | aadomn/x86 | gcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
198588 | 10722 0 0 | 25240 780 1080 | aadomn/x86 | gcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
201638 | 10779 0 0 | 24815 772 1080 | aadomn/x86 | gcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
310558 | 11862 608 0 | 28552 1396 1080 | T:opt32t | gcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
313752 | 7258 608 0 | 20227 1372 1048 | T:opt32t | gcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
314408 | 8559 608 0 | 23152 1396 1080 | T:opt32t | gcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
325184 | 10415 592 0 | 26765 1428 1016 | T:opt32t | clang_-mcpu=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
329526 | 8363 608 0 | 22479 1388 1080 | T:opt32t | gcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
332890 | 10847 592 0 | 27829 1428 1048 | T:opt32t | clang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
333349 | 10859 592 0 | 27861 1428 1048 | T:opt32t | clang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
335135 | 7903 592 0 | 21827 1420 1016 | T:opt32t | clang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
357113 | 8018 592 0 | 21547 1420 1016 | T:opt32t | clang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
724969 | 26928 640 0 | 43677 1508 1048 | T:fixslice_opt32 | clang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
725921 | 27046 640 0 | 43917 1508 1048 | aadomn/opt32 | clang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
726810 | 26928 640 0 | 43693 1508 1048 | T:fixslice_opt32 | clang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
727657 | 27318 640 0 | 44205 1508 1048 | aadomn/opt32 | clang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
777090 | 27476 640 0 | 43301 1476 1016 | T:fixslice_opt32 | clang_-mcpu=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
778805 | 27680 640 0 | 43533 1476 1016 | aadomn/opt32 | clang_-mcpu=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
913458 | 16669 640 0 | 30291 1468 1016 | T:fixslice_opt32 | clang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
921565 | 16268 640 0 | 29947 1468 1016 | aadomn/opt32 | clang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
1026616 | 19938 640 0 | 33859 1468 1016 | T:fixslice_opt32 | clang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
1033835 | 21371 640 0 | 35992 1428 1080 | T:fixslice_opt32 | gcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
1034341 | 20287 640 0 | 34968 1428 1080 | aadomn/opt32 | gcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
1034990 | 19037 640 0 | 33027 1468 1016 | aadomn/opt32 | clang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
1060335 | 20013 640 0 | 34224 1428 1080 | T:fixslice_opt32 | gcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
1063754 | 19003 640 0 | 33280 1428 1080 | aadomn/opt32 | gcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
1094164 | 19163 640 0 | 32115 1404 1048 | T:fixslice_opt32 | gcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
1104440 | 18228 640 0 | 31243 1404 1048 | aadomn/opt32 | gcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
1123354 | 19954 12 0 | 36672 792 1080 | T:ref | gcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
1163846 | 30599 640 0 | 47224 1428 1080 | T:fixslice_opt32 | gcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
1164075 | 29967 640 0 | 46712 1428 1080 | aadomn/opt32 | gcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
1264042 | 39138 12 0 | 54765 840 1048 | T:ref | clang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
1290819 | 39210 12 0 | 54821 840 1048 | T:ref | clang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
1439568 | 13568 12 0 | 29837 840 1016 | T:ref | clang_-mcpu=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
3782171 | 4769 12 0 | 18739 832 1016 | T:ref | clang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
3804126 | 9164 12 0 | 22075 832 1016 | T:ref | clang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
3852530 | 5938 12 0 | 20584 792 1080 | T:ref | gcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
4058930 | 4808 12 0 | 18959 784 1080 | T:ref | gcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
4151280 | 3940 12 0 | 16931 768 1048 | T:ref | gcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
Compiler output
skinny128.c: skinny128.c:200:13: warning: unused variable 'rconst' [-Wunused-variable]
skinny128.c: __m128i rconst;
skinny128.c: ^
skinny128.c: 1 warning generated.
Number of similar (implementation,compiler) pairs: 4, namely:
Implementation | Compiler |
aadomn/x86 | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
aadomn/x86 | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
aadomn/x86 | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
aadomn/x86 | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
Compiler output
skinny128.c: skinny128.c:115:5: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'skinny128_384_plus' that is compiled without support for 'ssse3'
skinny128.c: DOUBLE_ROUND(rtk_23);
skinny128.c: ^
skinny128.c: skinny128.c:78:5: note: expanded from macro 'DOUBLE_ROUND'
skinny128.c: SBOX_ARK_EVEN(rtk_23); \
skinny128.c: ^
skinny128.c: skinny128.c:23:13: note: expanded from macro 'SBOX_ARK_EVEN'
skinny128.c: state = _mm_shuffle_epi8(s1, state); /* apply inner S-box S1 */ \
skinny128.c: ^
skinny128.c: skinny128.c:115:5: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'skinny128_384_plus' that is compiled without support for 'ssse3'
skinny128.c: skinny128.c:78:5: note: expanded from macro 'DOUBLE_ROUND'
skinny128.c: SBOX_ARK_EVEN(rtk_23); \
skinny128.c: ^
skinny128.c: skinny128.c:24:13: note: expanded from macro 'SBOX_ARK_EVEN'
skinny128.c: tmp0 = _mm_shuffle_epi8(s0, tmp0); /* apply inner S-box S0 */ \
skinny128.c: ^
skinny128.c: skinny128.c:115:5: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'skinny128_384_plus' that is compiled without support for 'ssse3'
skinny128.c: skinny128.c:78:5: note: expanded from macro 'DOUBLE_ROUND'
skinny128.c: SBOX_ARK_EVEN(rtk_23); \
skinny128.c: ^
skinny128.c: skinny128.c:32:13: note: expanded from macro 'SBOX_ARK_EVEN'
skinny128.c: tmp0 = _mm_shuffle_epi8(s3, tmp0); /* apply inner S-box S3 */ \
skinny128.c: ^
skinny128.c: skinny128.c:115:5: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'skinny128_384_plus' that is compiled without support for 'ssse3'
skinny128.c: skinny128.c:78:5: note: expanded from macro 'DOUBLE_ROUND'
skinny128.c: ...
Number of similar (implementation,compiler) pairs: 1, namely:
Implementation | Compiler |
aadomn/x86 | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
Compiler output
skinny128.c: skinny128.c: In function 'tk_schedule_23':
skinny128.c: skinny128.c:200:13: warning: unused variable 'rconst' [-Wunused-variable]
skinny128.c: 200 | __m128i rconst;
skinny128.c: | ^~~~~~
Number of similar (implementation,compiler) pairs: 4, namely:
Implementation | Compiler |
aadomn/x86 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |
aadomn/x86 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |
aadomn/x86 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |
aadomn/x86 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |
TIMECOP error (can be valgrind bug)
Process terminating with default action of signal 4 (SIGILL)
Illegal opcode at address 0x402BF0
at 0x...: st32 (try-anything.c:47)
by 0x...: core (try-anything.c:78)
by 0x...: salsa20 (try-anything.c:101)
by 0x...: testvector (try-anything.c:124)
by 0x...: myrandom (try-anything.c:132)
by 0x...: test (try.c:144)
by 0x...: main (try-anything.c:345)
Number of similar (implementation,compiler) pairs: 2, namely:
Implementation | Compiler |
aadomn/opt32 | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
aadomn/x86 | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
TIMECOP error (can be valgrind bug)
Process terminating with default action of signal 4 (SIGILL)
Illegal opcode at address 0x4087CC
at 0x...: packing (tk_schedule.c:66)
by 0x...: permute_tk (tk_schedule.c:272)
by 0x...: tk_schedule_1 (tk_schedule.c:356)
by 0x...: tk_schedule_123 (tk_schedule.c:398)
by 0x...: romulusn_process_ad (romulus_n.c:62)
by 0x...: crypto_aead_romulusn_aadomn_opt32_constbranchindex_encrypt (encrypt.c:27)
by 0x...: test (try.c:159)
by 0x...: main (try-anything.c:345)
Number of similar (implementation,compiler) pairs: 1, namely:
Implementation | Compiler |
aadomn/opt32 | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
TIMECOP error (can be valgrind bug)
Process terminating with default action of signal 4 (SIGILL)
Illegal opcode at address 0x4025AA
at 0x...: core (try-anything.c:73)
by 0x...: salsa20 (try-anything.c:101)
by 0x...: testvector (try-anything.c:124)
by 0x...: myrandom (try-anything.c:132)
by 0x...: test (try.c:144)
by 0x...: main (try-anything.c:345)
Number of similar (implementation,compiler) pairs: 2, namely:
Implementation | Compiler |
aadomn/opt32 | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
aadomn/x86 | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
TIMECOP error (can be valgrind bug)
Process terminating with default action of signal 4 (SIGILL)
Illegal opcode at address 0x10A849
at 0x...: salsa20.part.0 (try-anything.c:102)
by 0x...: salsa20 (try-anything.c:85)
by 0x...: canary (try-anything.c:148)
by 0x...: input_prepare (try-anything.c:163)
by 0x...: test (try.c:149)
by 0x...: main (try-anything.c:345)
Number of similar (implementation,compiler) pairs: 2, namely:
Implementation | Compiler |
aadomn/opt32 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |
aadomn/x86 | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |
TIMECOP error (can be valgrind bug)
Process terminating with default action of signal 4 (SIGILL)
Illegal opcode at address 0x10A6EA
at 0x...: st32 (try-anything.c:47)
by 0x...: core (try-anything.c:78)
by 0x...: salsa20 (try-anything.c:101)
by 0x...: salsa20 (try-anything.c:81)
by 0x...: testvector (try-anything.c:124)
by 0x...: myrandom (try-anything.c:132)
by 0x...: test (try.c:144)
by 0x...: main (try-anything.c:345)
Number of similar (implementation,compiler) pairs: 2, namely:
Implementation | Compiler |
aadomn/opt32 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |
aadomn/x86 | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |
TIMECOP error (can be valgrind bug)
Process terminating with default action of signal 4 (SIGILL)
Illegal opcode at address 0x405A83
at 0x...: tk_schedule_23 (skinny128.c:219)
by 0x...: romulusn_process_ad (romulus_n.c:62)
by 0x...: crypto_aead_romulusn_aadomn_x86_constbranchindex_encrypt (encrypt.c:27)
by 0x...: test (try.c:159)
by 0x...: main (try-anything.c:345)
Number of similar (implementation,compiler) pairs: 1, namely:
Implementation | Compiler |
aadomn/x86 | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
TIMECOP error (can be valgrind bug)
Process terminating with default action of signal 4 (SIGILL)
Illegal opcode at address 0x4047C3
at 0x...: tk_schedule_23 (skinny128.c:219)
by 0x...: romulusn_process_ad (romulus_n.c:62)
by 0x...: crypto_aead_romulusn_aadomn_x86_constbranchindex_encrypt (encrypt.c:27)
by 0x...: test (try.c:159)
by 0x...: main (try-anything.c:345)
Number of similar (implementation,compiler) pairs: 1, namely:
Implementation | Compiler |
aadomn/x86 | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
TIMECOP error (can be valgrind bug)
Process terminating with default action of signal 4 (SIGILL)
Illegal opcode at address 0x10B974
at 0x...: _mm_and_si128 (emmintrin.h:1287)
by 0x...: skinny128_384_plus (skinny128.c:116)
by 0x...: romulusn_process_ad (romulus_n.c:63)
by 0x...: crypto_aead_romulusn_aadomn_x86_constbranchindex_encrypt (encrypt.c:27)
by 0x...: test (try.c:159)
by 0x...: main (try-anything.c:345)
Number of similar (implementation,compiler) pairs: 1, namely:
Implementation | Compiler |
aadomn/x86 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |
TIMECOP error (can be valgrind bug)
Process terminating with default action of signal 4 (SIGILL)
Illegal opcode at address 0x10B436
at 0x...: _mm_shuffle_epi8 (tmmintrin.h:138)
by 0x...: skinny128_384_plus (skinny128.c:116)
by 0x...: romulusn_process_ad (romulus_n.c:63)
by 0x...: crypto_aead_romulusn_aadomn_x86_constbranchindex_encrypt (encrypt.c:27)
by 0x...: test (try.c:159)
by 0x...: main (try-anything.c:345)
Number of similar (implementation,compiler) pairs: 1, namely:
Implementation | Compiler |
aadomn/x86 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |
Passed TIMECOP
TIMECOP iterations: 10
Number of similar (implementation,compiler) pairs: 4, namely:
Implementation | Compiler |
aadomn/opt32 | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
aadomn/opt32 | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1) |
aadomn/opt32 | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |
aadomn/opt32 | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110) |