Implementation notes: amd64, hydra4, crypto_aead/deoxyseq256128v1

Computer: hydra4
Microarchitecture: amd64; K10 32nm (300f10)
Architecture: amd64
CPU ID: AuthenticAMD-00300f10-178bfbff
SUPERCOP version: 20240625
Operation: crypto_aead
Primitive: deoxyseq256128v1
Time      Object size  Test size       Implementation  Compiler                                                                        Benchmark date  SUPERCOP version
2521497   31412 0 0    47000 812 1016  T:ref           clang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall  20240628        20240625
2878512   32543 0 0    48632 780 1080  T:ref           gcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall         20240628        20240625
4205811   29732 0 0    46136 812 1016  T:ref           clang_-mcpu=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall   20240628        20240625
4205967   29796 0 0    46200 812 1016  T:ref           clang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall  20240628        20240625
7944773   25206 0 0    38654 804 1016  T:ref           clang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall  20240628        20240625
8148415   27353 0 0    42272 780 1080  T:ref           gcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall         20240628        20240625
9421582   24061 0 0    37075 756 1048  T:ref           gcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall         20240628        20240625
9462225   26114 0 0    40319 772 1080  T:ref           gcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall          20240628        20240625
10225189  24838 0 0    38910 804 1016  T:ref           clang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall   20240628        20240625

Compiler output


deoxys-256.c: deoxys-256.c:77:45: warning: implicit conversion from 'int' to 'char' changes value from 128 to -128 [-Wconstant-conversion]
deoxys-256.c:   const __m128i MSB_XLS1                = constant8( (0x8<<4),0x00,0x00,0x00, 0x00,0x00,0x00,0x00, 0x00,0x00,0x00,0x00, 0x00,0x00,0x00,0x00 );
deoxys-256.c:                                           ~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
deoxys-256.c: ./tweakable-cipher.macros:12:171: note: expanded from macro 'constant8'
deoxys-256.c: #define constant8(b15,b14,b13,b12,b11,b10,b9,b8,b7,b6,b5,b4,b3,b2,b1,b0)   _mm_set_epi8 ((b0),(b1),(b2),(b3),(b4),(b5),(b6),(b7),(b8),(b9),(b10),(b11),(b12),(b13),(b14),(b15))
deoxys-256.c:                                                                            ~~~~~~~~~~~~                                                                                   ^~~
deoxys-256.c: deoxys-256.c:78:45: warning: implicit conversion from 'int' to 'char' changes value from 144 to -112 [-Wconstant-conversion]
deoxys-256.c:   const __m128i MSB_XLS2                = constant8( (0x9<<4),0x00,0x00,0x00, 0x00,0x00,0x00,0x00, 0x00,0x00,0x00,0x00, 0x00,0x00,0x00,0x00 );
deoxys-256.c:                                           ~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
deoxys-256.c: ./tweakable-cipher.macros:12:171: note: expanded from macro 'constant8'
deoxys-256.c: #define constant8(b15,b14,b13,b12,b11,b10,b9,b8,b7,b6,b5,b4,b3,b2,b1,b0)   _mm_set_epi8 ((b0),(b1),(b2),(b3),(b4),(b5),(b6),(b7),(b8),(b9),(b10),(b11),(b12),(b13),(b14),(b15))
deoxys-256.c:                                                                            ~~~~~~~~~~~~                                                                                   ^~~
deoxys-256.c: deoxys-256.c:92:21: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'XLS' that is compiled without support for 'ssse3'
deoxys-256.c:   if( Direction){   AES( State , subkeys , Tweak , tTweak ); }
deoxys-256.c:                     ^
deoxys-256.c: ./tweakable-cipher.macros:79:41: note: expanded from macro 'AES'
deoxys-256.c: s = xor( s , xor( subkeys[ 0] , t ) );t=permute( t , H_PERMUTATION);\
deoxys-256.c:                                         ^
deoxys-256.c: ./tweakable-cipher.macros:7:22: note: expanded from macro 'permute'
deoxys-256.c: #define permute(a,b) _mm_shuffle_epi8(a,b)
deoxys-256.c:                      ^
deoxys-256.c: deoxys-256.c:92:21: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'XLS' that is compiled without support for 'aes'
deoxys-256.c: ./tweakable-cipher.macros:80:5: note: expanded from macro 'AES'
deoxys-256.c: s = enc( s , xor( subkeys[ 1] , t ) );t=permute( t , H_PERMUTATION);\
deoxys-256.c:     ^
deoxys-256.c: ...

Number of similar (implementation,compiler) pairs: 5, namely:
Implementation  Compiler
T:opt           clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:opt           clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:opt           clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:opt           clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:opt           clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
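
Note on the -Wconstant-conversion warnings in the compiler output above: _mm_set_epi8 takes char arguments, and where char is signed the constants (0x8<<4) = 128 and (0x9<<4) = 144 fall outside its range, so clang reports the wrapped values -128 and -112. The bit patterns 0x80 and 0x90 are unchanged by that wrap. The following is a minimal sketch, not code from deoxys-256.c, of one way to pass such bytes without an implicit narrowing of an out-of-range int constant. The helper names byte_as_char and char_bits are hypothetical.

```c
#include <string.h>

/* Hypothetical helper (not part of deoxys-256.c): reinterpret a byte
   value 0..255 as a char by copying its bit pattern, so no out-of-range
   int constant is implicitly narrowed to char. */
char byte_as_char(unsigned char b)
{
    char c;
    memcpy(&c, &b, 1);   /* same 8 bits, whatever the signedness of char */
    return c;
}

/* Hypothetical helper: read the bit pattern of a char back as an
   unsigned value, for inspection only. */
unsigned int char_bits(char c)
{
    return (unsigned int)(unsigned char)c;
}
```

Under these assumptions, char_bits(byte_as_char(0x8 << 4)) is 0x80 and char_bits(byte_as_char(0x9 << 4)) is 0x90, which are the byte values the constant8 arguments are meant to encode.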

Compiler output


deoxys-256.c: In file included from deoxys-256.c:27:
deoxys-256.c: deoxys-256.c: In function 'XLS':
deoxys-256.c: /usr/lib/gcc/x86_64-linux-gnu/10/include/wmmintrin.h:61:1: error: inlining failed in call to 'always_inline' '_mm_aesenc_si128': target specific option mismatch
deoxys-256.c:    61 | _mm_aesenc_si128 (__m128i __X, __m128i __Y)
deoxys-256.c:       | ^~~~~~~~~~~~~~~~
deoxys-256.c: In file included from deoxys-256.c:29:
deoxys-256.c: tweakable-cipher.macros:1:18: note: called from here
deoxys-256.c:     1 | #define enc(a,b) _mm_aesenc_si128(a,b)
deoxys-256.c:       |                  ^~~~~~~~~~~~~~~~~~~~~
deoxys-256.c: tweakable-cipher.macros:95:5: note: in expansion of macro 'enc'
deoxys-256.c:    95 | s = enc( s , xor( subkeys[16] , t ) );
deoxys-256.c:       |     ^~~
deoxys-256.c: deoxys-256.c:92:21: note: in expansion of macro 'AES'
deoxys-256.c:    92 |   if( Direction){   AES( State , subkeys , Tweak , tTweak ); }
deoxys-256.c:       |                     ^~~
deoxys-256.c: In file included from deoxys-256.c:28:
deoxys-256.c: /usr/lib/gcc/x86_64-linux-gnu/10/include/tmmintrin.h:136:1: error: inlining failed in call to 'always_inline' '_mm_shuffle_epi8': target specific option mismatch
deoxys-256.c:   136 | _mm_shuffle_epi8 (__m128i __X, __m128i __Y)
deoxys-256.c:       | ^~~~~~~~~~~~~~~~
deoxys-256.c: In file included from deoxys-256.c:29:
deoxys-256.c: tweakable-cipher.macros:7:22: note: called from here
deoxys-256.c:     7 | #define permute(a,b) _mm_shuffle_epi8(a,b)
deoxys-256.c:       |                      ^~~~~~~~~~~~~~~~~~~~~
deoxys-256.c: tweakable-cipher.macros:94:41: note: in expansion of macro 'permute'
deoxys-256.c:    94 | s = enc( s , xor( subkeys[15] , t ) );t=permute( t , H_PERMUTATION);\
deoxys-256.c: ...

Number of similar (implementation,compiler) pairs: 4, namely:
Implementation  Compiler
T:opt           gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:opt           gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:opt           gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:opt           gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
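
Note on the "target specific option mismatch" errors above, and the corresponding clang diagnostics that name 'ssse3' and 'aes': both compilers refuse to inline _mm_shuffle_epi8 and _mm_aesenc_si128 into a function compiled without those target features. Because the builds use -march=native, this suggests the host CPU does not advertise SSSE3 and AES to the compiler. The following is a minimal sketch, not the approach taken by the opt implementation, of guarding such intrinsics at compile time with the predefined macros __SSSE3__ and __AES__. The names HAVE_SSSE3_AES, build_has_ssse3_aes and round_then_permute are hypothetical.

```c
/* __SSSE3__ and __AES__ are predefined by GCC and Clang when the
   corresponding instruction sets are enabled for this translation unit
   (for example via -mssse3 -maes, or -march=native on a CPU that has them). */
#if defined(__SSSE3__) && defined(__AES__)
#include <tmmintrin.h>   /* _mm_shuffle_epi8 */
#include <wmmintrin.h>   /* _mm_aesenc_si128 */
#define HAVE_SSSE3_AES 1   /* hypothetical macro name */
#else
#define HAVE_SSSE3_AES 0
#endif

/* Hypothetical helper: reports whether this build may use the intrinsics. */
int build_has_ssse3_aes(void)
{
    return HAVE_SSSE3_AES;
}

#if HAVE_SSSE3_AES
/* One AES round followed by a byte shuffle, using the same intrinsics
   that the macros enc() and permute() expand to. This function is only
   compiled when both features are enabled, so it cannot trigger the
   always_inline error shown in the log. */
__m128i round_then_permute(__m128i s, __m128i k, __m128i perm)
{
    return _mm_shuffle_epi8(_mm_aesenc_si128(s, k), perm);
}
#endif
```

With a guard of this kind, a build for a target without SSSE3 and AES skips the intrinsic path instead of failing to compile, and a caller can test build_has_ssse3_aes() to choose a portable fallback.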

Compiler output


tweakableBC.c: tweakableBC.c: In function 'aesTweakEncrypt':
tweakableBC.c: tweakableBC.c:886:9: warning: variable 'Nr' set but not used [-Wunused-but-set-variable]
tweakableBC.c:   886 |     int Nr;
tweakableBC.c:       |         ^~
tweakableBC.c: tweakableBC.c: In function 'aesTweakDecrypt':
tweakableBC.c: tweakableBC.c:1005:9: warning: variable 'Nr' set but not used [-Wunused-but-set-variable]
tweakableBC.c:  1005 |     int Nr;
tweakableBC.c:       |         ^~

Number of similar (implementation,compiler) pairs: 4, namely:
Implementation  Compiler
T:ref           gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:ref           gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:ref           gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:ref           gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
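
Note on the -Wunused-but-set-variable warnings above: GCC reports a local variable that is assigned but never read. The warning does not affect the generated results. The following is a hypothetical reduction of the pattern, not the actual code of aesTweakEncrypt or aesTweakDecrypt in tweakableBC.c. The function names are invented, and the AES rule Nr = Nk + 6 is used purely as illustrative arithmetic.

```c
/* Hypothetical reduction: Nr is assigned but its value is never read,
   which is the pattern GCC flags under -Wall. */
int rounds_set_but_unused(int keybits)
{
    int Nr;
    Nr = keybits / 32 + 6;   /* value computed, then discarded */
    return keybits;          /* Nr does not influence the result */
}

/* Same computation with the value actually consumed; no warning.
   The other way to silence the warning is to delete the variable. */
int rounds_used(int keybits)
{
    int Nr = keybits / 32 + 6;
    return Nr;
}
```

Compiling the first function with gcc -Wall is expected to produce a warning of the same form as in the log, while the second compiles cleanly.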