Implementation notes: amd64, h8atom, crypto_aead/deoxysi256v141

Computer: h8atom
Microarchitecture: amd64; Bonnell (30661)
Architecture: amd64
CPU ID: GenuineIntel-00030661-bfebfbff
SUPERCOP version: 20240716
Operation: crypto_aead
Primitive: deoxysi256v141
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
284830129250 0 592145381 784 1520T:bitslicegcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024063020240625
311087111008 0 592125805 784 1520T:bitslicegcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024063020240625
325549100450 0 592113520 760 1488T:bitslicegcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024063020240625
349090178005 0 592192453 784 1520T:bitslicegcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024063020240625
42047634207 0 54747126 808 1416T:tableclang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024063020240625
45005835362 0 54750244 816 1416T:tableclang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024063020240625
45021935362 0 54751420 816 1416T:tableclang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024063020240625
54539834982 0 54750940 816 1416T:tableclang_-mcpu=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024063020240625
55528236680 0 59252861 784 1520T:tablegcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024063020240625
55528935510 0 54749244 816 1416T:tableclang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024063020240625
57534435989 0 59250837 784 1520T:tablegcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024063020240625
58273633458 0 59246592 760 1488T:tablegcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024063020240625
69777435014 0 59249541 784 1520T:tablegcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024063020240625
243199625316 0 040196 816 856T:refclang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024063020240625
274750727076 0 043148 816 856T:refclang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024063020240625
332363526665 0 042644 816 856T:refclang_-mcpu=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024063020240625
401982027914 0 044101 784 928T:refgcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024063020240625
557398125188 0 038948 816 856T:refclang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024063020240625
774324625807 0 040669 784 928T:refgcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024063020240625
787866824080 0 037022 808 856T:refclang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024063020240625
1010258923264 0 036400 760 896T:refgcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024063020240625
1087944924626 0 039165 784 928T:refgcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024063020240625

Compiler output


deoxys.c: deoxys.c:154:15: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'deoxys_aead_encrypt' that is compiled without support for 'aes'
deoxys.c:               PROCESS_8_ASS_DATA_BLOCKS( States, ass_data + 16 * i , Auth );
deoxys.c:               ^
deoxys.c: ./ae.macros:29:11: note: expanded from macro 'PROCESS_8_ASS_DATA_BLOCKS'
deoxys.c:           AES8        ( States, subkeys, tTweak );          \
deoxys.c:           ^
deoxys.c: ./tweakable-cipher.macros:191:3: note: expanded from macro 'AES8'
deoxys.c:   ONE_ROUND( states , subkeys[ 1] , RT[ 1] ,  1 );\
deoxys.c:   ^
deoxys.c: ./tweakable-cipher.macros:179:8: note: expanded from macro 'ONE_ROUND'
deoxys.c: s[0] = enc( s[0] , tmp );\
deoxys.c:        ^
deoxys.c: ./tweakable-cipher.macros:1:18: note: expanded from macro 'enc'
deoxys.c: #define enc(a,b) _mm_aesenc_si128(a,b)
deoxys.c:                  ^
deoxys.c: deoxys.c:154:15: error: always_inline function '_mm_aesenc_si128' requires target feature 'aes', but would be inlined into function 'deoxys_aead_encrypt' that is compiled without support for 'aes'
deoxys.c: ./ae.macros:29:11: note: expanded from macro 'PROCESS_8_ASS_DATA_BLOCKS'
deoxys.c:           AES8        ( States, subkeys, tTweak );          \
deoxys.c:           ^
deoxys.c: ./tweakable-cipher.macros:191:3: note: expanded from macro 'AES8'
deoxys.c:   ONE_ROUND( states , subkeys[ 1] , RT[ 1] ,  1 );\
deoxys.c:   ^
deoxys.c: ./tweakable-cipher.macros:180:8: note: expanded from macro 'ONE_ROUND'
deoxys.c: s[1] = enc( s[1] , xor(tmp, tw_c[Round][1] ) );\
deoxys.c:        ^
deoxys.c: ...

Number of similar (implementation,compiler) pairs: 4, namely:
ImplementationCompiler
T:aesniclang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:aesniclang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:aesniclang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:aesniclang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)

Compiler output


deoxys.c: deoxys.c:105:11: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'deoxys_aead_encrypt' that is compiled without support for 'ssse3'
deoxys.c:     tmp = permute( tmp, H_PERMUTATION );
deoxys.c:           ^
deoxys.c: ./tweakable-cipher.macros:7:22: note: expanded from macro 'permute'
deoxys.c: #define permute(a,b) _mm_shuffle_epi8(a,b)
deoxys.c:                      ^
deoxys.c: deoxys.c:112:5: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'deoxys_aead_encrypt' that is compiled without support for 'ssse3'
deoxys.c:     TWEAKEY_SCHEDULE3( tsubkeys1,tsubkeys2,subkeys, key);
deoxys.c:     ^
deoxys.c: ./tweakable-cipher.macros:40:3: note: expanded from macro 'TWEAKEY_SCHEDULE3'
deoxys.c:   ONE_KEY_ROUND( subkeys1[ 0], subkeys1[ 1], subkeys2[ 0], subkeys2[ 1] );      ts[ 1] = xor( xor(subkeys1[ 1],subkeys2[ 1]), RCONS[ 1] ); \
deoxys.c:   ^
deoxys.c: ./tweakable-cipher.macros:33:16: note: expanded from macro 'ONE_KEY_ROUND'
deoxys.c:     new_key1 = permute( new_key1, H_PERMUTATION);\
deoxys.c:                ^
deoxys.c: ./tweakable-cipher.macros:7:22: note: expanded from macro 'permute'
deoxys.c: #define permute(a,b) _mm_shuffle_epi8(a,b)
deoxys.c:                      ^
deoxys.c: deoxys.c:112:5: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'deoxys_aead_encrypt' that is compiled without support for 'ssse3'
deoxys.c: ./tweakable-cipher.macros:40:3: note: expanded from macro 'TWEAKEY_SCHEDULE3'
deoxys.c:   ONE_KEY_ROUND( subkeys1[ 0], subkeys1[ 1], subkeys2[ 0], subkeys2[ 1] );      ts[ 1] = xor( xor(subkeys1[ 1],subkeys2[ 1]), RCONS[ 1] ); \
deoxys.c:   ^
deoxys.c: ./tweakable-cipher.macros:34:16: note: expanded from macro 'ONE_KEY_ROUND'
deoxys.c:     new_key2 = permute( new_key2, H_PERMUTATION);
deoxys.c:                ^
deoxys.c: ...

Number of similar (implementation,compiler) pairs: 1, namely:
ImplementationCompiler
T:aesniclang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)

Compiler output


deoxys.c: In file included from deoxys.c:29:
deoxys.c: deoxys.c: In function 'deoxys_aead_encrypt':
deoxys.c: /usr/lib/gcc/x86_64-linux-gnu/11/include/wmmintrin.h:61:1: error: inlining failed in call to 'always_inline' '_mm_aesenc_si128': target specific option mismatch
deoxys.c:    61 | _mm_aesenc_si128 (__m128i __X, __m128i __Y)
deoxys.c:       | ^~~~~~~~~~~~~~~~
deoxys.c: In file included from deoxys.c:31:
deoxys.c: tweakable-cipher.macros:1:18: note: called from here
deoxys.c:     1 | #define enc(a,b) _mm_aesenc_si128(a,b)
deoxys.c:       |                  ^~~~~~~~~~~~~~~~~~~~~
deoxys.c: tweakable-cipher.macros:186:8: note: in expansion of macro 'enc'
deoxys.c:   186 | s[7] = enc( s[7] , xor(tmp, tw_c[Round][7] ) );\
deoxys.c:       |        ^~~
deoxys.c: tweakable-cipher.macros:206:3: note: in expansion of macro 'ONE_ROUND'
deoxys.c:   206 |   ONE_ROUND( states , subkeys[16] , RT[ 0] , 16 );\
deoxys.c:       |   ^~~~~~~~~
deoxys.c: ae.macros:29:11: note: in expansion of macro 'AES8'
deoxys.c:    29 |           AES8        ( States, subkeys, tTweak );          \
deoxys.c:       |           ^~~~
deoxys.c: deoxys.c:154:15: note: in expansion of macro 'PROCESS_8_ASS_DATA_BLOCKS'
deoxys.c:   154 |               PROCESS_8_ASS_DATA_BLOCKS( States, ass_data + 16 * i , Auth );
deoxys.c:       |               ^~~~~~~~~~~~~~~~~~~~~~~~~
deoxys.c: In file included from deoxys.c:29:
deoxys.c: /usr/lib/gcc/x86_64-linux-gnu/11/include/wmmintrin.h:61:1: error: inlining failed in call to 'always_inline' '_mm_aesenc_si128': target specific option mismatch
deoxys.c:    61 | _mm_aesenc_si128 (__m128i __X, __m128i __Y)
deoxys.c:       | ^~~~~~~~~~~~~~~~
deoxys.c: ...

Number of similar (implementation,compiler) pairs: 4, namely:
ImplementationCompiler
T:aesnigcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:aesnigcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:aesnigcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:aesnigcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)

Compiler output


deoxys.c: deoxys.c:355:20: warning: variable 'Checksum' is uninitialized when used here [-Wuninitialized]
deoxys.c:     Checksum = xor(Checksum, Checksum);
deoxys.c:                    ^~~~~~~~
deoxys.c: ./tweakable-cipher.macros:5:32: note: expanded from macro 'xor'
deoxys.c: #define xor(a,b) _mm_xor_si128(a,b)
deoxys.c:                                ^
deoxys.c: deoxys.c:82:5: note: variable 'Checksum' is declared here
deoxys.c:     __m128i Checksum;
deoxys.c:     ^
deoxys.c: deoxys.c:99:16: warning: variable 'Auth' is uninitialized when used here [-Wuninitialized]
deoxys.c:     Auth = xor(Auth, Auth);
deoxys.c:                ^~~~
deoxys.c: ./tweakable-cipher.macros:5:32: note: expanded from macro 'xor'
deoxys.c: #define xor(a,b) _mm_xor_si128(a,b)
deoxys.c:                                ^
deoxys.c: deoxys.c:81:5: note: variable 'Auth' is declared here
deoxys.c:     __m128i Auth;
deoxys.c:     ^
deoxys.c: deoxys.c:100:17: warning: variable 'Tweak' is uninitialized when used here [-Wuninitialized]
deoxys.c:     Tweak = xor(Tweak, Tweak);
deoxys.c:                 ^~~~~
deoxys.c: ./tweakable-cipher.macros:5:32: note: expanded from macro 'xor'
deoxys.c: #define xor(a,b) _mm_xor_si128(a,b)
deoxys.c:                                ^
deoxys.c: deoxys.c:79:5: note: variable 'Tweak' is declared here
deoxys.c: ...

Number of similar (implementation,compiler) pairs: 5, namely:
ImplementationCompiler
T:aesnisclang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:aesnisclang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:aesnisclang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:aesnisclang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:aesnisclang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)

Compiler output


deoxys.c: In file included from deoxys.c:30:
deoxys.c: deoxys.c: In function 'deoxys_aead_encrypt':
deoxys.c: tweakable-cipher.macros:5:18: warning: 'Auth' is used uninitialized [-Wuninitialized]
deoxys.c:     5 | #define xor(a,b) _mm_xor_si128(a,b)
deoxys.c:       |                  ^~~~~~~~~~~~~
deoxys.c: deoxys.c:81:13: note: 'Auth' was declared here
deoxys.c:    81 |     __m128i Auth;
deoxys.c:       |             ^~~~
deoxys.c: In file included from deoxys.c:30:
deoxys.c: tweakable-cipher.macros:5:18: warning: 'Tweak' is used uninitialized [-Wuninitialized]
deoxys.c:     5 | #define xor(a,b) _mm_xor_si128(a,b)
deoxys.c:       |                  ^~~~~~~~~~~~~
deoxys.c: deoxys.c:79:13: note: 'Tweak' was declared here
deoxys.c:    79 |     __m128i Tweak;
deoxys.c:       |             ^~~~~
deoxys.c: In file included from deoxys.c:30:
deoxys.c: tweakable-cipher.macros:5:18: warning: 'Checksum' is used uninitialized [-Wuninitialized]
deoxys.c:     5 | #define xor(a,b) _mm_xor_si128(a,b)
deoxys.c:       |                  ^~~~~~~~~~~~~
deoxys.c: deoxys.c:82:13: note: 'Checksum' was declared here
deoxys.c:    82 |     __m128i Checksum;
deoxys.c:       |             ^~~~~~~~
deoxys.c: In file included from deoxys.c:30:
deoxys.c: deoxys.c: In function 'deoxys_aead_decrypt':
deoxys.c: tweakable-cipher.macros:5:18: warning: 'Auth' is used uninitialized [-Wuninitialized]
deoxys.c: ...

Number of similar (implementation,compiler) pairs: 4, namely:
ImplementationCompiler
T:aesnisgcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:aesnisgcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:aesnisgcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:aesnisgcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)

Compiler output


deoxys256.c: deoxys256.c:249:18: warning: variable 'CHECKSUM' is uninitialized when used here [-Wuninitialized]
deoxys256.c:     CHECKSUM=XOR(CHECKSUM,CHECKSUM);
deoxys256.c:                  ^~~~~~~~
deoxys256.c: ./deoxysi256.macros:38:39: note: expanded from macro 'XOR'
deoxys256.c: #define XOR(a,b)        _mm_xor_si128(a,b)
deoxys256.c:                                       ^
deoxys256.c: deoxys256.c:159:5: note: variable 'CHECKSUM' is declared here
deoxys256.c:     __m128i CHECKSUM;
deoxys256.c:     ^
deoxys256.c: deoxys256.c:175:14: warning: variable 'AUTH' is uninitialized when used here [-Wuninitialized]
deoxys256.c:     AUTH=XOR(AUTH,AUTH);
deoxys256.c:              ^~~~
deoxys256.c: ./deoxysi256.macros:38:39: note: expanded from macro 'XOR'
deoxys256.c: #define XOR(a,b)        _mm_xor_si128(a,b)
deoxys256.c:                                       ^
deoxys256.c: deoxys256.c:158:5: note: variable 'AUTH' is declared here
deoxys256.c:     __m128i AUTH;
deoxys256.c:     ^
deoxys256.c: deoxys256.c:461:18: warning: variable 'CHECKSUM' is uninitialized when used here [-Wuninitialized]
deoxys256.c:     CHECKSUM=XOR(CHECKSUM,CHECKSUM);
deoxys256.c:                  ^~~~~~~~
deoxys256.c: ./deoxysi256.macros:38:39: note: expanded from macro 'XOR'
deoxys256.c: #define XOR(a,b)        _mm_xor_si128(a,b)
deoxys256.c:                                       ^
deoxys256.c: deoxys256.c:367:5: note: variable 'CHECKSUM' is declared here
deoxys256.c: ...

Number of similar (implementation,compiler) pairs: 4, namely:
ImplementationCompiler
T:bitsliceclang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:bitsliceclang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:bitsliceclang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:bitsliceclang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)

Compiler output


deoxys256.c: deoxys256.c:249:18: warning: variable 'CHECKSUM' is uninitialized when used here [-Wuninitialized]
deoxys256.c:     CHECKSUM=XOR(CHECKSUM,CHECKSUM);
deoxys256.c:                  ^~~~~~~~
deoxys256.c: ./deoxysi256.macros:38:39: note: expanded from macro 'XOR'
deoxys256.c: #define XOR(a,b)        _mm_xor_si128(a,b)
deoxys256.c:                                       ^
deoxys256.c: deoxys256.c:159:5: note: variable 'CHECKSUM' is declared here
deoxys256.c:     __m128i CHECKSUM;
deoxys256.c:     ^
deoxys256.c: deoxys256.c:175:14: warning: variable 'AUTH' is uninitialized when used here [-Wuninitialized]
deoxys256.c:     AUTH=XOR(AUTH,AUTH);
deoxys256.c:              ^~~~
deoxys256.c: ./deoxysi256.macros:38:39: note: expanded from macro 'XOR'
deoxys256.c: #define XOR(a,b)        _mm_xor_si128(a,b)
deoxys256.c:                                       ^
deoxys256.c: deoxys256.c:158:5: note: variable 'AUTH' is declared here
deoxys256.c:     __m128i AUTH;
deoxys256.c:     ^
deoxys256.c: deoxys256.c:179:5: error: always_inline function '_mm_shuffle_epi8' requires target feature 'ssse3', but would be inlined into function 'deoxys_aead_encrypt_8' that is compiled without support for 'ssse3'
deoxys256.c:     KEY_SCHEDULE(key, subkey);
deoxys256.c:     ^
deoxys256.c: ./deoxysi256.macros:79:5: note: expanded from macro 'KEY_SCHEDULE'
deoxys256.c:     packing(subkey1[0]);\
deoxys256.c:     ^
deoxys256.c: ./deoxysi256.macros:457:14: note: expanded from macro 'packing'
deoxys256.c: ...

Number of similar (implementation,compiler) pairs: 1, namely:
ImplementationCompiler
T:bitsliceclang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)