Implementation notes: amd64, hydra5, crypto_stream/aes256ctr

Computer: hydra5
Microarchitecture: amd64; K10 32nm (300f10)
Architecture: amd64
CPU ID: AuthenticAMD-00300f10-178bfbff
SUPERCOP version: 20240625
Operation: crypto_stream
Primitive: aes256ctr
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
5049511856 2064 01767941 145212 10304T:cryptoppclang++_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024062820240625
5056520796 2800 01773460 145164 10400T:cryptoppg++_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062820240625
5058514743 4848 01776389 147572 10304T:cryptoppclang++_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024062820240625
5059911938 2064 01768905 145212 10304T:cryptoppclang++_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024062820240625
5060222208 2800 01775983 145148 10400T:cryptoppg++_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062820240625
5066120385 2800 01770978 145164 10400T:cryptoppg++_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062820240625
507819259 1480 01763463 145196 10304T:cryptoppclang++_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024062820240625
507839564 3272 01763592 146228 10368T:cryptoppg++_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062820240625
78475779 0 017475 876 992T:opensslgcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062820240625
78504428 0 015303 916 960T:opensslclang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024062820240625
78570428 0 016191 916 960T:opensslclang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024062820240625
78580535 0 013726 852 992T:opensslgcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062820240625
78612610 0 015115 876 992T:opensslgcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062820240625
78632437 0 013935 916 960T:opensslclang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024062820240625
78660428 0 016191 916 960T:opensslclang_-mcpu=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024062820240625
78691423 0 013209 908 960T:opensslclang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024062820240625
78700621 0 015907 876 992T:opensslgcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062820240625

Compiler output


aesenc-int.c: aesenc-int.c:81:20: warning: unused function 'aesni_encrypt1' [-Wunused-function]
aesenc-int.c: static inline void aesni_encrypt1(unsigned char *out, unsigned char *n, __m128i rkeys[16]) {
aesenc-int.c:                    ^
aesenc-int.c: aesenc-int.c:97:20: warning: unused function 'incle' [-Wunused-function]
aesenc-int.c: static inline void incle(unsigned char n[16]) {
aesenc-int.c:                    ^
aesenc-int.c: aesenc-int.c:195:1: warning: unused function 'aesni_encrypt4' [-Wunused-function]
aesenc-int.c: FUNC(4, MAKE4)
aesenc-int.c: ^
aesenc-int.c: aesenc-int.c:172:22: note: expanded from macro 'FUNC'
aesenc-int.c:   static inline void aesni_encrypt##N(unsigned char *out, unsigned char *n, __m128i rkeys[16]) { \
aesenc-int.c:                      ^
aesenc-int.c: <scratch space>:168:1: note: expanded from here
aesenc-int.c: aesni_encrypt4
aesenc-int.c: ^
aesenc-int.c: aesenc-int.c:196:1: warning: unused function 'aesni_encrypt6' [-Wunused-function]
aesenc-int.c: FUNC(6, MAKE6)
aesenc-int.c: ^
aesenc-int.c: aesenc-int.c:172:22: note: expanded from macro 'FUNC'
aesenc-int.c:   static inline void aesni_encrypt##N(unsigned char *out, unsigned char *n, __m128i rkeys[16]) { \
aesenc-int.c:                      ^
aesenc-int.c: <scratch space>:217:1: note: expanded from here
aesenc-int.c: aesni_encrypt6
aesenc-int.c: ^
aesenc-int.c: aesenc-int.c:197:1: warning: unused function 'aesni_encrypt7' [-Wunused-function]
aesenc-int.c: ...

Number of similar (implementation,compiler) pairs: 5, namely:
ImplementationCompiler
dolbeau/aesenc-intclang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
dolbeau/aesenc-intclang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
dolbeau/aesenc-intclang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
dolbeau/aesenc-intclang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
dolbeau/aesenc-intclang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)

Compiler output


aesenc-int.c: aesenc-int.c:23: warning: "_bswap64" redefined
aesenc-int.c:    23 | #define _bswap64(a) __builtin_bswap64(a)
aesenc-int.c:       |
aesenc-int.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/x86gprintrin.h:33,
aesenc-int.c:                  from /usr/lib/gcc/x86_64-linux-gnu/11/include/immintrin.h:27,
aesenc-int.c:                  from aesenc-int.c:12:
aesenc-int.c: /usr/lib/gcc/x86_64-linux-gnu/11/include/ia32intrin.h:273: note: this is the location of the previous definition
aesenc-int.c:   273 | #define _bswap64(a)             __bswapq(a)
aesenc-int.c:       |
aesenc-int.c: aesenc-int.c:24: warning: "_bswap" redefined
aesenc-int.c:    24 | #define _bswap(a) __builtin_bswap(a)
aesenc-int.c:       |
aesenc-int.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/x86gprintrin.h:33,
aesenc-int.c:                  from /usr/lib/gcc/x86_64-linux-gnu/11/include/immintrin.h:27,
aesenc-int.c:                  from aesenc-int.c:12:
aesenc-int.c: /usr/lib/gcc/x86_64-linux-gnu/11/include/ia32intrin.h:307: note: this is the location of the previous definition
aesenc-int.c:   307 | #define _bswap(a)               __bswapd(a)
aesenc-int.c:       |
aesenc-int.c: aesenc-int.c: In function 'aesni_encrypt1':
aesenc-int.c: aesenc-int.c:85: warning: ignoring '#pragma unroll ' [-Wunknown-pragmas]
aesenc-int.c:    85 | #pragma unroll(13)
aesenc-int.c:       |
aesenc-int.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/immintrin.h:41,
aesenc-int.c:                  from aesenc-int.c:12:
aesenc-int.c: aesenc-int.c: In function 'aesni_key256_expand':
aesenc-int.c: ...

Number of similar (implementation,compiler) pairs: 4, namely:
ImplementationCompiler
dolbeau/aesenc-intgcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
dolbeau/aesenc-intgcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
dolbeau/aesenc-intgcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
dolbeau/aesenc-intgcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)

Compiler output


vaesenc-int.c: vaesenc-int.c:15:2: error: "This code requires AVX enabled in the compiler."
vaesenc-int.c: #error "This code requires AVX enabled in the compiler."
vaesenc-int.c:  ^
vaesenc-int.c: vaesenc-int.c:19:2: error: "This code requires VAES enabled in the compiler."
vaesenc-int.c: #error "This code requires VAES enabled in the compiler."
vaesenc-int.c:  ^
vaesenc-int.c: 2 errors generated.

Number of similar (implementation,compiler) pairs: 5, namely:
ImplementationCompiler
dolbeau/vaesenc-intclang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
dolbeau/vaesenc-intclang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
dolbeau/vaesenc-intclang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
dolbeau/vaesenc-intclang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
dolbeau/vaesenc-intclang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)

Compiler output


vaesenc-int.c: vaesenc-int.c:15:2: error: #error "This code requires AVX enabled in the compiler."
vaesenc-int.c:    15 | #error "This code requires AVX enabled in the compiler."
vaesenc-int.c:       |  ^~~~~
vaesenc-int.c: vaesenc-int.c:19:2: error: #error "This code requires VAES enabled in the compiler."
vaesenc-int.c:    19 | #error "This code requires VAES enabled in the compiler."
vaesenc-int.c:       |  ^~~~~
vaesenc-int.c: vaesenc-int.c: In function 'aesni_encrypt1':
vaesenc-int.c: vaesenc-int.c:123: warning: ignoring '#pragma unroll ' [-Wunknown-pragmas]
vaesenc-int.c:   123 | #pragma unroll(13)
vaesenc-int.c:       |
vaesenc-int.c: vaesenc-int.c: In function 'aesni_key256_expand':
vaesenc-int.c: vaesenc-int.c:71:16: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
vaesenc-int.c:    71 |   rkeys[idx++] = _mmAES_broadcast_i32x4(key0);
vaesenc-int.c:       |                ^

Number of similar (implementation,compiler) pairs: 4, namely:
ImplementationCompiler
dolbeau/vaesenc-intgcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
dolbeau/vaesenc-intgcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
dolbeau/vaesenc-intgcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
dolbeau/vaesenc-intgcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)