Implementation notes: amd64, speed2supercop, crypto_stream/aes256ctr

Computer: speed2supercop
Microarchitecture: amd64; Haswell+AES (306c3)
Architecture: amd64
CPU ID: GenuineIntel-000306c3-1fc9cbf5
SUPERCOP version: 20240625
Operation: crypto_stream
Primitive: aes256ctr
Time | Object size | Test size | Implementation | Compiler | Benchmark date | SUPERCOP version
1708 | 4751 0 0 | 19397 752 864 | dolbeau/aesenc-int | gcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240703 | 20240625
1720 | 3382 0 0 | 19598 792 808 | dolbeau/aesenc-int | clang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240703 | 20240625
1732 | 3382 0 0 | 19406 792 808 | dolbeau/aesenc-int | clang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240703 | 20240625
3196 | 20958 2800 0 | 1650049 145296 10232 | T:cryptopp | g++_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240703 | 20240625
3224 | 14816 2800 0 | 1653214 145336 10168 | T:cryptopp | clang++_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240703 | 20240625
3236 | 12017 2064 0 | 1650126 145336 10168 | T:cryptopp | clang++_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240703 | 20240625
3240 | 14900 4848 0 | 1655978 147696 10136 | T:cryptopp | clang++_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240703 | 20240625
3240 | 19731 2800 0 | 1648079 145296 10232 | T:cryptopp | g++_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240703 | 20240625
3244 | 9274 1480 0 | 1644495 145328 10200 | T:cryptopp | clang++_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240703 | 20240625
3280 | 8354 3272 0 | 1640281 146360 10200 | T:cryptopp | g++_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240703 | 20240625
3328 | 21173 2800 0 | 1652377 145296 10232 | T:cryptopp | g++_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240703 | 20240625
3508 | 437 0 0 | 13197 848 792 | T:openssl | clang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240703 | 20240625
3512 | 456 0 0 | 13707 824 864 | T:openssl | gcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240703 | 20240625
3516 | 428 0 0 | 17301 848 808 | T:openssl | clang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240703 | 20240625
3516 | 623 0 0 | 15923 824 864 | T:openssl | gcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240703 | 20240625
3516 | 437 0 0 | 13411 824 864 | T:openssl | gcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240703 | 20240625
3520 | 428 0 0 | 17109 848 808 | T:openssl | clang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240703 | 20240625
3528 | 428 0 0 | 15877 848 792 | T:openssl | clang_-mcpu=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240703 | 20240625
3540 | 358 0 0 | 12110 800 832 | T:openssl | gcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240703 | 20240625
3556 | 423 0 0 | 14102 840 856 | T:openssl | clang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240703 | 20240625
4344 | 3067 0 0 | 15046 792 792 | dolbeau/aesenc-int | clang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240703 | 20240625
4372 | 1918 0 0 | 14533 752 864 | dolbeau/aesenc-int | gcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240703 | 20240625
4424 | 1949 0 0 | 14212 744 864 | dolbeau/aesenc-int | gcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240703 | 20240625
4932 | 2948 0 0 | 15823 784 856 | dolbeau/aesenc-int | clang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240703 | 20240625
5004 | 1788 0 0 | 12904 728 832 | dolbeau/aesenc-int | gcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240703 | 20240625

Compiler output


aesenc-int.c: aesenc-int.c:81:20: warning: unused function 'aesni_encrypt1' [-Wunused-function]
aesenc-int.c: static inline void aesni_encrypt1(unsigned char *out, unsigned char *n, __m128i rkeys[16]) {
aesenc-int.c:                    ^
aesenc-int.c: aesenc-int.c:97:20: warning: unused function 'incle' [-Wunused-function]
aesenc-int.c: static inline void incle(unsigned char n[16]) {
aesenc-int.c:                    ^
aesenc-int.c: aesenc-int.c:195:1: warning: unused function 'aesni_encrypt4' [-Wunused-function]
aesenc-int.c: FUNC(4, MAKE4)
aesenc-int.c: ^
aesenc-int.c: aesenc-int.c:172:22: note: expanded from macro 'FUNC'
aesenc-int.c:   static inline void aesni_encrypt##N(unsigned char *out, unsigned char *n, __m128i rkeys[16]) { \
aesenc-int.c:                      ^
aesenc-int.c: <scratch space>:182:1: note: expanded from here
aesenc-int.c: aesni_encrypt4
aesenc-int.c: ^
aesenc-int.c: aesenc-int.c:196:1: warning: unused function 'aesni_encrypt6' [-Wunused-function]
aesenc-int.c: FUNC(6, MAKE6)
aesenc-int.c: ^
aesenc-int.c: aesenc-int.c:172:22: note: expanded from macro 'FUNC'
aesenc-int.c:   static inline void aesni_encrypt##N(unsigned char *out, unsigned char *n, __m128i rkeys[16]) { \
aesenc-int.c:                      ^
aesenc-int.c: <scratch space>:231:1: note: expanded from here
aesenc-int.c: aesni_encrypt6
aesenc-int.c: ^
aesenc-int.c: aesenc-int.c:197:1: warning: unused function 'aesni_encrypt7' [-Wunused-function]
aesenc-int.c: ...

Number of similar (implementation,compiler) pairs: 5, namely:
Implementation | Compiler
dolbeau/aesenc-int | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_16.0.6_(27+b1))
dolbeau/aesenc-int | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_16.0.6_(27+b1))
dolbeau/aesenc-int | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_16.0.6_(27+b1))
dolbeau/aesenc-int | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_16.0.6_(27+b1))
dolbeau/aesenc-int | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_16.0.6_(27+b1))
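
The clang warnings above are -Wunused-function reports for static inline helpers, several of them generated by the FUNC(N, ...) macro, that the translation unit never calls. A minimal sketch of one way to keep such generated helpers without the warning is shown below. It assumes GCC or clang, and every EX_/ex_ name is hypothetical; nothing here is taken from aesenc-int.c beyond the macro-generation pattern visible in the log.

```c
#include <assert.h>

/* Hypothetical macro: marks a function as intentionally unused on GCC/clang. */
#if defined(__GNUC__) || defined(__clang__)
#define EX_MAYBE_UNUSED __attribute__((unused))
#else
#define EX_MAYBE_UNUSED
#endif

/* Generates ex_sum1, ex_sum4, ... in the same style as FUNC() generating
   aesni_encrypt##N; each variant sums the first N entries of v. */
#define EX_FUNC(N)                                            \
  static inline EX_MAYBE_UNUSED int ex_sum##N(const int *v) { \
    int i, s = 0;                                             \
    for (i = 0; i < (N); ++i) s += v[i];                      \
    return s;                                                 \
  }

EX_FUNC(1)
EX_FUNC(4)
EX_FUNC(8)

/* Only the 8-wide variant is called here; the others stay unused
   without triggering -Wunused-function. */
int ex_total(const int v[8]) { return ex_sum8(v); }
```

The attribute only silences the diagnostic; it does not change code generation, so the reported timings should be unaffected.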

Compiler output


aesenc-int.c: aesenc-int.c:23: warning: "_bswap64" redefined
aesenc-int.c:    23 | #define _bswap64(a) __builtin_bswap64(a)
aesenc-int.c:       |
aesenc-int.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/13/include/x86gprintrin.h:33,
aesenc-int.c:                  from /usr/lib/gcc/x86_64-linux-gnu/13/include/immintrin.h:27,
aesenc-int.c:                  from aesenc-int.c:12:
aesenc-int.c: /usr/lib/gcc/x86_64-linux-gnu/13/include/ia32intrin.h:273: note: this is the location of the previous definition
aesenc-int.c:   273 | #define _bswap64(a)             __bswapq(a)
aesenc-int.c:       |
aesenc-int.c: aesenc-int.c:24: warning: "_bswap" redefined
aesenc-int.c:    24 | #define _bswap(a) __builtin_bswap(a)
aesenc-int.c:       |
aesenc-int.c: /usr/lib/gcc/x86_64-linux-gnu/13/include/ia32intrin.h:307: note: this is the location of the previous definition
aesenc-int.c:   307 | #define _bswap(a)               __bswapd(a)
aesenc-int.c:       |
aesenc-int.c: aesenc-int.c: In function 'aesni_encrypt1':
aesenc-int.c: aesenc-int.c:85: warning: ignoring '#pragma unroll ' [-Wunknown-pragmas]
aesenc-int.c:    85 | #pragma unroll(13)
aesenc-int.c:       |

Number of similar (implementation,compiler) pairs: 4, namely:
Implementation | Compiler
dolbeau/aesenc-int | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)
dolbeau/aesenc-int | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)
dolbeau/aesenc-int | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)
dolbeau/aesenc-int | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)
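
The gcc output above shows two separate issues: the file redefines _bswap and _bswap64, which GCC 13's ia32intrin.h already defines, and gcc ignores the clang-style `#pragma unroll(13)`. The sketch below illustrates one possible way to avoid both, assuming GCC 8 or later or clang. The EX_/ex_ names are hypothetical and the loop body is a stand-in, not the AES round code.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical names: a private prefix avoids colliding with the _bswap and
   _bswap64 macros that GCC's ia32intrin.h already provides. */
#define EX_BSWAP32(a) __builtin_bswap32(a)
#define EX_BSWAP64(a) __builtin_bswap64(a)

/* Toy loop with 13 iterations, matching the unroll count in the warning.
   The unroll hint is chosen per compiler so neither one ignores it. */
uint64_t ex_mix(uint64_t x) {
  int i;
#if defined(__clang__)
#pragma unroll(13)
#elif defined(__GNUC__) && (__GNUC__ >= 8)
#pragma GCC unroll 13
#endif
  for (i = 0; i < 13; ++i)
    x = EX_BSWAP64(x) ^ (uint64_t)i;
  return x;
}
```

The `__clang__` check comes first because clang also defines `__GNUC__`. Whether an explicit unroll hint changes the measured timings on this machine is not something the log above shows.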

Compiler output


vaesenc-int.c: vaesenc-int.c:19:2: error: "This code requires VAES enabled in the compiler."
vaesenc-int.c: #error "This code requires VAES enabled in the compiler."
vaesenc-int.c:  ^
vaesenc-int.c: 1 error generated.

Number of similar (implementation,compiler) pairs: 4, namely:
Implementation | Compiler
dolbeau/vaesenc-int | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_16.0.6_(27+b1))
dolbeau/vaesenc-int | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_16.0.6_(27+b1))
dolbeau/vaesenc-int | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_16.0.6_(27+b1))
dolbeau/vaesenc-int | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_16.0.6_(27+b1))

Compiler output


vaesenc-int.c: vaesenc-int.c:15:2: error: "This code requires AVX enabled in the compiler."
vaesenc-int.c: #error "This code requires AVX enabled in the compiler."
vaesenc-int.c:  ^
vaesenc-int.c: vaesenc-int.c:19:2: error: "This code requires VAES enabled in the compiler."
vaesenc-int.c: #error "This code requires VAES enabled in the compiler."
vaesenc-int.c:  ^
vaesenc-int.c: 2 errors generated.

Number of similar (implementation,compiler) pairs: 1, namely:
Implementation | Compiler
dolbeau/vaesenc-int | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_16.0.6_(27+b1))

Compiler output


vaesenc-int.c: vaesenc-int.c:19:2: error: #error "This code requires VAES enabled in the compiler."
vaesenc-int.c:    19 | #error "This code requires VAES enabled in the compiler."
vaesenc-int.c:       |  ^~~~~
vaesenc-int.c: vaesenc-int.c: In function 'aesni_encrypt1':
vaesenc-int.c: vaesenc-int.c:123: warning: ignoring '#pragma unroll ' [-Wunknown-pragmas]
vaesenc-int.c:   123 | #pragma unroll(13)
vaesenc-int.c:       |

Number of similar (implementation,compiler) pairs: 4, namely:
Implementation | Compiler
dolbeau/vaesenc-int | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)
dolbeau/vaesenc-int | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)
dolbeau/vaesenc-int | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)
dolbeau/vaesenc-int | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)
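
All vaesenc-int builds above stop at `#error` directives, which suggests the file is guarded by compile-time feature checks. With -march=native on this Haswell machine the compilers apparently do not enable VAES, and with clang -mcpu=native not AVX either. As a rough illustration of how such guards behave, the sketch below reports which code path the active flags would permit instead of aborting the build. It assumes the predefined macros `__VAES__`, `__AVX__` and `__AES__` that GCC and clang set for -mvaes, -mavx and -maes; `ex_aes_path` is a hypothetical helper, and the actual conditions in vaesenc-int.c are not visible in the log.

```c
#include <assert.h>
#include <string.h>

/* Hypothetical helper, not part of SUPERCOP: names the code path that the
   current compiler flags would allow. */
const char *ex_aes_path(void) {
#if defined(__VAES__) && defined(__AVX__)
  return "vaes";      /* vector AES instructions available */
#elif defined(__AES__)
  return "aesni";     /* 128-bit AES-NI only, as used by aesenc-int */
#else
  return "portable";  /* no AES instructions enabled at compile time */
#endif
}
```

Built with the flags from the tables above, this would be expected to select "aesni" on the benchmarked CPU, consistent with aesenc-int compiling while vaesenc-int does not.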

Passed TIMECOP


TIMECOP iterations: 10

Number of similar (implementation,compiler) pairs: 8, namely:
Implementation | Compiler
dolbeau/aesenc-int | clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_16.0.6_(27+b1))
dolbeau/aesenc-int | clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_16.0.6_(27+b1))
dolbeau/aesenc-int | clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_16.0.6_(27+b1))
dolbeau/aesenc-int | clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_16.0.6_(27+b1))
dolbeau/aesenc-int | gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)
dolbeau/aesenc-int | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)
dolbeau/aesenc-int | gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)
dolbeau/aesenc-int | gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)