Implementation notes: amd64, hydra8, crypto_aead/aezv3
Computer: hydra8
Microarchitecture: amd64; Ivy Bridge+AES (306a9)
Architecture: amd64
CPU ID: GenuineIntel-000306a9-bfebfbff
SUPERCOP version: 20240625
Operation: crypto_aead
Primitive: aezv3
Time | Object size | Test size | Implementation | Compiler | Benchmark date | SUPERCOP version |
5139 | 10444 0 0 | 30008 860 1024 | T:aesni | clang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
5159 | 10756 0 0 | 32256 860 1024 | T:aesni | clang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
5181 | 10756 0 0 | 34200 860 1024 | T:aesni | clang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
5211 | 9643 0 0 | 29090 852 1024 | T:aesni | clang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
5241 | 17212 0 0 | 40101 804 1088 | T:aesni | gcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
5255 | 10326 0 0 | 31989 804 1088 | T:aesni | gcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
5437 | 9134 0 0 | 28640 780 1056 | T:aesni | gcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
5547 | 10721 0 0 | 31876 796 1088 | T:aesni | gcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
819543 | 33822 0 0 | 57720 876 1024 | T:ref | clang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
833279 | 31255 0 0 | 53200 876 1024 | T:ref | clang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
914757 | 38063 0 0 | 60600 876 1024 | T:ref | clang_-mcpu=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
1102225 | 55172 0 0 | 78396 828 1088 | T:ref | gcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
1116580 | 22187 0 0 | 41954 868 1024 | T:ref | clang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
1214253 | 23460 0 0 | 43496 876 1024 | T:ref | clang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
1289003 | 25972 0 0 | 47908 828 1088 | T:ref | gcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
1433404 | 25395 0 0 | 46892 828 1088 | T:ref | gcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
1618053 | 22875 0 0 | 42599 804 1056 | T:ref | gcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall | 20240626 | 20240625 |
Compiler output
aez_ni.c: aez_ni.c:146:22: error: '__builtin_ia32_vec_set_v16qi' needs target feature sse4.1
aez_ni.c: __m128i i1 = _mm_insert_epi8(zero, 1, 7);
aez_ni.c: ^
aez_ni.c: /usr/lib/llvm-14/lib/clang/14.0.0/include/smmintrin.h:930:13: note: expanded from macro '_mm_insert_epi8'
aez_ni.c: ((__m128i)__builtin_ia32_vec_set_v16qi((__v16qi)(__m128i)(X), \
aez_ni.c: ^
aez_ni.c: aez_ni.c:147:22: error: '__builtin_ia32_vec_set_v16qi' needs target feature sse4.1
aez_ni.c: __m128i i2 = _mm_insert_epi8(zero, 2, 7);
aez_ni.c: ^
aez_ni.c: /usr/lib/llvm-14/lib/clang/14.0.0/include/smmintrin.h:930:13: note: expanded from macro '_mm_insert_epi8'
aez_ni.c: ((__m128i)__builtin_ia32_vec_set_v16qi((__v16qi)(__m128i)(X), \
aez_ni.c: ^
aez_ni.c: aez_ni.c:148:22: error: '__builtin_ia32_vec_set_v16qi' needs target feature sse4.1
aez_ni.c: __m128i i3 = _mm_insert_epi8(zero, 3, 7);
aez_ni.c: ^
aez_ni.c: /usr/lib/llvm-14/lib/clang/14.0.0/include/smmintrin.h:930:13: note: expanded from macro '_mm_insert_epi8'
aez_ni.c: ((__m128i)__builtin_ia32_vec_set_v16qi((__v16qi)(__m128i)(X), \
aez_ni.c: ^
aez_ni.c: aez_ni.c:149:26: error: '__builtin_ia32_vec_set_v16qi' needs target feature sse4.1
aez_ni.c: __m128i j, one = _mm_insert_epi8(zero, 1, 15);
aez_ni.c: ^
aez_ni.c: /usr/lib/llvm-14/lib/clang/14.0.0/include/smmintrin.h:930:13: note: expanded from macro '_mm_insert_epi8'
aez_ni.c: ((__m128i)__builtin_ia32_vec_set_v16qi((__v16qi)(__m128i)(X), \
aez_ni.c: ^
aez_ni.c: 4 errors generated.
Number of similar (implementation,compiler) pairs: 1, namely:
Implementation | Compiler |
T:aesni | clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0) |
Compiler output
aez_ni.c: In function 'load_partial',
aez_ni.c: inlined from 'load_partial' at aez_ni.c:119:16,
aez_ni.c: inlined from 'cipher_aez_tiny' at aez_ni.c:498:18,
aez_ni.c: inlined from 'aez_encrypt' at aez_ni.c:588:9,
aez_ni.c: inlined from 'crypto_aead_aezv3_aesni_timingleaks_encrypt' at aez_ni.c:637:5:
aez_ni.c: aez_ni.c:123:46: warning: '__builtin_memcpy' forming offset [16, 4294967263] is out of the bounds [0, 16] of object 'tmp' with type '__m128i' [-Warray-bounds]
aez_ni.c: 123 | for (i=0; i<n; i++) ((char*)&tmp)[i] = ((char*)p)[i];
aez_ni.c: | ~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~
aez_ni.c: aez_ni.c: In function 'crypto_aead_aezv3_aesni_timingleaks_encrypt':
aez_ni.c: aez_ni.c:122:17: note: 'tmp' declared here
aez_ni.c: 122 | __m128i tmp; unsigned i;
aez_ni.c: | ^~~
Number of similar (implementation,compiler) pairs: 1, namely:
Implementation | Compiler |
T:aesni | gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0) |