Implementation notes: amd64, hunsnivy, crypto_aead/minalpherv11

Computer: hunsnivy
Microarchitecture: amd64; Ivy Bridge+AES (306a9)
Architecture: amd64
CPU ID: GenuineIntel-000306a9-bfebfbff
SUPERCOP version: 20240625
Operation: crypto_aead
Primitive: minalpherv11
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
30143658793 0 024338 788 1080T:refgcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062820240625
322023213634 0 031584 812 1016T:refclang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024062820240625
325112912185 0 028448 812 1016T:refclang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024062820240625
327733116088 0 032440 812 1016T:refclang_-mcpu=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024062820240625
39016686850 0 021426 788 1080T:refgcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062820240625
42228975005 0 018438 804 1016T:refclang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024062820240625
44096465070 0 018101 764 1048T:refgcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062820240625
49175015661 0 019630 804 1016T:refclang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024062820240625
60265906076 0 020247 772 1080T:refgcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062820240625

Compiler output


MinalpherCore.cpp: MinalpherCore.cpp:88:17: error: '__builtin_ia32_permdi256' needs target feature avx2
MinalpherCore.cpp:   __m256i tmp = _mm256_permute4x64_epi64(DATA, 0xD8);
MinalpherCore.cpp:                 ^
MinalpherCore.cpp: /usr/lib/llvm-11/lib/clang/11.0.1/include/avx2intrin.h:818:12: note: expanded from macro '_mm256_permute4x64_epi64'
MinalpherCore.cpp:   (__m256i)__builtin_ia32_permdi256((__v4di)(__m256i)(V), (int)(M))
MinalpherCore.cpp:            ^
MinalpherCore.cpp: MinalpherCore.cpp:92:10: error: always_inline function '_mm256_and_si256' requires target feature 'avx2', but would be inlined into function 'minalpher_core_single_block' that is compiled without support for 'avx2'
MinalpherCore.cpp:   tmp0 = _mm256_and_si256(ymmMask, tmp);
MinalpherCore.cpp:          ^
MinalpherCore.cpp: MinalpherCore.cpp:93:10: error: always_inline function '_mm256_andnot_si256' requires target feature 'avx2', but would be inlined into function 'minalpher_core_single_block' that is compiled without support for 'avx2'
MinalpherCore.cpp:   tmp1 = _mm256_andnot_si256(ymmMask, tmp);
MinalpherCore.cpp:          ^
MinalpherCore.cpp: MinalpherCore.cpp:96:10: error: always_inline function '_mm256_srli_epi16' requires target feature 'avx2', but would be inlined into function 'minalpher_core_single_block' that is compiled without support for 'avx2'
MinalpherCore.cpp:   tmp1 = _mm256_srli_epi16(tmp1, 4);
MinalpherCore.cpp:          ^
MinalpherCore.cpp: MinalpherCore.cpp:99:18: error: always_inline function '_mm256_unpacklo_epi8' requires target feature 'avx2', but would be inlined into function 'minalpher_core_single_block' that is compiled without support for 'avx2'
MinalpherCore.cpp:   __m256i ymm0 = _mm256_unpacklo_epi8(tmp0, tmp1);
MinalpherCore.cpp:                  ^
MinalpherCore.cpp: MinalpherCore.cpp:100:18: error: always_inline function '_mm256_unpackhi_epi8' requires target feature 'avx2', but would be inlined into function 'minalpher_core_single_block' that is compiled without support for 'avx2'
MinalpherCore.cpp:   __m256i ymm1 = _mm256_unpackhi_epi8(tmp0, tmp1);
MinalpherCore.cpp:                  ^
MinalpherCore.cpp: MinalpherCore.cpp:128:3: error: always_inline function '_mm256_shuffle_epi8' requires target feature 'avx2', but would be inlined into function 'minalpher_core_single_block' that is compiled without support for 'avx2'
MinalpherCore.cpp:   singleRoundFunctionH( 0, ymm0, ymm1);
MinalpherCore.cpp:   ^
MinalpherCore.cpp: MinalpherCore.cpp:112:12: note: expanded from macro 'singleRoundFunctionH'
MinalpherCore.cpp: ...

Number of similar (implementation,compiler) pairs: 4, namely:
ImplementationCompiler
T:avx2clang++ -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:avx2clang++ -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:avx2clang++ -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:avx2clang++ -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)

Compiler output


MinalpherCore.cpp: In file included from /usr/lib/gcc/x86_64-linux-gnu/10/include/immintrin.h:53,
MinalpherCore.cpp:                  from /usr/lib/gcc/x86_64-linux-gnu/10/include/x86intrin.h:32,
MinalpherCore.cpp:                  from define.h:47,
MinalpherCore.cpp:                  from MinalpherCore.cpp:33:
MinalpherCore.cpp: /usr/lib/gcc/x86_64-linux-gnu/10/include/avx2intrin.h: In function 'void minalpher_core_single_block(const __m256i&, __m256i&, __m256i&)':
MinalpherCore.cpp: /usr/lib/gcc/x86_64-linux-gnu/10/include/avx2intrin.h:588:1: error: inlining failed in call to 'always_inline' '__m256i _mm256_shuffle_epi8(__m256i, __m256i)': target specific option mismatch
MinalpherCore.cpp:   588 | _mm256_shuffle_epi8 (__m256i __X, __m256i __Y)
MinalpherCore.cpp:       | ^~~~~~~~~~~~~~~~~~~
MinalpherCore.cpp: MinalpherCore.cpp:150:30: note: called from here
MinalpherCore.cpp:   150 |   ymm1_ = _mm256_shuffle_epi8(ymm1, ShuffleSingleBlock1);
MinalpherCore.cpp:       |           ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~
MinalpherCore.cpp: In file included from /usr/lib/gcc/x86_64-linux-gnu/10/include/immintrin.h:53,
MinalpherCore.cpp:                  from /usr/lib/gcc/x86_64-linux-gnu/10/include/x86intrin.h:32,
MinalpherCore.cpp:                  from define.h:47,
MinalpherCore.cpp:                  from MinalpherCore.cpp:33:
MinalpherCore.cpp: /usr/lib/gcc/x86_64-linux-gnu/10/include/avx2intrin.h:588:1: error: inlining failed in call to 'always_inline' '__m256i _mm256_shuffle_epi8(__m256i, __m256i)': target specific option mismatch
MinalpherCore.cpp:   588 | _mm256_shuffle_epi8 (__m256i __X, __m256i __Y)
MinalpherCore.cpp:       | ^~~~~~~~~~~~~~~~~~~
MinalpherCore.cpp: MinalpherCore.cpp:149:30: note: called from here
MinalpherCore.cpp:   149 |   ymm1  = _mm256_shuffle_epi8(ymmSbox, ymm1);
MinalpherCore.cpp:       |           ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~
MinalpherCore.cpp: In file included from /usr/lib/gcc/x86_64-linux-gnu/10/include/immintrin.h:53,
MinalpherCore.cpp:                  from /usr/lib/gcc/x86_64-linux-gnu/10/include/x86intrin.h:32,
MinalpherCore.cpp:                  from define.h:47,
MinalpherCore.cpp:                  from MinalpherCore.cpp:33:
MinalpherCore.cpp: ...

Number of similar (implementation,compiler) pairs: 4, namely:
ImplementationCompiler
T:avx2g++ -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:avx2g++ -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:avx2g++ -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:avx2g++ -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)

Compiler output


encrypt.c: encrypt.c: In function 'crypto_mac_gen':
encrypt.c: encrypt.c:116:7: warning: variable 'ret' set but not used [-Wunused-but-set-variable]
encrypt.c:   116 |   int ret;
encrypt.c:       |       ^~~
minalpher.c: minalpher.c: In function 'AEAD_TGEN':
minalpher.c: minalpher.c:473:25: warning: variable 'j' set but not used [-Wunused-but-set-variable]
minalpher.c:   473 |   unsigned long long i, j, blocknum;
minalpher.c:       |                         ^
minalpher.c: minalpher.c: In function 'AEAD_MGEN':
minalpher.c: minalpher.c:560:15: warning: variable 'cnt' set but not used [-Wunused-but-set-variable]
minalpher.c:   560 |   int d, pad, cnt;
minalpher.c:       |               ^~~
minalpher.c: minalpher.c:560:10: warning: variable 'pad' set but not used [-Wunused-but-set-variable]
minalpher.c:   560 |   int d, pad, cnt;
minalpher.c:       |          ^~~

Number of similar (implementation,compiler) pairs: 4, namely:
ImplementationCompiler
T:refgcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:refgcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:refgcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:refgcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)