Implementation notes: amd64, hunsnivy, crypto_kem/frodokem976aes

Computer: hunsnivy
Microarchitecture: amd64; Ivy Bridge+AES (306a9)
Architecture: amd64
CPU ID: GenuineIntel-000306a9-bfebfbff
SUPERCOP version: 20240808
Operation: crypto_kem
Primitive: frodokem976aes
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
984398635046 38 053595 908 1752T:optimizedgcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024073020240716
1051959538426 24 057662 908 1720T:optimizedclang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024073020240716
1066973623677 24 044606 908 1720T:optimizedclang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024073020240716
1071322122255 24 041662 908 1720T:optimizedclang_-mcpu=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024073020240716
157970569506 24 026156 900 1720T:optimizedclang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024073020240716
322363779718 24 026852 900 1720T:optimizedclang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024073020240716
4117403010174 38 027827 908 1752T:optimizedgcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024073020240716
483036609638 38 026955 908 1752T:optimizedgcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024073020240716
563174858674 38 024875 900 1720T:optimizedgcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024073020240716

Compiler output


frodo976.c: In file included from frodo976.c:32:
frodo976.c: ./frodo_macrify.inc:112:28: error: always_inline function '_mm256_madd_epi16' requires target feature 'avx2', but would be inlined into function 'frodo_mul_add_sa_plus_e' that is compiled without support for 'avx2'
frodo976.c:                     a[0] = _mm256_madd_epi16(a[0], b);
frodo976.c:                            ^
frodo976.c: ./frodo_macrify.inc:113:30: error: always_inline function '_mm256_add_epi16' requires target feature 'avx2', but would be inlined into function 'frodo_mul_add_sa_plus_e' that is compiled without support for 'avx2'
frodo976.c:                     acc[0] = _mm256_add_epi16(a[0], acc[0]);
frodo976.c:                              ^
frodo976.c: ./frodo_macrify.inc:115:28: error: always_inline function '_mm256_madd_epi16' requires target feature 'avx2', but would be inlined into function 'frodo_mul_add_sa_plus_e' that is compiled without support for 'avx2'
frodo976.c:                     a[1] = _mm256_madd_epi16(a[1], b);
frodo976.c:                            ^
frodo976.c: ./frodo_macrify.inc:116:30: error: always_inline function '_mm256_add_epi16' requires target feature 'avx2', but would be inlined into function 'frodo_mul_add_sa_plus_e' that is compiled without support for 'avx2'
frodo976.c:                     acc[1] = _mm256_add_epi16(a[1], acc[1]);
frodo976.c:                              ^
frodo976.c: ./frodo_macrify.inc:118:28: error: always_inline function '_mm256_madd_epi16' requires target feature 'avx2', but would be inlined into function 'frodo_mul_add_sa_plus_e' that is compiled without support for 'avx2'
frodo976.c:                     a[2] = _mm256_madd_epi16(a[2], b);
frodo976.c:                            ^
frodo976.c: ./frodo_macrify.inc:119:30: error: always_inline function '_mm256_add_epi16' requires target feature 'avx2', but would be inlined into function 'frodo_mul_add_sa_plus_e' that is compiled without support for 'avx2'
frodo976.c:                     acc[2] = _mm256_add_epi16(a[2], acc[2]);
frodo976.c:                              ^
frodo976.c: ./frodo_macrify.inc:121:28: error: always_inline function '_mm256_madd_epi16' requires target feature 'avx2', but would be inlined into function 'frodo_mul_add_sa_plus_e' that is compiled without support for 'avx2'
frodo976.c:                     a[3] = _mm256_madd_epi16(a[3], b);
frodo976.c:                            ^
frodo976.c: ./frodo_macrify.inc:122:30: error: always_inline function '_mm256_add_epi16' requires target feature 'avx2', but would be inlined into function 'frodo_mul_add_sa_plus_e' that is compiled without support for 'avx2'
frodo976.c:                     acc[3] = _mm256_add_epi16(a[3], acc[3]);
frodo976.c:                              ^
frodo976.c: ...

Number of similar (implementation,compiler) pairs: 4, namely:
ImplementationCompiler
T:x64clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:x64clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:x64clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:x64clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)

Compiler output


aes_ni.c: aes_ni.c:35:16: error: '__builtin_ia32_aeskeygenassist128' needs target feature aes
aes_ni.c:         schedule[1] = key_exp(schedule[0], 0x01);
aes_ni.c:                       ^
aes_ni.c: aes_ni.c:28:40: note: expanded from macro 'key_exp'
aes_ni.c: #define key_exp(k, rcon) key_expand(k, _mm_aeskeygenassist_si128(k, rcon))
aes_ni.c:                                        ^
aes_ni.c: /usr/lib/llvm-11/lib/clang/11.0.1/include/__wmmintrin_aes.h:136:12: note: expanded from macro '_mm_aeskeygenassist_si128'
aes_ni.c:   (__m128i)__builtin_ia32_aeskeygenassist128((__v2di)(__m128i)(C), (int)(R))
aes_ni.c:            ^
aes_ni.c: aes_ni.c:36:16: error: '__builtin_ia32_aeskeygenassist128' needs target feature aes
aes_ni.c:         schedule[2] = key_exp(schedule[1], 0x02);
aes_ni.c:                       ^
aes_ni.c: aes_ni.c:28:40: note: expanded from macro 'key_exp'
aes_ni.c: #define key_exp(k, rcon) key_expand(k, _mm_aeskeygenassist_si128(k, rcon))
aes_ni.c:                                        ^
aes_ni.c: /usr/lib/llvm-11/lib/clang/11.0.1/include/__wmmintrin_aes.h:136:12: note: expanded from macro '_mm_aeskeygenassist_si128'
aes_ni.c:   (__m128i)__builtin_ia32_aeskeygenassist128((__v2di)(__m128i)(C), (int)(R))
aes_ni.c:            ^
aes_ni.c: aes_ni.c:37:16: error: '__builtin_ia32_aeskeygenassist128' needs target feature aes
aes_ni.c:         schedule[3] = key_exp(schedule[2], 0x04);
aes_ni.c:                       ^
aes_ni.c: aes_ni.c:28:40: note: expanded from macro 'key_exp'
aes_ni.c: #define key_exp(k, rcon) key_expand(k, _mm_aeskeygenassist_si128(k, rcon))
aes_ni.c:                                        ^
aes_ni.c: /usr/lib/llvm-11/lib/clang/11.0.1/include/__wmmintrin_aes.h:136:12: note: expanded from macro '_mm_aeskeygenassist_si128'
aes_ni.c: ...

Number of similar (implementation,compiler) pairs: 1, namely:
ImplementationCompiler
T:x64clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)

Compiler output


frodo976.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/10/include/immintrin.h:53,
frodo976.c:                  from frodo_macrify.inc:8,
frodo976.c:                  from frodo976.c:32:
frodo976.c: frodo_macrify.inc: In function 'frodo_mul_add_sa_plus_e':
frodo976.c: /usr/lib/gcc/x86_64-linux-gnu/10/include/avx2intrin.h:112:1: error: inlining failed in call to 'always_inline' '_mm256_add_epi16': target specific option mismatch
frodo976.c:   112 | _mm256_add_epi16 (__m256i __A, __m256i __B)
frodo976.c:       | ^~~~~~~~~~~~~~~~
frodo976.c: In file included from frodo976.c:32:
frodo976.c: frodo_macrify.inc:122:30: note: called from here
frodo976.c:   122 |                     acc[3] = _mm256_add_epi16(a[3], acc[3]);
frodo976.c:       |                              ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
frodo976.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/10/include/immintrin.h:53,
frodo976.c:                  from frodo_macrify.inc:8,
frodo976.c:                  from frodo976.c:32:
frodo976.c: /usr/lib/gcc/x86_64-linux-gnu/10/include/avx2intrin.h:341:1: error: inlining failed in call to 'always_inline' '_mm256_madd_epi16': target specific option mismatch
frodo976.c:   341 | _mm256_madd_epi16 (__m256i __A, __m256i __B)
frodo976.c:       | ^~~~~~~~~~~~~~~~~
frodo976.c: In file included from frodo976.c:32:
frodo976.c: frodo_macrify.inc:121:28: note: called from here
frodo976.c:   121 |                     a[3] = _mm256_madd_epi16(a[3], b);
frodo976.c:       |                            ^~~~~~~~~~~~~~~~~~~~~~~~~~
frodo976.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/10/include/immintrin.h:53,
frodo976.c:                  from frodo_macrify.inc:8,
frodo976.c:                  from frodo976.c:32:
frodo976.c: /usr/lib/gcc/x86_64-linux-gnu/10/include/avx2intrin.h:112:1: error: inlining failed in call to 'always_inline' '_mm256_add_epi16': target specific option mismatch
frodo976.c: ...

Number of similar (implementation,compiler) pairs: 4, namely:
ImplementationCompiler
T:x64gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:x64gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:x64gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:x64gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)

Namespace violations


aes.o AES128_free_schedule T
aes.o AES256_free_schedule T
aes.o handleErrors T
fips202.o KeccakF1600_StatePermute T
fips202.o shake128 T
fips202.o shake128_absorb T
fips202.o shake128_squeezeblocks T
fips202.o shake256 T
fips202.o shake256_absorb T
fips202.o shake256_squeezeblocks T
frodo976.o CDF_TABLE D
frodo976.o CDF_TABLE_LEN D
frodo976.o frodo_add T
frodo976.o frodo_key_decode T
frodo976.o frodo_key_encode T
frodo976.o frodo_mul_add_as_plus_e T
frodo976.o frodo_mul_add_sa_plus_e T
frodo976.o frodo_mul_add_sb_plus_e T
frodo976.o frodo_mul_bs T
frodo976.o frodo_sample_n T
frodo976.o frodo_sub T
util.o clear_bytes T
util.o frodo_pack T
util.o frodo_unpack T

Number of similar (implementation,compiler) pairs: 9, namely:
ImplementationCompiler
T:optimizedclang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:optimizedclang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:optimizedclang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:optimizedclang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:optimizedclang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:optimizedgcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:optimizedgcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:optimizedgcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:optimizedgcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)