Implementation notes: amd64, hydra8, crypto_kem/frodokem976aes

Computer: hydra8
Microarchitecture: amd64; Ivy Bridge+AES (306a9)
Architecture: amd64
CPU ID: GenuineIntel-000306a9-bfebfbff
SUPERCOP version: 20240625
Operation: crypto_kem
Primitive: frodokem976aes
Time      Object size  Test size       Implementation  Compiler                                                                        Benchmark date  SUPERCOP version
9828261   35705 38 0   54814 932 1760  T:optimized     gcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall         20240629        20240625
10066170  37234 24 0   56098 956 1728  T:optimized     clang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall  20240629        20240625
10327161  22525 24 0   43386 956 1728  T:optimized     clang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall  20240629        20240625
10605848  20665 24 0   40194 956 1728  T:optimized     clang_-mcpu=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall   20240629        20240625
14152837  9591 24 0    26356 948 1728  T:optimized     clang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall  20240629        20240625
31134814  12438 24 0   29658 956 1728  T:optimized     clang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall   20240629        20240625
41187819  10922 38 0   28862 932 1760  T:optimized     gcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall         20240629        20240625
48071390  10429 38 0   28014 932 1760  T:optimized     gcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall          20240629        20240625
56370927  9406 38 0    25902 924 1728  T:optimized     gcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall         20240629        20240625

Compiler output


frodo976.c: In file included from frodo976.c:32:
frodo976.c: ./frodo_macrify.inc:112:28: error: always_inline function '_mm256_madd_epi16' requires target feature 'avx2', but would be inlined into function 'frodo_mul_add_sa_plus_e' that is compiled without support for 'avx2'
frodo976.c:                     a[0] = _mm256_madd_epi16(a[0], b);
frodo976.c:                            ^
frodo976.c: ./frodo_macrify.inc:113:30: error: always_inline function '_mm256_add_epi16' requires target feature 'avx2', but would be inlined into function 'frodo_mul_add_sa_plus_e' that is compiled without support for 'avx2'
frodo976.c:                     acc[0] = _mm256_add_epi16(a[0], acc[0]);
frodo976.c:                              ^
frodo976.c: ./frodo_macrify.inc:115:28: error: always_inline function '_mm256_madd_epi16' requires target feature 'avx2', but would be inlined into function 'frodo_mul_add_sa_plus_e' that is compiled without support for 'avx2'
frodo976.c:                     a[1] = _mm256_madd_epi16(a[1], b);
frodo976.c:                            ^
frodo976.c: ./frodo_macrify.inc:116:30: error: always_inline function '_mm256_add_epi16' requires target feature 'avx2', but would be inlined into function 'frodo_mul_add_sa_plus_e' that is compiled without support for 'avx2'
frodo976.c:                     acc[1] = _mm256_add_epi16(a[1], acc[1]);
frodo976.c:                              ^
frodo976.c: ./frodo_macrify.inc:118:28: error: always_inline function '_mm256_madd_epi16' requires target feature 'avx2', but would be inlined into function 'frodo_mul_add_sa_plus_e' that is compiled without support for 'avx2'
frodo976.c:                     a[2] = _mm256_madd_epi16(a[2], b);
frodo976.c:                            ^
frodo976.c: ./frodo_macrify.inc:119:30: error: always_inline function '_mm256_add_epi16' requires target feature 'avx2', but would be inlined into function 'frodo_mul_add_sa_plus_e' that is compiled without support for 'avx2'
frodo976.c:                     acc[2] = _mm256_add_epi16(a[2], acc[2]);
frodo976.c:                              ^
frodo976.c: ./frodo_macrify.inc:121:28: error: always_inline function '_mm256_madd_epi16' requires target feature 'avx2', but would be inlined into function 'frodo_mul_add_sa_plus_e' that is compiled without support for 'avx2'
frodo976.c:                     a[3] = _mm256_madd_epi16(a[3], b);
frodo976.c:                            ^
frodo976.c: ./frodo_macrify.inc:122:30: error: always_inline function '_mm256_add_epi16' requires target feature 'avx2', but would be inlined into function 'frodo_mul_add_sa_plus_e' that is compiled without support for 'avx2'
frodo976.c:                     acc[3] = _mm256_add_epi16(a[3], acc[3]);
frodo976.c:                              ^
frodo976.c: ...

Number of similar (implementation,compiler) pairs: 4, namely:
Implementation  Compiler
T:x64           clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:x64           clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:x64           clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:x64           clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
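
Note on the errors above: frodo_macrify.inc calls the AVX2 intrinsics _mm256_madd_epi16 and _mm256_add_epi16, but hydra8 is an Ivy Bridge machine, which has no AVX2, so -march=native does not enable that target feature. One common way to keep such code compiling everywhere is to guard the vector path with the predefined macro __AVX2__ and provide a scalar fallback. The following is only a minimal sketch of that pattern; the helper dot_i16 is hypothetical and is not taken from the frodokem976aes sources.

```c
#include <stddef.h>
#include <stdint.h>

#if defined(__AVX2__)
#include <immintrin.h>
#endif

/* Hypothetical helper (not from the frodokem976aes sources):
   returns the sum of a[i]*b[i] for i < n, reduced modulo 2^32. */
uint32_t dot_i16(const int16_t *a, const int16_t *b, size_t n)
{
    uint32_t sum = 0;
    size_t i = 0;

#if defined(__AVX2__)
    /* Vector path: only compiled when the AVX2 target feature is enabled. */
    __m256i acc = _mm256_setzero_si256();
    for (; i + 16 <= n; i += 16) {
        __m256i va = _mm256_loadu_si256((const __m256i *)(a + i));
        __m256i vb = _mm256_loadu_si256((const __m256i *)(b + i));
        /* madd multiplies 16-bit pairs and adds neighbours into 32-bit lanes. */
        acc = _mm256_add_epi32(acc, _mm256_madd_epi16(va, vb));
    }
    uint32_t lanes[8];
    _mm256_storeu_si256((__m256i *)lanes, acc);
    for (int k = 0; k < 8; k++)
        sum += lanes[k];
#endif

    /* Scalar path: handles the tail, or everything when AVX2 is absent. */
    for (; i < n; i++)
        sum += (uint32_t)((int32_t)a[i] * (int32_t)b[i]);
    return sum;
}
```

With this structure the same file builds on both AVX2 and non-AVX2 hosts; the trade-off is that the choice is made at compile time, not at run time.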

Compiler output


aes_ni.c: aes_ni.c:35:16: error: '__builtin_ia32_aeskeygenassist128' needs target feature aes
aes_ni.c:         schedule[1] = key_exp(schedule[0], 0x01);
aes_ni.c:                       ^
aes_ni.c: aes_ni.c:28:40: note: expanded from macro 'key_exp'
aes_ni.c: #define key_exp(k, rcon) key_expand(k, _mm_aeskeygenassist_si128(k, rcon))
aes_ni.c:                                        ^
aes_ni.c: /usr/lib/llvm-14/lib/clang/14.0.0/include/__wmmintrin_aes.h:136:13: note: expanded from macro '_mm_aeskeygenassist_si128'
aes_ni.c:   ((__m128i)__builtin_ia32_aeskeygenassist128((__v2di)(__m128i)(C), (int)(R)))
aes_ni.c:             ^
aes_ni.c: aes_ni.c:36:16: error: '__builtin_ia32_aeskeygenassist128' needs target feature aes
aes_ni.c:         schedule[2] = key_exp(schedule[1], 0x02);
aes_ni.c:                       ^
aes_ni.c: aes_ni.c:28:40: note: expanded from macro 'key_exp'
aes_ni.c: #define key_exp(k, rcon) key_expand(k, _mm_aeskeygenassist_si128(k, rcon))
aes_ni.c:                                        ^
aes_ni.c: /usr/lib/llvm-14/lib/clang/14.0.0/include/__wmmintrin_aes.h:136:13: note: expanded from macro '_mm_aeskeygenassist_si128'
aes_ni.c:   ((__m128i)__builtin_ia32_aeskeygenassist128((__v2di)(__m128i)(C), (int)(R)))
aes_ni.c:             ^
aes_ni.c: aes_ni.c:37:16: error: '__builtin_ia32_aeskeygenassist128' needs target feature aes
aes_ni.c:         schedule[3] = key_exp(schedule[2], 0x04);
aes_ni.c:                       ^
aes_ni.c: aes_ni.c:28:40: note: expanded from macro 'key_exp'
aes_ni.c: #define key_exp(k, rcon) key_expand(k, _mm_aeskeygenassist_si128(k, rcon))
aes_ni.c:                                        ^
aes_ni.c: /usr/lib/llvm-14/lib/clang/14.0.0/include/__wmmintrin_aes.h:136:13: note: expanded from macro '_mm_aeskeygenassist_si128'
aes_ni.c: ...

Number of similar (implementation,compiler) pairs: 1, namely:
Implementation  Compiler
T:x64           clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
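
Note on the errors above: aes_ni.c uses the AES-NI builtin behind _mm_aeskeygenassist_si128, and the diagnostics say the aes target feature was not enabled for this build. A hedged sketch of one way to make such code degrade gracefully is to test the predefined macro __AES__, which GCC and Clang define when that feature is enabled. The function aes_round_if_available is hypothetical and is not part of the benchmarked implementation.

```c
#include <stdint.h>

#if defined(__AES__)
#include <immintrin.h>
#endif

/* Hypothetical helper (not from aes_ni.c): performs one AES encryption
   round with AES-NI when the compiler has the aes feature enabled.
   Returns 1 if the round was computed, 0 if the caller must fall back
   to a portable AES implementation. */
int aes_round_if_available(uint8_t out[16], const uint8_t in[16],
                           const uint8_t round_key[16])
{
#if defined(__AES__)
    __m128i state = _mm_loadu_si128((const __m128i *)in);
    __m128i key = _mm_loadu_si128((const __m128i *)round_key);
    _mm_storeu_si128((__m128i *)out, _mm_aesenc_si128(state, key));
    return 1;
#else
    (void)out;
    (void)in;
    (void)round_key;
    return 0;
#endif
}
```

A build without the aes feature then compiles cleanly and reports at the call site that the accelerated path is unavailable.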

Compiler output


frodo976.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/immintrin.h:47,
frodo976.c:                  from frodo_macrify.inc:8,
frodo976.c:                  from frodo976.c:32:
frodo976.c: frodo_macrify.inc: In function 'frodo_mul_add_sa_plus_e':
frodo976.c: /usr/lib/gcc/x86_64-linux-gnu/11/include/avx2intrin.h:112:1: error: inlining failed in call to 'always_inline' '_mm256_add_epi16': target specific option mismatch
frodo976.c:   112 | _mm256_add_epi16 (__m256i __A, __m256i __B)
frodo976.c:       | ^~~~~~~~~~~~~~~~
frodo976.c: In file included from frodo976.c:32:
frodo976.c: frodo_macrify.inc:122:30: note: called from here
frodo976.c:   122 |                     acc[3] = _mm256_add_epi16(a[3], acc[3]);
frodo976.c:       |                              ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
frodo976.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/immintrin.h:47,
frodo976.c:                  from frodo_macrify.inc:8,
frodo976.c:                  from frodo976.c:32:
frodo976.c: /usr/lib/gcc/x86_64-linux-gnu/11/include/avx2intrin.h:341:1: error: inlining failed in call to 'always_inline' '_mm256_madd_epi16': target specific option mismatch
frodo976.c:   341 | _mm256_madd_epi16 (__m256i __A, __m256i __B)
frodo976.c:       | ^~~~~~~~~~~~~~~~~
frodo976.c: In file included from frodo976.c:32:
frodo976.c: frodo_macrify.inc:121:28: note: called from here
frodo976.c:   121 |                     a[3] = _mm256_madd_epi16(a[3], b);
frodo976.c:       |                            ^~~~~~~~~~~~~~~~~~~~~~~~~~
frodo976.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/immintrin.h:47,
frodo976.c:                  from frodo_macrify.inc:8,
frodo976.c:                  from frodo976.c:32:
frodo976.c: /usr/lib/gcc/x86_64-linux-gnu/11/include/avx2intrin.h:112:1: error: inlining failed in call to 'always_inline' '_mm256_add_epi16': target specific option mismatch
frodo976.c: ...

Number of similar (implementation,compiler) pairs: 4, namely:
Implementation  Compiler
T:x64           gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:x64           gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:x64           gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:x64           gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)