Implementation notes: amd64, hunsnivy, crypto_kem/frodokem640aes

Computer: hunsnivy
Microarchitecture: amd64; Ivy Bridge+AES (306a9)
Architecture: amd64
CPU ID: GenuineIntel-000306a9-bfebfbff
SUPERCOP version: 20240808
Operation: crypto_kem
Primitive: frodokem640aes
Time      Object size  Test size       Implementation  Compiler                                                                        Benchmark date  SUPERCOP version
5070518   35916 42 0   54451 916 1752  T:optimized     gcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall         20240730        20240716
5299808   23686 28 0   42870 916 1720  T:optimized     clang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall  20240730        20240716
5449792   25377 28 0   46206 916 1720  T:optimized     clang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall  20240730        20240716
5565807   23699 28 0   42958 916 1720  T:optimized     clang_-mcpu=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall   20240730        20240716
8016826   9908 28 0    26444 908 1720  T:optimized     clang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall  20240730        20240716
13754972  9686 28 0    26724 908 1720  T:optimized     clang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall   20240730        20240716
18976866  10050 42 0   27635 916 1752  T:optimized     gcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall         20240730        20240716
22149998  9555 42 0    26883 916 1752  T:optimized     gcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall          20240730        20240716
25740795  8576 42 0    24811 908 1720  T:optimized     gcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall         20240730        20240716

Compiler output


frodo640.c: In file included from frodo640.c:32:
frodo640.c: ./frodo_macrify.inc:112:28: error: always_inline function '_mm256_madd_epi16' requires target feature 'avx2', but would be inlined into function 'frodo_mul_add_sa_plus_e' that is compiled without support for 'avx2'
frodo640.c:                     a[0] = _mm256_madd_epi16(a[0], b);
frodo640.c:                            ^
frodo640.c: ./frodo_macrify.inc:113:30: error: always_inline function '_mm256_add_epi16' requires target feature 'avx2', but would be inlined into function 'frodo_mul_add_sa_plus_e' that is compiled without support for 'avx2'
frodo640.c:                     acc[0] = _mm256_add_epi16(a[0], acc[0]);
frodo640.c:                              ^
frodo640.c: ./frodo_macrify.inc:115:28: error: always_inline function '_mm256_madd_epi16' requires target feature 'avx2', but would be inlined into function 'frodo_mul_add_sa_plus_e' that is compiled without support for 'avx2'
frodo640.c:                     a[1] = _mm256_madd_epi16(a[1], b);
frodo640.c:                            ^
frodo640.c: ./frodo_macrify.inc:116:30: error: always_inline function '_mm256_add_epi16' requires target feature 'avx2', but would be inlined into function 'frodo_mul_add_sa_plus_e' that is compiled without support for 'avx2'
frodo640.c:                     acc[1] = _mm256_add_epi16(a[1], acc[1]);
frodo640.c:                              ^
frodo640.c: ./frodo_macrify.inc:118:28: error: always_inline function '_mm256_madd_epi16' requires target feature 'avx2', but would be inlined into function 'frodo_mul_add_sa_plus_e' that is compiled without support for 'avx2'
frodo640.c:                     a[2] = _mm256_madd_epi16(a[2], b);
frodo640.c:                            ^
frodo640.c: ./frodo_macrify.inc:119:30: error: always_inline function '_mm256_add_epi16' requires target feature 'avx2', but would be inlined into function 'frodo_mul_add_sa_plus_e' that is compiled without support for 'avx2'
frodo640.c:                     acc[2] = _mm256_add_epi16(a[2], acc[2]);
frodo640.c:                              ^
frodo640.c: ./frodo_macrify.inc:121:28: error: always_inline function '_mm256_madd_epi16' requires target feature 'avx2', but would be inlined into function 'frodo_mul_add_sa_plus_e' that is compiled without support for 'avx2'
frodo640.c:                     a[3] = _mm256_madd_epi16(a[3], b);
frodo640.c:                            ^
frodo640.c: ./frodo_macrify.inc:122:30: error: always_inline function '_mm256_add_epi16' requires target feature 'avx2', but would be inlined into function 'frodo_mul_add_sa_plus_e' that is compiled without support for 'avx2'
frodo640.c:                     acc[3] = _mm256_add_epi16(a[3], acc[3]);
frodo640.c:                              ^
frodo640.c: ...

Number of similar (implementation,compiler) pairs: 4, namely:
Implementation  Compiler
T:x64           clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:x64           clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:x64           clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:x64           clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
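
The errors above are reported because the T:x64 implementation calls AVX2 intrinsics (_mm256_madd_epi16, _mm256_add_epi16), while -march=native on this Ivy Bridge machine does not enable AVX2. One common way to avoid this class of build failure is to guard the vector path with the compiler-defined __AVX2__ macro and keep a scalar fallback. The C sketch below only illustrates that pattern. The helper dot_u16 is hypothetical and is not taken from the FrodoKEM sources, and it uses _mm256_mullo_epi16 rather than the madd-based kernel in frodo_macrify.inc.

```c
#include <stddef.h>
#include <stdint.h>

#if defined(__AVX2__)
#include <immintrin.h>   /* only pulled in when the target supports AVX2 */
#endif

/* Hypothetical helper: dot product of two uint16_t vectors, reduced mod 2^16.
 * The AVX2 path is compiled only when the compiler defines __AVX2__
 * (for example with -mavx2, or -march=native on a CPU that has AVX2). */
uint16_t dot_u16(const uint16_t *a, const uint16_t *b, size_t n)
{
    size_t i = 0;
    uint32_t sum = 0;

#if defined(__AVX2__)
    __m256i acc = _mm256_setzero_si256();
    for (; i + 16 <= n; i += 16) {
        __m256i va = _mm256_loadu_si256((const __m256i *)(a + i));
        __m256i vb = _mm256_loadu_si256((const __m256i *)(b + i));
        /* 16 lane-wise products and sums, each wrapping mod 2^16 */
        acc = _mm256_add_epi16(acc, _mm256_mullo_epi16(va, vb));
    }
    uint16_t lanes[16];
    _mm256_storeu_si256((__m256i *)lanes, acc);
    for (size_t k = 0; k < 16; k++) {
        sum += lanes[k];
    }
#endif

    /* Scalar fallback; also handles the tail left over by the vector loop. */
    for (; i < n; i++) {
        sum += (uint32_t)a[i] * b[i];
    }
    return (uint16_t)sum;
}
```

With this structure the same file builds on machines without AVX2, such as the one in this report, and the vector loop is used only where the compiler was told the target supports it.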

Compiler output


aes_ni.c: aes_ni.c:35:16: error: '__builtin_ia32_aeskeygenassist128' needs target feature aes
aes_ni.c:         schedule[1] = key_exp(schedule[0], 0x01);
aes_ni.c:                       ^
aes_ni.c: aes_ni.c:28:40: note: expanded from macro 'key_exp'
aes_ni.c: #define key_exp(k, rcon) key_expand(k, _mm_aeskeygenassist_si128(k, rcon))
aes_ni.c:                                        ^
aes_ni.c: /usr/lib/llvm-11/lib/clang/11.0.1/include/__wmmintrin_aes.h:136:12: note: expanded from macro '_mm_aeskeygenassist_si128'
aes_ni.c:   (__m128i)__builtin_ia32_aeskeygenassist128((__v2di)(__m128i)(C), (int)(R))
aes_ni.c:            ^
aes_ni.c: aes_ni.c:36:16: error: '__builtin_ia32_aeskeygenassist128' needs target feature aes
aes_ni.c:         schedule[2] = key_exp(schedule[1], 0x02);
aes_ni.c:                       ^
aes_ni.c: aes_ni.c:28:40: note: expanded from macro 'key_exp'
aes_ni.c: #define key_exp(k, rcon) key_expand(k, _mm_aeskeygenassist_si128(k, rcon))
aes_ni.c:                                        ^
aes_ni.c: /usr/lib/llvm-11/lib/clang/11.0.1/include/__wmmintrin_aes.h:136:12: note: expanded from macro '_mm_aeskeygenassist_si128'
aes_ni.c:   (__m128i)__builtin_ia32_aeskeygenassist128((__v2di)(__m128i)(C), (int)(R))
aes_ni.c:            ^
aes_ni.c: aes_ni.c:37:16: error: '__builtin_ia32_aeskeygenassist128' needs target feature aes
aes_ni.c:         schedule[3] = key_exp(schedule[2], 0x04);
aes_ni.c:                       ^
aes_ni.c: aes_ni.c:28:40: note: expanded from macro 'key_exp'
aes_ni.c: #define key_exp(k, rcon) key_expand(k, _mm_aeskeygenassist_si128(k, rcon))
aes_ni.c:                                        ^
aes_ni.c: /usr/lib/llvm-11/lib/clang/11.0.1/include/__wmmintrin_aes.h:136:12: note: expanded from macro '_mm_aeskeygenassist_si128'
aes_ni.c: ...

Number of similar (implementation,compiler) pairs: 1, namely:
Implementation  Compiler
T:x64           clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)

Compiler output


frodo640.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/10/include/immintrin.h:53,
frodo640.c:                  from frodo_macrify.inc:8,
frodo640.c:                  from frodo640.c:32:
frodo640.c: frodo_macrify.inc: In function 'frodo_mul_add_sa_plus_e':
frodo640.c: /usr/lib/gcc/x86_64-linux-gnu/10/include/avx2intrin.h:112:1: error: inlining failed in call to 'always_inline' '_mm256_add_epi16': target specific option mismatch
frodo640.c:   112 | _mm256_add_epi16 (__m256i __A, __m256i __B)
frodo640.c:       | ^~~~~~~~~~~~~~~~
frodo640.c: In file included from frodo640.c:32:
frodo640.c: frodo_macrify.inc:122:30: note: called from here
frodo640.c:   122 |                     acc[3] = _mm256_add_epi16(a[3], acc[3]);
frodo640.c:       |                              ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
frodo640.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/10/include/immintrin.h:53,
frodo640.c:                  from frodo_macrify.inc:8,
frodo640.c:                  from frodo640.c:32:
frodo640.c: /usr/lib/gcc/x86_64-linux-gnu/10/include/avx2intrin.h:341:1: error: inlining failed in call to 'always_inline' '_mm256_madd_epi16': target specific option mismatch
frodo640.c:   341 | _mm256_madd_epi16 (__m256i __A, __m256i __B)
frodo640.c:       | ^~~~~~~~~~~~~~~~~
frodo640.c: In file included from frodo640.c:32:
frodo640.c: frodo_macrify.inc:121:28: note: called from here
frodo640.c:   121 |                     a[3] = _mm256_madd_epi16(a[3], b);
frodo640.c:       |                            ^~~~~~~~~~~~~~~~~~~~~~~~~~
frodo640.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/10/include/immintrin.h:53,
frodo640.c:                  from frodo_macrify.inc:8,
frodo640.c:                  from frodo640.c:32:
frodo640.c: /usr/lib/gcc/x86_64-linux-gnu/10/include/avx2intrin.h:112:1: error: inlining failed in call to 'always_inline' '_mm256_add_epi16': target specific option mismatch
frodo640.c: ...

Number of similar (implementation,compiler) pairs: 4, namely:
Implementation  Compiler
T:x64           gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:x64           gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:x64           gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:x64           gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)

Namespace violations


aes.o AES128_free_schedule T
aes.o AES256_free_schedule T
aes.o handleErrors T
fips202.o KeccakF1600_StatePermute T
fips202.o shake128 T
fips202.o shake128_absorb T
fips202.o shake128_squeezeblocks T
fips202.o shake256 T
fips202.o shake256_absorb T
fips202.o shake256_squeezeblocks T
frodo640.o CDF_TABLE D
frodo640.o CDF_TABLE_LEN D
frodo640.o frodo_add T
frodo640.o frodo_key_decode T
frodo640.o frodo_key_encode T
frodo640.o frodo_mul_add_as_plus_e T
frodo640.o frodo_mul_add_sa_plus_e T
frodo640.o frodo_mul_add_sb_plus_e T
frodo640.o frodo_mul_bs T
frodo640.o frodo_sample_n T
frodo640.o frodo_sub T
util.o clear_bytes T
util.o frodo_pack T
util.o frodo_unpack T

Number of similar (implementation,compiler) pairs: 9, namely:
Implementation  Compiler
T:optimized     clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:optimized     clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:optimized     clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:optimized     clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:optimized     clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Debian_Clang_11.0.1)
T:optimized     gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:optimized     gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:optimized     gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
T:optimized     gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (10.2.1_20210110)
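
Each entry in the namespace list is a global symbol in the named object file (type T for code, D for initialized data), presumably flagged because its name carries no implementation-specific prefix and could therefore collide with another library at link time. One common remedy is to rename such symbols through a macro, or to declare file-local helpers static. The C sketch below shows the macro approach under stated assumptions: EXAMPLE_NAMESPACE and the prefix example_frodokem640aes_ are hypothetical names chosen for illustration, not macros or prefixes defined by SUPERCOP or by the FrodoKEM sources, and the body of clear_bytes is a stand-in rather than the code in util.c.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical namespacing macro: pastes an implementation-specific
 * prefix onto a symbol name at preprocessing time. */
#define EXAMPLE_NAMESPACE(name) example_frodokem640aes_##name

/* Source code keeps using the short name; the object file exports
 * example_frodokem640aes_clear_bytes instead of clear_bytes. */
#define clear_bytes EXAMPLE_NAMESPACE(clear_bytes)

/* Stand-in body: zero n bytes through a volatile pointer so the
 * stores are not optimized away. */
void clear_bytes(uint8_t *mem, size_t n)
{
    volatile uint8_t *v = mem;
    for (size_t i = 0; i < n; i++) {
        v[i] = 0;
    }
}
```

After such a rename, a symbol listing of the object file would show the prefixed name, while callers that include the same header continue to write clear_bytes(...).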