Implementation notes: amd64, hydra8, crypto_kem/frodokem1344aes

Computer: hydra8
Microarchitecture: amd64; Ivy Bridge+AES (306a9)
Architecture: amd64
CPU ID: GenuineIntel-000306a9-bfebfbff
SUPERCOP version: 20240716
Operation: crypto_kem
Primitive: frodokem1344aes
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
1663041929629 16 048546 940 1728T:optimizedclang_-march=native_-O2_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024072020240716
1687370634648 22 053790 908 1760T:optimizedgcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024072020240716
1701174822518 16 043394 940 1728T:optimizedclang_-march=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024072020240716
1732815220674 16 040202 940 1728T:optimizedclang_-mcpu=native_-O3_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024072020240716
270076389660 16 026452 932 1728T:optimizedclang_-march=native_-Os_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024072020240716
5098603012486 16 029722 940 1728T:optimizedclang_-march=native_-O_-fwrapv_-Qunused-arguments_-fPIC_-fPIE_-gdwarf-4_-Wall2024072020240716
7637846410894 22 028854 908 1760T:optimizedgcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024072020240716
8916016410470 22 028062 908 1760T:optimizedgcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024072020240716
1042347009438 22 025894 900 1728T:optimizedgcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024072020240716

Compiler output


frodo1344.c: In file included from frodo1344.c:33:
frodo1344.c: ./frodo_macrify.inc:112:28: error: always_inline function '_mm256_madd_epi16' requires target feature 'avx2', but would be inlined into function 'frodo_mul_add_sa_plus_e' that is compiled without support for 'avx2'
frodo1344.c:                     a[0] = _mm256_madd_epi16(a[0], b);
frodo1344.c:                            ^
frodo1344.c: ./frodo_macrify.inc:113:30: error: always_inline function '_mm256_add_epi16' requires target feature 'avx2', but would be inlined into function 'frodo_mul_add_sa_plus_e' that is compiled without support for 'avx2'
frodo1344.c:                     acc[0] = _mm256_add_epi16(a[0], acc[0]);
frodo1344.c:                              ^
frodo1344.c: ./frodo_macrify.inc:115:28: error: always_inline function '_mm256_madd_epi16' requires target feature 'avx2', but would be inlined into function 'frodo_mul_add_sa_plus_e' that is compiled without support for 'avx2'
frodo1344.c:                     a[1] = _mm256_madd_epi16(a[1], b);
frodo1344.c:                            ^
frodo1344.c: ./frodo_macrify.inc:116:30: error: always_inline function '_mm256_add_epi16' requires target feature 'avx2', but would be inlined into function 'frodo_mul_add_sa_plus_e' that is compiled without support for 'avx2'
frodo1344.c:                     acc[1] = _mm256_add_epi16(a[1], acc[1]);
frodo1344.c:                              ^
frodo1344.c: ./frodo_macrify.inc:118:28: error: always_inline function '_mm256_madd_epi16' requires target feature 'avx2', but would be inlined into function 'frodo_mul_add_sa_plus_e' that is compiled without support for 'avx2'
frodo1344.c:                     a[2] = _mm256_madd_epi16(a[2], b);
frodo1344.c:                            ^
frodo1344.c: ./frodo_macrify.inc:119:30: error: always_inline function '_mm256_add_epi16' requires target feature 'avx2', but would be inlined into function 'frodo_mul_add_sa_plus_e' that is compiled without support for 'avx2'
frodo1344.c:                     acc[2] = _mm256_add_epi16(a[2], acc[2]);
frodo1344.c:                              ^
frodo1344.c: ./frodo_macrify.inc:121:28: error: always_inline function '_mm256_madd_epi16' requires target feature 'avx2', but would be inlined into function 'frodo_mul_add_sa_plus_e' that is compiled without support for 'avx2'
frodo1344.c:                     a[3] = _mm256_madd_epi16(a[3], b);
frodo1344.c:                            ^
frodo1344.c: ./frodo_macrify.inc:122:30: error: always_inline function '_mm256_add_epi16' requires target feature 'avx2', but would be inlined into function 'frodo_mul_add_sa_plus_e' that is compiled without support for 'avx2'
frodo1344.c:                     acc[3] = _mm256_add_epi16(a[3], acc[3]);
frodo1344.c:                              ^
frodo1344.c: ...

Number of similar (implementation,compiler) pairs: 4, namely:
ImplementationCompiler
T:x64clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:x64clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:x64clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:x64clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)

Compiler output


aes_ni.c: aes_ni.c:35:16: error: '__builtin_ia32_aeskeygenassist128' needs target feature aes
aes_ni.c:         schedule[1] = key_exp(schedule[0], 0x01);
aes_ni.c:                       ^
aes_ni.c: aes_ni.c:28:40: note: expanded from macro 'key_exp'
aes_ni.c: #define key_exp(k, rcon) key_expand(k, _mm_aeskeygenassist_si128(k, rcon))
aes_ni.c:                                        ^
aes_ni.c: /usr/lib/llvm-14/lib/clang/14.0.0/include/__wmmintrin_aes.h:136:13: note: expanded from macro '_mm_aeskeygenassist_si128'
aes_ni.c:   ((__m128i)__builtin_ia32_aeskeygenassist128((__v2di)(__m128i)(C), (int)(R)))
aes_ni.c:             ^
aes_ni.c: aes_ni.c:36:16: error: '__builtin_ia32_aeskeygenassist128' needs target feature aes
aes_ni.c:         schedule[2] = key_exp(schedule[1], 0x02);
aes_ni.c:                       ^
aes_ni.c: aes_ni.c:28:40: note: expanded from macro 'key_exp'
aes_ni.c: #define key_exp(k, rcon) key_expand(k, _mm_aeskeygenassist_si128(k, rcon))
aes_ni.c:                                        ^
aes_ni.c: /usr/lib/llvm-14/lib/clang/14.0.0/include/__wmmintrin_aes.h:136:13: note: expanded from macro '_mm_aeskeygenassist_si128'
aes_ni.c:   ((__m128i)__builtin_ia32_aeskeygenassist128((__v2di)(__m128i)(C), (int)(R)))
aes_ni.c:             ^
aes_ni.c: aes_ni.c:37:16: error: '__builtin_ia32_aeskeygenassist128' needs target feature aes
aes_ni.c:         schedule[3] = key_exp(schedule[2], 0x04);
aes_ni.c:                       ^
aes_ni.c: aes_ni.c:28:40: note: expanded from macro 'key_exp'
aes_ni.c: #define key_exp(k, rcon) key_expand(k, _mm_aeskeygenassist_si128(k, rcon))
aes_ni.c:                                        ^
aes_ni.c: /usr/lib/llvm-14/lib/clang/14.0.0/include/__wmmintrin_aes.h:136:13: note: expanded from macro '_mm_aeskeygenassist_si128'
aes_ni.c: ...

Number of similar (implementation,compiler) pairs: 1, namely:
ImplementationCompiler
T:x64clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)

Compiler output


frodo1344.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/immintrin.h:47,
frodo1344.c:                  from frodo_macrify.inc:8,
frodo1344.c:                  from frodo1344.c:33:
frodo1344.c: frodo_macrify.inc: In function 'frodo_mul_add_sa_plus_e':
frodo1344.c: /usr/lib/gcc/x86_64-linux-gnu/11/include/avx2intrin.h:112:1: error: inlining failed in call to 'always_inline' '_mm256_add_epi16': target specific option mismatch
frodo1344.c:   112 | _mm256_add_epi16 (__m256i __A, __m256i __B)
frodo1344.c:       | ^~~~~~~~~~~~~~~~
frodo1344.c: In file included from frodo1344.c:33:
frodo1344.c: frodo_macrify.inc:122:30: note: called from here
frodo1344.c:   122 |                     acc[3] = _mm256_add_epi16(a[3], acc[3]);
frodo1344.c:       |                              ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
frodo1344.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/immintrin.h:47,
frodo1344.c:                  from frodo_macrify.inc:8,
frodo1344.c:                  from frodo1344.c:33:
frodo1344.c: /usr/lib/gcc/x86_64-linux-gnu/11/include/avx2intrin.h:341:1: error: inlining failed in call to 'always_inline' '_mm256_madd_epi16': target specific option mismatch
frodo1344.c:   341 | _mm256_madd_epi16 (__m256i __A, __m256i __B)
frodo1344.c:       | ^~~~~~~~~~~~~~~~~
frodo1344.c: In file included from frodo1344.c:33:
frodo1344.c: frodo_macrify.inc:121:28: note: called from here
frodo1344.c:   121 |                     a[3] = _mm256_madd_epi16(a[3], b);
frodo1344.c:       |                            ^~~~~~~~~~~~~~~~~~~~~~~~~~
frodo1344.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/immintrin.h:47,
frodo1344.c:                  from frodo_macrify.inc:8,
frodo1344.c:                  from frodo1344.c:33:
frodo1344.c: /usr/lib/gcc/x86_64-linux-gnu/11/include/avx2intrin.h:112:1: error: inlining failed in call to 'always_inline' '_mm256_add_epi16': target specific option mismatch
frodo1344.c: ...

Number of similar (implementation,compiler) pairs: 4, namely:
ImplementationCompiler
T:x64gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:x64gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:x64gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:x64gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)

Namespace violations


aes.o AES128_free_schedule T
aes.o AES256_free_schedule T
aes.o handleErrors T
fips202.o KeccakF1600_StatePermute T
fips202.o shake128 T
fips202.o shake128_absorb T
fips202.o shake128_squeezeblocks T
fips202.o shake256 T
fips202.o shake256_absorb T
fips202.o shake256_squeezeblocks T
frodo1344.o CDF_TABLE D
frodo1344.o CDF_TABLE_LEN D
frodo1344.o frodo_add T
frodo1344.o frodo_key_decode T
frodo1344.o frodo_key_encode T
frodo1344.o frodo_mul_add_as_plus_e T
frodo1344.o frodo_mul_add_sa_plus_e T
frodo1344.o frodo_mul_add_sb_plus_e T
frodo1344.o frodo_mul_bs T
frodo1344.o frodo_sample_n T
frodo1344.o frodo_sub T
util.o clear_bytes T
util.o frodo_pack T
util.o frodo_unpack T

Number of similar (implementation,compiler) pairs: 9, namely:
ImplementationCompiler
T:optimizedclang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:optimizedclang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:optimizedclang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:optimizedclang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:optimizedclang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_14.0.0)
T:optimizedgcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:optimizedgcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:optimizedgcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:optimizedgcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)