Implementation notes: amd64, hydra7, crypto_core/mult3sntrup761

Computer: hydra7
Microarchitecture: amd64; Sandy Bridge+AES (206a7)
Architecture: amd64
CPU ID: GenuineIntel-000206a7-bfebfbff
SUPERCOP version: 20240625
Operation: crypto_core
Primitive: mult3sntrup761
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
1876383759 0 017789 804 99232gcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062620240625
208028996 0 013813 804 99232gcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062620240625
2208081103 0 013588 796 99232gcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062620240625
238344921 0 012240 780 96032gcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062620240625
2704294208 0 018245 804 992refgcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062620240625
1680228654 0 013469 804 992refgcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062620240625
2396352705 0 013180 796 992refgcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062620240625
2450316556 0 011872 780 960refgcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024062620240625

Compiler output


mult768.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/immintrin.h:47,
mult768.c:                  from mult768.c:2:
mult768.c: mult768.c: In function 'squeeze_3_x16':
mult768.c: /usr/lib/gcc/x86_64-linux-gnu/11/include/avx2intrin.h:808:1: error: inlining failed in call to 'always_inline' '_mm256_sub_epi16': target specific option mismatch
mult768.c:   808 | _mm256_sub_epi16 (__m256i __A, __m256i __B)
mult768.c:       | ^~~~~~~~~~~~~~~~
mult768.c: mult768.c:12:17: note: called from here
mult768.c:    12 | #define sub_x16 _mm256_sub_epi16
mult768.c:       |                 ^
mult768.c: mult768.c:20:10: note: in expansion of macro 'sub_x16'
mult768.c:    20 |   return sub_x16(x,mullo_x16(mulhrs_x16(x,const_x16(10923)),const_x16(3)));
mult768.c:       |          ^~~~~~~
mult768.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/immintrin.h:47,
mult768.c:                  from mult768.c:2:
mult768.c: /usr/lib/gcc/x86_64-linux-gnu/11/include/avx2intrin.h:553:1: error: inlining failed in call to 'always_inline' '_mm256_mullo_epi16': target specific option mismatch
mult768.c:   553 | _mm256_mullo_epi16 (__m256i __A, __m256i __B)
mult768.c:       | ^~~~~~~~~~~~~~~~~~
mult768.c: mult768.c:12:17: note: called from here
mult768.c:    12 | #define sub_x16 _mm256_sub_epi16
mult768.c:       |                 ^
mult768.c: mult768.c:20:10: note: in expansion of macro 'sub_x16'
mult768.c:    20 |   return sub_x16(x,mullo_x16(mulhrs_x16(x,const_x16(10923)),const_x16(3)));
mult768.c:       |          ^~~~~~~
mult768.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/immintrin.h:47,
mult768.c:                  from mult768.c:2:
mult768.c: ...

Number of similar (implementation,compiler) pairs: 12, namely:
ImplementationCompiler
avxgcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
avxgcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
avxgcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
avxgcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
avx800gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
avx800gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
avx800gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
avx800gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
round2gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
round2gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
round2gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
round2gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)

Compiler output


mult.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/immintrin.h:47,
mult.c:                  from mult.c:17:
mult.c: const_inline.h: In function 'cvt_to_unsigned':
mult.c: /usr/lib/gcc/x86_64-linux-gnu/11/include/avx2intrin.h:588:1: error: inlining failed in call to 'always_inline' '_mm256_shuffle_epi8': target specific option mismatch
mult.c:   588 | _mm256_shuffle_epi8 (__m256i __X, __m256i __Y)
mult.c:       | ^~~~~~~~~~~~~~~~~~~
mult.c: In file included from mult.c:19:
mult.c: const_inline.h:38:61: note: called from here
mult.c:    38 | static inline __m256i cvt_to_unsigned( __m256i a ) { return _mm256_shuffle_epi8(_cvt_to_unsigned,a&v_0x3); }
mult.c:       |                                                             ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
mult.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/immintrin.h:47,
mult.c:                  from mult.c:17:
mult.c: /usr/lib/gcc/x86_64-linux-gnu/11/include/avx2intrin.h:588:1: error: inlining failed in call to 'always_inline' '_mm256_shuffle_epi8': target specific option mismatch
mult.c:   588 | _mm256_shuffle_epi8 (__m256i __X, __m256i __Y)
mult.c:       | ^~~~~~~~~~~~~~~~~~~
mult.c: In file included from mult.c:19:
mult.c: const_inline.h:38:61: note: called from here
mult.c:    38 | static inline __m256i cvt_to_unsigned( __m256i a ) { return _mm256_shuffle_epi8(_cvt_to_unsigned,a&v_0x3); }
mult.c:       |                                                             ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Number of similar (implementation,compiler) pairs: 4, namely:
ImplementationCompiler
avx2unsignedgcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
avx2unsignedgcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
avx2unsignedgcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
avx2unsignedgcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)

Compiler output


mult.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/immintrin.h:99,
mult.c:                  from mult.c:2:
mult.c: mult.c: In function 'fastadd':
mult.c: /usr/lib/gcc/x86_64-linux-gnu/11/include/fmaintrin.h:63:1: error: inlining failed in call to 'always_inline' '_mm256_fmadd_ps': target specific option mismatch
mult.c:    63 | _mm256_fmadd_ps (__m256 __A, __m256 __B, __m256 __C)
mult.c:       | ^~~~~~~~~~~~~~~
mult.c: mult.c:87:10: note: called from here
mult.c:    87 |   return _mm256_fmadd_ps(y,v1_float,x);
mult.c:       |          ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~
mult.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/immintrin.h:99,
mult.c:                  from mult.c:2:
mult.c: /usr/lib/gcc/x86_64-linux-gnu/11/include/fmaintrin.h:63:1: error: inlining failed in call to 'always_inline' '_mm256_fmadd_ps': target specific option mismatch
mult.c:    63 | _mm256_fmadd_ps (__m256 __A, __m256 __B, __m256 __C)
mult.c:       | ^~~~~~~~~~~~~~~
mult.c: mult.c:87:10: note: called from here
mult.c:    87 |   return _mm256_fmadd_ps(y,v1_float,x);
mult.c:       |          ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Number of similar (implementation,compiler) pairs: 4, namely:
ImplementationCompiler
round1gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
round1gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
round1gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
round1gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)

Passed TIMECOP


TIMECOP iterations: 1

Number of similar (implementation,compiler) pairs: 8, namely:
ImplementationCompiler
32gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
32gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
32gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
32gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
refgcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
refgcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
refgcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
refgcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)