Implementation notes: amd64, hydra7, crypto_kem/ntskem1264

Computer: hydra7
Microarchitecture: amd64; Sandy Bridge+AES (206a7)
Architecture: amd64
CPU ID: GenuineIntel-000206a7-bfebfbff
SUPERCOP version: 20240625
Operation: crypto_kem
Primitive: ntskem1264
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
64282490478 6228 16107967 7100 1792T:sse2gcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024063020240625
68213662476 6228 1678751 7100 1792T:sse2gcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024063020240625
75665860598 6228 1676399 7100 1792T:sse2gcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024063020240625
79660856617 6228 1671470 7092 1760T:sse2gcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024063020240625
88857985637 6228 16104799 7100 1792T:optgcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024063020240625
89854456153 6228 1674079 7100 1792T:optgcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024063020240625
95964354144 6228 1671615 7100 1792T:optgcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024063020240625
102939250469 6228 1666718 7092 1760T:optgcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024063020240625
886209046119 76 1665207 924 1792T:refgcc_-march=native_-mtune=native_-O3_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024063020240625
978060025568 76 1643471 924 1792T:refgcc_-march=native_-mtune=native_-O2_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024063020240625
1060229523136 76 1640615 924 1792T:refgcc_-march=native_-mtune=native_-O_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024063020240625
1165781520200 76 1636414 916 1760T:refgcc_-march=native_-mtune=native_-Os_-fwrapv_-fPIC_-fPIE_-gdwarf-4_-Wall2024063020240625

Compiler output


bitslice_fft_256.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/immintrin.h:47,
bitslice_fft_256.c:                  from bitslice_fft_256.h:21,
bitslice_fft_256.c:                  from bitslice_fft_256.c:18:
bitslice_fft_256.c: bitslice_fft_256.c: In function 'bitslice_butterflies12_256':
bitslice_fft_256.c: /usr/lib/gcc/x86_64-linux-gnu/11/include/avx2intrin.h:913:1: error: inlining failed in call to 'always_inline' '_mm256_xor_si256': target specific option mismatch
bitslice_fft_256.c:   913 | _mm256_xor_si256 (__m256i __A, __m256i __B)
bitslice_fft_256.c:       | ^~~~~~~~~~~~~~~~
bitslice_fft_256.c: bitslice_fft_256.c:102:29: note: called from here
bitslice_fft_256.c:   102 |                 out[k][b] = _mm256_xor_si256(out[k][b], vb);
bitslice_fft_256.c:       |                             ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
bitslice_fft_256.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/immintrin.h:47,
bitslice_fft_256.c:                  from bitslice_fft_256.h:21,
bitslice_fft_256.c:                  from bitslice_fft_256.c:18:
bitslice_fft_256.c: /usr/lib/gcc/x86_64-linux-gnu/11/include/avx2intrin.h:913:1: error: inlining failed in call to 'always_inline' '_mm256_xor_si256': target specific option mismatch
bitslice_fft_256.c:   913 | _mm256_xor_si256 (__m256i __A, __m256i __B)
bitslice_fft_256.c:       | ^~~~~~~~~~~~~~~~
bitslice_fft_256.c: bitslice_fft_256.c:101:22: note: called from here
bitslice_fft_256.c:   101 |                 vb = _mm256_xor_si256(va, vb);
bitslice_fft_256.c:       |                      ^~~~~~~~~~~~~~~~~~~~~~~~
bitslice_fft_256.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/11/include/immintrin.h:47,
bitslice_fft_256.c:                  from bitslice_fft_256.h:21,
bitslice_fft_256.c:                  from bitslice_fft_256.c:18:
bitslice_fft_256.c: /usr/lib/gcc/x86_64-linux-gnu/11/include/avx2intrin.h:655:1: error: inlining failed in call to 'always_inline' '_mm256_slli_si256': target specific option mismatch
bitslice_fft_256.c:   655 | _mm256_slli_si256 (__m256i __A, const int __N)
bitslice_fft_256.c:       | ^~~~~~~~~~~~~~~~~
bitslice_fft_256.c: ...

Number of similar (implementation,compiler) pairs: 4, namely:
ImplementationCompiler
T:avx2gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:avx2gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:avx2gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)
T:avx2gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (11.4.0)