Test results for amd64, hydra8, crypto_kem/ntrulpr4591761

[Page version: 20260701 00:08:08]

Measurements for amd64, hydra8, crypto_kem Test results for amd64, hydra8, crypto_kem Test results for crypto_kem/ntrulpr4591761

Computer: hydra8
Microarchitecture: amd64; Ivy Bridge+AES (306a9)
Architecture: amd64
CPU ID: GenuineIntel-000306a9-bfebfbff
SUPERCOP version: 20260627
Operation: crypto_kem
Primitive: ntrulpr4591761

Time	Object size	Test size	Implementation	Compiler	Benchmark date	SUPERCOP version
54851227	25861 0 0	78000 1296 2312	`T:ref`	`clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20260308	20260217
54862744	23534 0 0	75938 1264 2376	`T:ref`	`gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20260308	20260217
54889022	5528 0 0	56234 1264 2376	`T:ref`	`gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20260308	20260217
54921107	4028 0 0	54160 1296 2312	`T:ref`	`clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20260308	20260217
54933182	12657 0 0	65136 1296 2312	`T:ref`	`clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20260308	20260217
54987242	4530 0 0	54657 1256 2376	`T:ref`	`gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20260308	20260217
55007922	5809 0 0	55258 1288 2312	`T:ref`	`clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20260308	20260217
55117108	4027 0 0	52964 1248 2344	`T:ref`	`gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall`	20260308	20260217
55187172	13791 0 0	65008 1296 2312	`T:ref`	`clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall`	20260308	20260217

Compiler output

mult.c: mult.c:732:38: error: always_inline function '_mm256_add_epi16' requires target feature 'avx2', but would be inlined into function 'rq_mult' that is compiled without support for 'avx2'
mult.c:   732 |     __m256i x = _mm256_add_epi16(fgi,_mm256_add_epi16(fgip,fgip1));
mult.c:       |                                      ^
mult.c: mult.c:732:17: error: always_inline function '_mm256_add_epi16' requires target feature 'avx2', but would be inlined into function 'rq_mult' that is compiled without support for 'avx2'
mult.c:   732 |     __m256i x = _mm256_add_epi16(fgi,_mm256_add_epi16(fgip,fgip1));
mult.c:       |                 ^
mult.c: 2 errors generated.

Number of similar (implementation,compiler) pairs: 4, namely:

Implementation	Compiler
`T:avx`	`clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`T:avx`	`clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`T:avx`	`clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`T:avx`	`clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`

Compiler output

mult.c: mult.c:147:22: error: invalid output size for constraint '=&x'
mult.c:   147 |   MULSTEP_fromzero(0,h0,h1,h2,h3,h4)
mult.c:       |                      ^
mult.c: mult.c:149:26: error: invalid output size for constraint '+x'
mult.c:   149 |     MULSTEP_noload(j + 1,h1,h2,h3,h4,h0)
mult.c:       |                          ^
mult.c: mult.c:150:26: error: invalid output size for constraint '+x'
mult.c:   150 |     MULSTEP_noload(j + 2,h2,h3,h4,h0,h1)
mult.c:       |                          ^
mult.c: mult.c:151:26: error: invalid output size for constraint '+x'
mult.c:   151 |     MULSTEP_noload(j + 3,h3,h4,h0,h1,h2)
mult.c:       |                          ^
mult.c: mult.c:152:26: error: invalid output size for constraint '+x'
mult.c:   152 |     MULSTEP_noload(j + 4,h4,h0,h1,h2,h3)
mult.c:       |                          ^
mult.c: mult.c:153:26: error: invalid output size for constraint '+x'
mult.c:   153 |     MULSTEP_noload(j + 5,h0,h1,h2,h3,h4)
mult.c:       |                          ^
mult.c: mult.c:155:24: error: invalid output size for constraint '+x'
mult.c:   155 |   MULSTEP_noload(j + 1,h1,h2,h3,h4,h0)
mult.c:       |                        ^
mult.c: mult.c:156:24: error: invalid output size for constraint '+x'
mult.c:   156 |   MULSTEP_noload(j + 2,h2,h3,h4,h0,h1)
mult.c:       |                        ^
mult.c: mult.c:157:24: error: invalid output size for constraint '+x'
mult.c: ...

Number of similar (implementation,compiler) pairs: 1, namely:

Implementation	Compiler
`T:avx`	`clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`

Compiler output

mult.c: mult.c: In function 'mult768_mix2_m256i':
mult.c: mult.c:568:3: warning: 'mult96x16' accessing 6144 bytes in a region of size 512 [-Wstringop-overflow=]
mult.c:   568 |   mult96x16(hkara[12],fkara[6],(__m256i *) (1 + (__m128i *) gkara));
mult.c:       |   ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
mult.c: mult.c:568:3: note: referencing argument 1 of type '__m256i[192]'
mult.c: mult.c:568:3: warning: 'mult96x16' reading 3072 bytes from a region of size 512 [-Wstringop-overread]
mult.c: mult.c:568:3: note: referencing argument 2 of type 'const __m256i[96]'
mult.c: mult.c:568:3: warning: 'mult96x16' reading 3072 bytes from a region of size 3056 [-Wstringop-overread]
mult.c: mult.c:568:3: note: referencing argument 3 of type 'const __m256i[96]'
mult.c: mult.c:279:13: note: in a call to function 'mult96x16'
mult.c:   279 | static void mult96x16(__m256i h[192],const __m256i f[96],const __m256i g[96])
mult.c:       |             ^~~~~~~~~
mult.c: mult.c:569:3: warning: 'mult96x16' accessing 6144 bytes in a region of size 512 [-Wstringop-overflow=]
mult.c:   569 |   mult96x16(hkara[0],fkara[0],gkara[0]);
mult.c:       |   ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
mult.c: mult.c:569:3: note: referencing argument 1 of type '__m256i[192]'
mult.c: mult.c:569:3: warning: 'mult96x16' reading 3072 bytes from a region of size 512 [-Wstringop-overread]
mult.c: mult.c:569:3: note: referencing argument 2 of type 'const __m256i[96]'
mult.c: mult.c:569:3: warning: 'mult96x16' reading 3072 bytes from a region of size 1024 [-Wstringop-overread]
mult.c: mult.c:569:3: note: referencing argument 3 of type 'const __m256i[96]'
mult.c: mult.c:279:13: note: in a call to function 'mult96x16'
mult.c:   279 | static void mult96x16(__m256i h[192],const __m256i f[96],const __m256i g[96])
mult.c:       |             ^~~~~~~~~
mult.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/13/include/immintrin.h:109,
mult.c:                  from mult.c:2:
mult.c: ...

Number of similar (implementation,compiler) pairs: 4, namely:

Implementation	Compiler
`T:avx`	`gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`
`T:avx`	`gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`
`T:avx`	`gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`
`T:avx`	`gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`

Namespace violations

hide.o hide T
randomweightw.o small_random_weightw T
randomweightw.o small_seeded_weightw T
rq_fromseed.o rq_fromseed T
rq_mult.o rq_mult T
rq_round3.o rq_round3 T
rq_rounded.o rq_decoderounded T
rq_rounded.o rq_encoderounded T
small.o small_decode T
small.o small_encode T

Number of similar (implementation,compiler) pairs: 9, namely:

Implementation	Compiler
`T:ref`	`clang -march=native -O2 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`T:ref`	`clang -march=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`T:ref`	`clang -march=native -O -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`T:ref`	`clang -march=native -Os -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`T:ref`	`clang -mcpu=native -O3 -fwrapv -Qunused-arguments -fPIC -fPIE -gdwarf-4 -Wall (Ubuntu_Clang_18.1.3_(1ubuntu1))`
`T:ref`	`gcc -march=native -mtune=native -O2 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`
`T:ref`	`gcc -march=native -mtune=native -O3 -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`
`T:ref`	`gcc -march=native -mtune=native -O -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`
`T:ref`	`gcc -march=native -mtune=native -Os -fwrapv -fPIC -fPIE -gdwarf-4 -Wall (13.3.0)`