Implementation notes: amd64, genji291, crypto_scalarmult/kummer

Computer: genji291
Architecture: amd64
CPU ID: GenuineIntel-00050671-bfebfbff
SUPERCOP version: 20180818
Operation: crypto_scalarmult
Primitive: kummer
TimeImplementationCompilerBenchmark dateSUPERCOP version
430584avx2inticc -xMIC-AVX512 -O2 -fomit-frame-pointer2018082020180818
432922avx2inticc -xMIC-AVX512 -O3 -fomit-frame-pointer2018082020180818
536662avx2inticc -xCORE-AVX2 -O2 -fomit-frame-pointer2018082020180818
536760avx2inticc -xSSE4.2 -O2 -fomit-frame-pointer2018082020180818
536830avx2inticc -xSSE4.1 -O2 -fomit-frame-pointer2018082020180818
537404avx2inticc -no-vec2018082020180818
537432avx2inticc -xAVX -O2 -fomit-frame-pointer2018082020180818
537684avx2inticc -xCOMMON-AVX512 -O2 -fomit-frame-pointer2018082020180818
537922avx2inticc -xCORE-AVX-I -O2 -fomit-frame-pointer2018082020180818
546406avx2inticc -xSSE4.1 -O3 -fomit-frame-pointer2018082020180818
546924avx2inticc -xCORE-AVX2 -O3 -fomit-frame-pointer2018082020180818
547358avx2inticc -xSSE4.2 -O3 -fomit-frame-pointer2018082020180818
547848avx2inticc -xCOMMON-AVX512 -O3 -fomit-frame-pointer2018082020180818
552930avx2inticc -xCORE-AVX-I -O3 -fomit-frame-pointer2018082020180818
562128avx2inticc2018082020180818
632744avx2inticc -xAVX -O3 -fomit-frame-pointer2018082020180818
1399090avx2icc -xCOMMON-AVX512 -O3 -fomit-frame-pointer2018082020180818
1399944avx2icc -xAVX -O2 -fomit-frame-pointer2018082020180818
1401442avx2icc2018082020180818
1401694avx2icc -no-vec2018082020180818
1403346avx2icc -xMIC-AVX512 -O3 -fomit-frame-pointer2018082020180818
1405432avx2icc -xCORE-AVX2 -O2 -fomit-frame-pointer2018082020180818
1407686avx2icc -xSSE4.2 -O3 -fomit-frame-pointer2018082020180818
1407784avx2icc -xCORE-AVX2 -O3 -fomit-frame-pointer2018082020180818
1408428avx2icc -xMIC-AVX512 -O2 -fomit-frame-pointer2018082020180818
1408708avx2icc -xCORE-AVX-I -O2 -fomit-frame-pointer2018082020180818
1409184avx2icc -xSSE4.2 -O2 -fomit-frame-pointer2018082020180818
1413860avx2icc -xSSE4.1 -O2 -fomit-frame-pointer2018082020180818
1414406avx2icc -xSSE4.1 -O3 -fomit-frame-pointer2018082020180818
1414476avx2icc -xCORE-AVX-I -O3 -fomit-frame-pointer2018082020180818
1418326avx2icc -xAVX -O3 -fomit-frame-pointer2018082020180818
1421294avx2icc -xCOMMON-AVX512 -O2 -fomit-frame-pointer2018082020180818
1662080avxicc -xMIC-AVX512 -O2 -fomit-frame-pointer2018082020180818
1662220avxicc2018082020180818
1666672avxicc -xCOMMON-AVX512 -O2 -fomit-frame-pointer2018082020180818
1667008avxicc -xMIC-AVX512 -O3 -fomit-frame-pointer2018082020180818
1668226avxicc -xCORE-AVX-I -O3 -fomit-frame-pointer2018082020180818
1670214avxicc -xSSE4.2 -O2 -fomit-frame-pointer2018082020180818
1672342avxicc -xCORE-AVX2 -O3 -fomit-frame-pointer2018082020180818
1672678avxicc -xCORE-AVX2 -O2 -fomit-frame-pointer2018082020180818
1674722avxicc -xSSE4.1 -O3 -fomit-frame-pointer2018082020180818
1677102avxicc -no-vec2018082020180818
1680042avxicc -xSSE4.1 -O2 -fomit-frame-pointer2018082020180818
1680742avxicc -xCORE-AVX-I -O2 -fomit-frame-pointer2018082020180818
1683584avxicc -xAVX -O3 -fomit-frame-pointer2018082020180818
1693048avxicc -xSSE4.2 -O3 -fomit-frame-pointer2018082020180818
1710184avxicc -xCOMMON-AVX512 -O3 -fomit-frame-pointer2018082020180818
1995756ref5uicc -xSSE4.1 -O3 -fomit-frame-pointer2018082020180818
2017722ref5uicc -xCORE-AVX-I -O3 -fomit-frame-pointer2018082020180818
2040444ref5uicc -xSSE4.2 -O3 -fomit-frame-pointer2018082020180818
2045512ref5uicc -xAVX -O3 -fomit-frame-pointer2018082020180818
2058672ref5icc2018082020180818
2086028ref5uicc2018082020180818
2103346ref5icc -xCORE-AVX-I -O3 -fomit-frame-pointer2018082020180818
2119698ref5icc -xAVX -O3 -fomit-frame-pointer2018082020180818
2126180ref5icc -xSSE4.1 -O3 -fomit-frame-pointer2018082020180818
2152430avxicc -xAVX -O2 -fomit-frame-pointer2018082020180818
2164526ref5uicc -xMIC-AVX512 -O3 -fomit-frame-pointer2018082020180818
2165828ref5icc -xSSE4.2 -O3 -fomit-frame-pointer2018082020180818
2188830ref5uicc -no-vec2018082020180818
2217894ref5icc -xMIC-AVX512 -O2 -fomit-frame-pointer2018082020180818
2263450ref5icc -no-vec2018082020180818
2273698ref5icc -xMIC-AVX512 -O3 -fomit-frame-pointer2018082020180818
2318358ref5uicc -xMIC-AVX512 -O2 -fomit-frame-pointer2018082020180818
2511838ref5icc -xSSE4.2 -O2 -fomit-frame-pointer2018082020180818
2527952ref5icc -xSSE4.1 -O2 -fomit-frame-pointer2018082020180818
2558136ref5icc -xAVX -O2 -fomit-frame-pointer2018082020180818
2574544ref5uicc -xCORE-AVX-I -O2 -fomit-frame-pointer2018082020180818
2574600ref5uicc -xSSE4.2 -O2 -fomit-frame-pointer2018082020180818
2581698ref5uicc -xAVX -O2 -fomit-frame-pointer2018082020180818
2585828ref5uicc -xSSE4.1 -O2 -fomit-frame-pointer2018082020180818
2765532ref5icc -xCORE-AVX-I -O2 -fomit-frame-pointer2018082020180818
2923606ref5uicc -xCORE-AVX2 -O3 -fomit-frame-pointer2018082020180818
2952950ref5uicc -xCOMMON-AVX512 -O3 -fomit-frame-pointer2018082020180818
3029264ref5uicc -xCORE-AVX2 -O2 -fomit-frame-pointer2018082020180818
3034374ref5icc -xCOMMON-AVX512 -O2 -fomit-frame-pointer2018082020180818
3035172ref5icc -xCORE-AVX2 -O2 -fomit-frame-pointer2018082020180818
3039302ref5icc -xCORE-AVX2 -O3 -fomit-frame-pointer2018082020180818
3050642ref5icc -xCOMMON-AVX512 -O3 -fomit-frame-pointer2018082020180818
3309040ref5uicc -xCOMMON-AVX512 -O2 -fomit-frame-pointer2018082020180818

Compiler output

Implementation: crypto_scalarmult/kummer/avx2int
Compiler: cc
smult.c: In file included from smult.c:3:0:
smult.c: gfe4x.h:7:1: error: unknown type name '__m256i'
smult.c: typedef __m256i vec;
smult.c: ^
smult.c: smult.c:6:1: warning: excess elements in scalar initializer [enabled by default]
smult.c: static const vec mask25 = {0x1ffffff,0x1ffffff,0x1ffffff,0x1ffffff};
smult.c: ^
smult.c: smult.c:6:1: warning: (near initialization for 'mask25') [enabled by default]
smult.c: smult.c:6:1: warning: excess elements in scalar initializer [enabled by default]
smult.c: smult.c:6:1: warning: (near initialization for 'mask25') [enabled by default]
smult.c: smult.c:6:1: warning: excess elements in scalar initializer [enabled by default]
smult.c: smult.c:6:1: warning: (near initialization for 'mask25') [enabled by default]
smult.c: smult.c:7:1: warning: excess elements in scalar initializer [enabled by default]
smult.c: static const vec mask26 = {0x3ffffff,0x3ffffff,0x3ffffff,0x3ffffff};
smult.c: ^
smult.c: smult.c:7:1: warning: (near initialization for 'mask26') [enabled by default]
smult.c: smult.c:7:1: warning: excess elements in scalar initializer [enabled by default]
smult.c: smult.c:7:1: warning: (near initialization for 'mask26') [enabled by default]
smult.c: smult.c:7:1: warning: excess elements in scalar initializer [enabled by default]
smult.c: smult.c:7:1: warning: (near initialization for 'mask26') [enabled by default]
smult.c: smult.c:304:3: warning: braces around scalar initializer [enabled by default]
smult.c: init(0xffffffc+1,0xffffffc+1,0xffffffc+2,0xffffffc+2,0xffffffc+2,0xffffffc+2,0xffffffc+1,0xffffffc+1)
smult.c: ^
smult.c: smult.c:304:3: warning: (near initialization for 'hadamardoffset[0]') [enabled by default]
smult.c: smult.c:304:3: warning: overflow in implicit constant conversion [-Woverflow]
smult.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
cc avx2int

Compiler output

Implementation: crypto_scalarmult/kummer/avx
Compiler: cc
try.c: /scratch_lustre_DDN7k/bdolbeaur/supercop-20180818/supercop-data/genji291/amd64/lib/knownrandombytes.o: In function `randombytes':
try.c: knownrandombytes.c:(.text+0x...): undefined reference to `_intel_fast_memcpy'
try.c: knownrandombytes.c:(.text+0x...): undefined reference to `_intel_fast_memset'
try.c: /scratch_lustre_DDN7k/bdolbeaur/supercop-20180818/supercop-data/genji291/amd64/lib/libsupercop.a(crypto_stream_chacha20_dolbeau_amd64_avx2-api.o): In function `crypto_stream_chacha20_dolbeau_amd64_avx2':
try.c: api.c:(.text+0x...): undefined reference to `__intel_mic_avx512f_memset'
try.c: /scratch_lustre_DDN7k/bdolbeaur/supercop-20180818/supercop-data/genji291/amd64/lib/libsupercop.a(crypto_stream_chacha20_dolbeau_amd64_avx2-chacha.o): In function `crypto_stream_chacha20_dolbeau_amd64_avx2_ECRYPT_keystream_bytes':
try.c: chacha.c:(.text+0x...): undefined reference to `__intel_mic_avx512f_memset'
try.c: collect2: error: ld returned 1 exit status

Number of similar (compiler,implementation) pairs: 440, namely:
CompilerImplementations
cc avx avx2 ref5 ref5u
gcc avx avx2 ref5 ref5u
gcc -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -fno-schedule-insns -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -fno-schedule-insns -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -fno-schedule-insns -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -fno-schedule-insns -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops avx avx2 ref5 ref5u
gcc -funroll-loops -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -m64 -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -m64 -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -m64 -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -m64 -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -m64 -march=barcelona -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -m64 -march=barcelona -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -m64 -march=barcelona -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -m64 -march=barcelona -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -m64 -march=k8 -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -m64 -march=k8 -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -m64 -march=k8 -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -m64 -march=k8 -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -m64 -march=nocona -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -m64 -march=nocona -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -m64 -march=nocona -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -m64 -march=nocona -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -march=barcelona -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -march=barcelona -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -march=barcelona -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -march=barcelona -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -march=k8 -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -march=k8 -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -march=k8 -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -march=k8 -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -march=nocona -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -march=nocona -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -march=nocona -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -march=nocona -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=core-avx-i -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=core-avx-i -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=core-avx-i -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=core-avx-i -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=core-avx2 -O2 -fomit-frame-pointer avx avx2 avx2int ref5 ref5u
gcc -m64 -march=core-avx2 -O3 -fomit-frame-pointer avx avx2 avx2int ref5 ref5u
gcc -m64 -march=core-avx2 -O -fomit-frame-pointer avx avx2 avx2int ref5 ref5u
gcc -m64 -march=core-avx2 -Os -fomit-frame-pointer avx avx2 avx2int ref5 ref5u
gcc -m64 -march=core2 -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=core2 -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=core2 -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=core2 -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=core2 -msse4.1 -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=core2 -msse4.1 -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=core2 -msse4.1 -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=core2 -msse4.1 -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=core2 -msse4 -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=core2 -msse4 -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=core2 -msse4 -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=core2 -msse4 -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=corei7-avx -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=corei7-avx -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=corei7-avx -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=corei7-avx -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=corei7 -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=corei7 -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=corei7 -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=corei7 -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=k8 -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=k8 -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=k8 -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=k8 -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=native -mtune=native -O2 -fomit-frame-pointer avx avx2 avx2int ref5 ref5u
gcc -m64 -march=native -mtune=native -O3 -fomit-frame-pointer avx avx2 avx2int ref5 ref5u
gcc -m64 -march=native -mtune=native -O -fomit-frame-pointer avx avx2 avx2int ref5 ref5u
gcc -m64 -march=native -mtune=native -Os -fomit-frame-pointer avx avx2 avx2int ref5 ref5u
gcc -m64 -march=nocona -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=nocona -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=nocona -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=nocona -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -march=barcelona -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -march=barcelona -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -march=barcelona -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -march=barcelona -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -march=k8 -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -march=k8 -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -march=k8 -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -march=k8 -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv avx avx2 avx2int ref5 ref5u
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv avx avx2 avx2int ref5 ref5u
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv avx avx2 avx2int ref5 ref5u
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv avx avx2 avx2int ref5 ref5u
gcc -march=nocona -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -march=nocona -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -march=nocona -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -march=nocona -Os -fomit-frame-pointer avx avx2 ref5 ref5u

Compiler output

Implementation: crypto_scalarmult/kummer/avx2int
Compiler: gcc
smult.c: smult.c: In function 'gfe4x_mulconst':
smult.c: smult.c:36:6: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
smult.c: t0 = _mm256_mul_epi32(a->gt;v[0],*b);
smult.c: ~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
smult.c: In file included from /home_nfs_robin_ib/bdolbeaur/gcc-8.2.0-full+isl/lib/gcc/x86_64-pc-linux-gnu/8.2.0/include/immintrin.h:43,
smult.c: from gfe4x.h:5,
smult.c: from smult.c:3:
smult.c: /home_nfs_robin_ib/bdolbeaur/gcc-8.2.0-full+isl/lib/gcc/x86_64-pc-linux-gnu/8.2.0/include/avx2intrin.h:126:1: error: inlining failed in call to always_inline '_mm256_add_epi64': target specific option mismatch
smult.c: _mm256_add_epi64 (__m256i __A, __m256i __B)
smult.c: ^~~~~~~~~~~~~~~~
smult.c: smult.c:45:10: note: called from here
smult.c: t2 = _mm256_add_epi64(t2,_mm256_srli_epi64(t1,25)); t1 &= mask25;
smult.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
smult.c: In file included from /home_nfs_robin_ib/bdolbeaur/gcc-8.2.0-full+isl/lib/gcc/x86_64-pc-linux-gnu/8.2.0/include/immintrin.h:43,
smult.c: from gfe4x.h:5,
smult.c: from smult.c:3:
smult.c: /home_nfs_robin_ib/bdolbeaur/gcc-8.2.0-full+isl/lib/gcc/x86_64-pc-linux-gnu/8.2.0/include/avx2intrin.h:787:1: error: inlining failed in call to always_inline '_mm256_srli_epi64': target specific option mismatch
smult.c: _mm256_srli_epi64 (__m256i __A, int __B)
smult.c: ^~~~~~~~~~~~~~~~~
smult.c: smult.c:45:10: note: called from here
smult.c: t2 = _mm256_add_epi64(t2,_mm256_srli_epi64(t1,25)); t1 &= mask25;
smult.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
smult.c: In file included from /home_nfs_robin_ib/bdolbeaur/gcc-8.2.0-full+isl/lib/gcc/x86_64-pc-linux-gnu/8.2.0/include/immintrin.h:43,
smult.c: from gfe4x.h:5,
smult.c: from smult.c:3:
smult.c: ...

Number of similar (compiler,implementation) pairs: 86, namely:
CompilerImplementations
gcc avx2int
gcc -O2 -fomit-frame-pointer avx2int
gcc -O3 -fomit-frame-pointer avx2int
gcc -O -fomit-frame-pointer avx2int
gcc -Os -fomit-frame-pointer avx2int
gcc -fno-schedule-insns -O2 -fomit-frame-pointer avx2int
gcc -fno-schedule-insns -O3 -fomit-frame-pointer avx2int
gcc -fno-schedule-insns -O -fomit-frame-pointer avx2int
gcc -fno-schedule-insns -Os -fomit-frame-pointer avx2int
gcc -funroll-loops avx2int
gcc -funroll-loops -O2 -fomit-frame-pointer avx2int
gcc -funroll-loops -O3 -fomit-frame-pointer avx2int
gcc -funroll-loops -O -fomit-frame-pointer avx2int
gcc -funroll-loops -Os -fomit-frame-pointer avx2int
gcc -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer avx2int
gcc -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer avx2int
gcc -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer avx2int
gcc -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer avx2int
gcc -funroll-loops -m64 -O2 -fomit-frame-pointer avx2int
gcc -funroll-loops -m64 -O3 -fomit-frame-pointer avx2int
gcc -funroll-loops -m64 -O -fomit-frame-pointer avx2int
gcc -funroll-loops -m64 -Os -fomit-frame-pointer avx2int
gcc -funroll-loops -m64 -march=barcelona -O2 -fomit-frame-pointer avx2int
gcc -funroll-loops -m64 -march=barcelona -O3 -fomit-frame-pointer avx2int
gcc -funroll-loops -m64 -march=barcelona -O -fomit-frame-pointer avx2int
gcc -funroll-loops -m64 -march=barcelona -Os -fomit-frame-pointer avx2int
gcc -funroll-loops -m64 -march=k8 -O2 -fomit-frame-pointer avx2int
gcc -funroll-loops -m64 -march=k8 -O3 -fomit-frame-pointer avx2int
gcc -funroll-loops -m64 -march=k8 -O -fomit-frame-pointer avx2int
gcc -funroll-loops -m64 -march=k8 -Os -fomit-frame-pointer avx2int
gcc -funroll-loops -m64 -march=nocona -O2 -fomit-frame-pointer avx2int
gcc -funroll-loops -m64 -march=nocona -O3 -fomit-frame-pointer avx2int
gcc -funroll-loops -m64 -march=nocona -O -fomit-frame-pointer avx2int
gcc -funroll-loops -m64 -march=nocona -Os -fomit-frame-pointer avx2int
gcc -funroll-loops -march=barcelona -O2 -fomit-frame-pointer avx2int
gcc -funroll-loops -march=barcelona -O3 -fomit-frame-pointer avx2int
gcc -funroll-loops -march=barcelona -O -fomit-frame-pointer avx2int
gcc -funroll-loops -march=barcelona -Os -fomit-frame-pointer avx2int
gcc -funroll-loops -march=k8 -O2 -fomit-frame-pointer avx2int
gcc -funroll-loops -march=k8 -O3 -fomit-frame-pointer avx2int
gcc -funroll-loops -march=k8 -O -fomit-frame-pointer avx2int
gcc -funroll-loops -march=k8 -Os -fomit-frame-pointer avx2int
gcc -funroll-loops -march=nocona -O2 -fomit-frame-pointer avx2int
gcc -funroll-loops -march=nocona -O3 -fomit-frame-pointer avx2int
gcc -funroll-loops -march=nocona -O -fomit-frame-pointer avx2int
gcc -funroll-loops -march=nocona -Os -fomit-frame-pointer avx2int
gcc -m64 -O2 -fomit-frame-pointer avx2int
gcc -m64 -O3 -fomit-frame-pointer avx2int
gcc -m64 -O -fomit-frame-pointer avx2int
gcc -m64 -Os -fomit-frame-pointer avx2int
gcc -m64 -march=core2 -O2 -fomit-frame-pointer avx2int
gcc -m64 -march=core2 -O3 -fomit-frame-pointer avx2int
gcc -m64 -march=core2 -O -fomit-frame-pointer avx2int
gcc -m64 -march=core2 -Os -fomit-frame-pointer avx2int
gcc -m64 -march=core2 -msse4.1 -O2 -fomit-frame-pointer avx2int
gcc -m64 -march=core2 -msse4.1 -O3 -fomit-frame-pointer avx2int
gcc -m64 -march=core2 -msse4.1 -O -fomit-frame-pointer avx2int
gcc -m64 -march=core2 -msse4.1 -Os -fomit-frame-pointer avx2int
gcc -m64 -march=core2 -msse4 -O2 -fomit-frame-pointer avx2int
gcc -m64 -march=core2 -msse4 -O3 -fomit-frame-pointer avx2int
gcc -m64 -march=core2 -msse4 -O -fomit-frame-pointer avx2int
gcc -m64 -march=core2 -msse4 -Os -fomit-frame-pointer avx2int
gcc -m64 -march=corei7 -O2 -fomit-frame-pointer avx2int
gcc -m64 -march=corei7 -O3 -fomit-frame-pointer avx2int
gcc -m64 -march=corei7 -O -fomit-frame-pointer avx2int
gcc -m64 -march=corei7 -Os -fomit-frame-pointer avx2int
gcc -m64 -march=k8 -O2 -fomit-frame-pointer avx2int
gcc -m64 -march=k8 -O3 -fomit-frame-pointer avx2int
gcc -m64 -march=k8 -O -fomit-frame-pointer avx2int
gcc -m64 -march=k8 -Os -fomit-frame-pointer avx2int
gcc -m64 -march=nocona -O2 -fomit-frame-pointer avx2int
gcc -m64 -march=nocona -O3 -fomit-frame-pointer avx2int
gcc -m64 -march=nocona -O -fomit-frame-pointer avx2int
gcc -m64 -march=nocona -Os -fomit-frame-pointer avx2int
gcc -march=barcelona -O2 -fomit-frame-pointer avx2int
gcc -march=barcelona -O3 -fomit-frame-pointer avx2int
gcc -march=barcelona -O -fomit-frame-pointer avx2int
gcc -march=barcelona -Os -fomit-frame-pointer avx2int
gcc -march=k8 -O2 -fomit-frame-pointer avx2int
gcc -march=k8 -O3 -fomit-frame-pointer avx2int
gcc -march=k8 -O -fomit-frame-pointer avx2int
gcc -march=k8 -Os -fomit-frame-pointer avx2int
gcc -march=nocona -O2 -fomit-frame-pointer avx2int
gcc -march=nocona -O3 -fomit-frame-pointer avx2int
gcc -march=nocona -O -fomit-frame-pointer avx2int
gcc -march=nocona -Os -fomit-frame-pointer avx2int

Compiler output

Implementation: crypto_scalarmult/kummer/avx2int
Compiler: gcc -m64 -march=barcelona -O2 -fomit-frame-pointer
smult.c: smult.c: In function 'gfe4x_mulconst':
smult.c: smult.c:36:6: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
smult.c: t0 = _mm256_mul_epi32(a->gt;v[0],*b);
smult.c: ~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
smult.c: In file included from /home_nfs_robin_ib/bdolbeaur/gcc-8.2.0-full+isl/lib/gcc/x86_64-pc-linux-gnu/8.2.0/include/immintrin.h:43,
smult.c: from gfe4x.h:5,
smult.c: from smult.c:3:
smult.c: /home_nfs_robin_ib/bdolbeaur/gcc-8.2.0-full+isl/lib/gcc/x86_64-pc-linux-gnu/8.2.0/include/avx2intrin.h:126:1: error: inlining failed in call to always_inline '_mm256_add_epi64': target specific option mismatch
smult.c: _mm256_add_epi64 (__m256i __A, __m256i __B)
smult.c: ^~~~~~~~~~~~~~~~
smult.c: smult.c:45:10: note: called from here
smult.c: t2 = _mm256_add_epi64(t2,_mm256_srli_epi64(t1,25)); t1 &= mask25;
smult.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
smult.c: In file included from /home_nfs_robin_ib/bdolbeaur/gcc-8.2.0-full+isl/lib/gcc/x86_64-pc-linux-gnu/8.2.0/include/immintrin.h:43,
smult.c: from gfe4x.h:5,
smult.c: from smult.c:3:
smult.c: /home_nfs_robin_ib/bdolbeaur/gcc-8.2.0-full+isl/lib/gcc/x86_64-pc-linux-gnu/8.2.0/include/avx2intrin.h:787:1: error: inlining failed in call to always_inline '_mm256_srli_epi64': target specific option mismatch
smult.c: _mm256_srli_epi64 (__m256i __A, int __B)
smult.c: ^~~~~~~~~~~~~~~~~
smult.c: smult.c:45:10: note: called from here
smult.c: t2 = _mm256_add_epi64(t2,_mm256_srli_epi64(t1,25)); t1 &= mask25;
smult.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
smult.c: In file included from /home_nfs_robin_ib/bdolbeaur/gcc-8.2.0-full+isl/lib/gcc/x86_64-pc-linux-gnu/8.2.0/include/immintrin.h:43,
smult.c: from gfe4x.h:5,
smult.c: from smult.c:3:
smult.c: ...
smult.c: smult.c: In function 'gfe4x_mulconst':
smult.c: smult.c:36:6: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
smult.c: t0 = _mm256_mul_epi32(a->gt;v[0],*b);
smult.c: ~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
smult.c: In file included from /home_nfs_robin_ib/bdolbeaur/gcc-8.2.0-full+isl/lib/gcc/x86_64-pc-linux-gnu/8.2.0/include/immintrin.h:43,
smult.c: from gfe4x.h:5,
smult.c: from smult.c:3:
smult.c: /home_nfs_robin_ib/bdolbeaur/gcc-8.2.0-full+isl/lib/gcc/x86_64-pc-linux-gnu/8.2.0/include/avx2intrin.h:126:1: error: inlining failed in call to always_inline '_mm256_add_epi64': target specific option mismatch
smult.c: _mm256_add_epi64 (__m256i __A, __m256i __B)
smult.c: ^~~~~~~~~~~~~~~~
smult.c: smult.c:45:10: note: called from here
smult.c: t2 = _mm256_add_epi64(t2,_mm256_srli_epi64(t1,25)); t1 &= mask25;
smult.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
smult.c: In file included from /home_nfs_robin_ib/bdolbeaur/gcc-8.2.0-full+isl/lib/gcc/x86_64-pc-linux-gnu/8.2.0/include/immintrin.h:43,
smult.c: from gfe4x.h:5,
smult.c: from smult.c:3:
smult.c: /home_nfs_robin_ib/bdolbeaur/gcc-8.2.0-full+isl/lib/gcc/x86_64-pc-linux-gnu/8.2.0/include/avx2intrin.h:787:1: error: inlining failed in call to always_inline '_mm256_srli_epi64': target specific option mismatch
smult.c: _mm256_srli_epi64 (__m256i __A, int __B)
smult.c: ^~~~~~~~~~~~~~~~~
smult.c: smult.c:45:10: note: called from here
smult.c: t2 = _mm256_add_epi64(t2,_mm256_srli_epi64(t1,25)); t1 &= mask25;
smult.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
smult.c: In file included from /home_nfs_robin_ib/bdolbeaur/gcc-8.2.0-full+isl/lib/gcc/x86_64-pc-linux-gnu/8.2.0/include/immintrin.h:43,
smult.c: from gfe4x.h:5,
smult.c: from smult.c:3:
smult.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -m64 -march=barcelona -O2 -fomit-frame-pointer avx2int
gcc -m64 -march=barcelona -O3 -fomit-frame-pointer avx2int
gcc -m64 -march=barcelona -O -fomit-frame-pointer avx2int
gcc -m64 -march=barcelona -Os -fomit-frame-pointer avx2int

Compiler output

Implementation: crypto_scalarmult/kummer/avx
Compiler: gcc -m64 -march=barcelona -O2 -fomit-frame-pointer
try.c: /scratch_lustre_DDN7k/bdolbeaur/supercop-20180818/supercop-data/genji291/amd64/lib/knownrandombytes.o: In function `randombytes':
try.c: knownrandombytes.c:(.text+0x...): undefined reference to `_intel_fast_memcpy'
try.c: knownrandombytes.c:(.text+0x...): undefined reference to `_intel_fast_memset'
try.c: /scratch_lustre_DDN7k/bdolbeaur/supercop-20180818/supercop-data/genji291/amd64/lib/libsupercop.a(crypto_stream_chacha20_dolbeau_amd64_avx2-api.o): In function `crypto_stream_chacha20_dolbeau_amd64_avx2':
try.c: api.c:(.text+0x...): undefined reference to `__intel_mic_avx512f_memset'
try.c: /scratch_lustre_DDN7k/bdolbeaur/supercop-20180818/supercop-data/genji291/amd64/lib/libsupercop.a(crypto_stream_chacha20_dolbeau_amd64_avx2-chacha.o): In function `crypto_stream_chacha20_dolbeau_amd64_avx2_ECRYPT_keystream_bytes':
try.c: chacha.c:(.text+0x...): undefined reference to `__intel_mic_avx512f_memset'
try.c: collect2: error: ld returned 1 exit status
try.c: /scratch_lustre_DDN7k/bdolbeaur/supercop-20180818/supercop-data/genji291/amd64/lib/knownrandombytes.o: In function `randombytes':
try.c: knownrandombytes.c:(.text+0x...): undefined reference to `_intel_fast_memcpy'
try.c: knownrandombytes.c:(.text+0x...): undefined reference to `_intel_fast_memset'
try.c: /scratch_lustre_DDN7k/bdolbeaur/supercop-20180818/supercop-data/genji291/amd64/lib/libsupercop.a(crypto_stream_chacha20_dolbeau_amd64_avx2-api.o): In function `crypto_stream_chacha20_dolbeau_amd64_avx2':
try.c: api.c:(.text+0x...): undefined reference to `__intel_mic_avx512f_memset'
try.c: /scratch_lustre_DDN7k/bdolbeaur/supercop-20180818/supercop-data/genji291/amd64/lib/libsupercop.a(crypto_stream_chacha20_dolbeau_amd64_avx2-chacha.o): In function `crypto_stream_chacha20_dolbeau_amd64_avx2_ECRYPT_keystream_bytes':
try.c: chacha.c:(.text+0x...): undefined reference to `__intel_mic_avx512f_memset'
try.c: collect2: error: ld returned 1 exit status

Number of similar (compiler,implementation) pairs: 16, namely:
CompilerImplementations
gcc -m64 -march=barcelona -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=barcelona -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=barcelona -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=barcelona -Os -fomit-frame-pointer avx avx2 ref5 ref5u

Compiler output

Implementation: crypto_scalarmult/kummer/avx2int
Compiler: gcc -m64 -march=core-avx-i -O2 -fomit-frame-pointer
smult.c: In file included from /home_nfs_robin_ib/bdolbeaur/gcc-8.2.0-full+isl/lib/gcc/x86_64-pc-linux-gnu/8.2.0/include/immintrin.h:43,
smult.c: from gfe4x.h:5,
smult.c: from smult.c:3:
smult.c: smult.c: In function 'gfe4x_mulconst':
smult.c: /home_nfs_robin_ib/bdolbeaur/gcc-8.2.0-full+isl/lib/gcc/x86_64-pc-linux-gnu/8.2.0/include/avx2intrin.h:126:1: error: inlining failed in call to always_inline '_mm256_add_epi64': target specific option mismatch
smult.c: _mm256_add_epi64 (__m256i __A, __m256i __B)
smult.c: ^~~~~~~~~~~~~~~~
smult.c: smult.c:45:10: note: called from here
smult.c: t2 = _mm256_add_epi64(t2,_mm256_srli_epi64(t1,25)); t1 &= mask25;
smult.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
smult.c: In file included from /home_nfs_robin_ib/bdolbeaur/gcc-8.2.0-full+isl/lib/gcc/x86_64-pc-linux-gnu/8.2.0/include/immintrin.h:43,
smult.c: from gfe4x.h:5,
smult.c: from smult.c:3:
smult.c: /home_nfs_robin_ib/bdolbeaur/gcc-8.2.0-full+isl/lib/gcc/x86_64-pc-linux-gnu/8.2.0/include/avx2intrin.h:787:1: error: inlining failed in call to always_inline '_mm256_srli_epi64': target specific option mismatch
smult.c: _mm256_srli_epi64 (__m256i __A, int __B)
smult.c: ^~~~~~~~~~~~~~~~~
smult.c: smult.c:45:10: note: called from here
smult.c: t2 = _mm256_add_epi64(t2,_mm256_srli_epi64(t1,25)); t1 &= mask25;
smult.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
smult.c: In file included from /home_nfs_robin_ib/bdolbeaur/gcc-8.2.0-full+isl/lib/gcc/x86_64-pc-linux-gnu/8.2.0/include/immintrin.h:43,
smult.c: from gfe4x.h:5,
smult.c: from smult.c:3:
smult.c: /home_nfs_robin_ib/bdolbeaur/gcc-8.2.0-full+isl/lib/gcc/x86_64-pc-linux-gnu/8.2.0/include/avx2intrin.h:126:1: error: inlining failed in call to always_inline '_mm256_add_epi64': target specific option mismatch
smult.c: _mm256_add_epi64 (__m256i __A, __m256i __B)
smult.c: ^~~~~~~~~~~~~~~~
smult.c: ...

Number of similar (compiler,implementation) pairs: 8, namely:
CompilerImplementations
gcc -m64 -march=core-avx-i -O2 -fomit-frame-pointer avx2int
gcc -m64 -march=core-avx-i -O3 -fomit-frame-pointer avx2int
gcc -m64 -march=core-avx-i -O -fomit-frame-pointer avx2int
gcc -m64 -march=core-avx-i -Os -fomit-frame-pointer avx2int
gcc -m64 -march=corei7-avx -O2 -fomit-frame-pointer avx2int
gcc -m64 -march=corei7-avx -O3 -fomit-frame-pointer avx2int
gcc -m64 -march=corei7-avx -O -fomit-frame-pointer avx2int
gcc -m64 -march=corei7-avx -Os -fomit-frame-pointer avx2int