Implementation notes: amd64, genji202, crypto_scalarmult/kummer

Computer: genji202
Architecture: amd64
CPU ID: GenuineIntel-000306f2-bfebfbff
SUPERCOP version: 20171020
Operation: crypto_scalarmult
Primitive: kummer
Time     Object size  Test size  Implementation  Compiler                                   Benchmark date  SUPERCOP version
272920   ? ? ?        ? ? ?      avx2int         icc_-no-vec                                20171113        20171020
273112   ? ? ?        ? ? ?      avx2int         icc_-xCORE-AVX2_-O2_-fomit-frame-pointer   20171113        20171020
273184   ? ? ?        ? ? ?      avx2int         icc_-xCORE-AVX-I_-O2_-fomit-frame-pointer  20171113        20171020
273300   ? ? ?        ? ? ?      avx2int         icc                                        20171113        20171020
275128   ? ? ?        ? ? ?      avx2int         icc_-xCORE-AVX2_-O3_-fomit-frame-pointer   20171113        20171020
275164   ? ? ?        ? ? ?      avx2int         icc_-xCORE-AVX-I_-O3_-fomit-frame-pointer  20171113        20171020
287792   ? ? ?        ? ? ?      avx2            icc_-xCORE-AVX2_-O2_-fomit-frame-pointer   20171113        20171020
287800   ? ? ?        ? ? ?      avx2            icc_-xCORE-AVX-I_-O2_-fomit-frame-pointer  20171113        20171020
287880   ? ? ?        ? ? ?      avx2            icc_-xCORE-AVX2_-O3_-fomit-frame-pointer   20171113        20171020
288436   ? ? ?        ? ? ?      avx2            icc_-xCORE-AVX-I_-O3_-fomit-frame-pointer  20171113        20171020
288528   ? ? ?        ? ? ?      avx2            icc_-no-vec                                20171113        20171020
288736   ? ? ?        ? ? ?      avx2            icc                                        20171113        20171020
374708   ? ? ?        ? ? ?      avx             icc_-xCORE-AVX-I_-O3_-fomit-frame-pointer  20171113        20171020
374784   ? ? ?        ? ? ?      avx             icc_-xCORE-AVX-I_-O2_-fomit-frame-pointer  20171113        20171020
374788   ? ? ?        ? ? ?      avx             icc_-xCORE-AVX2_-O2_-fomit-frame-pointer   20171113        20171020
374836   ? ? ?        ? ? ?      avx             icc_-xCORE-AVX2_-O3_-fomit-frame-pointer   20171113        20171020
375140   ? ? ?        ? ? ?      avx             icc_-no-vec                                20171113        20171020
375320   ? ? ?        ? ? ?      avx             icc                                        20171113        20171020
889864   ? ? ?        ? ? ?      ref5            icc                                        20171113        20171020
903344   ? ? ?        ? ? ?      ref5u           icc_-xCORE-AVX-I_-O3_-fomit-frame-pointer  20171113        20171020
904232   ? ? ?        ? ? ?      ref5            icc_-xCORE-AVX-I_-O3_-fomit-frame-pointer  20171113        20171020
956644   ? ? ?        ? ? ?      ref5u           icc                                        20171113        20171020
985656   ? ? ?        ? ? ?      ref5            icc_-xCORE-AVX-I_-O2_-fomit-frame-pointer  20171113        20171020
1015216  ? ? ?        ? ? ?      ref5            icc_-no-vec                                20171113        20171020
1033624  ? ? ?        ? ? ?      ref5u           icc_-no-vec                                20171113        20171020
1044368  ? ? ?        ? ? ?      ref5u           icc_-xCORE-AVX-I_-O2_-fomit-frame-pointer  20171113        20171020
1088820  ? ? ?        ? ? ?      ref5u           icc_-xCORE-AVX2_-O3_-fomit-frame-pointer   20171113        20171020
1099432  ? ? ?        ? ? ?      ref5            icc_-xCORE-AVX2_-O2_-fomit-frame-pointer   20171113        20171020
1123164  ? ? ?        ? ? ?      ref5            icc_-xCORE-AVX2_-O3_-fomit-frame-pointer   20171113        20171020
1175368  ? ? ?        ? ? ?      ref5u           icc_-xCORE-AVX2_-O2_-fomit-frame-pointer   20171113        20171020

Compiler output

Implementation: crypto_scalarmult/kummer/avx2int
Compiler: cc
smult.c: In file included from smult.c:3:0:
smult.c: gfe4x.h:7:1: error: unknown type name '__m256i'
smult.c: typedef __m256i vec;
smult.c: ^
smult.c: smult.c:6:1: warning: excess elements in scalar initializer [enabled by default]
smult.c: static const vec mask25 = {0x1ffffff,0x1ffffff,0x1ffffff,0x1ffffff};
smult.c: ^
smult.c: smult.c:6:1: warning: (near initialization for 'mask25') [enabled by default]
smult.c: smult.c:6:1: warning: excess elements in scalar initializer [enabled by default]
smult.c: smult.c:6:1: warning: (near initialization for 'mask25') [enabled by default]
smult.c: smult.c:6:1: warning: excess elements in scalar initializer [enabled by default]
smult.c: smult.c:6:1: warning: (near initialization for 'mask25') [enabled by default]
smult.c: smult.c:7:1: warning: excess elements in scalar initializer [enabled by default]
smult.c: static const vec mask26 = {0x3ffffff,0x3ffffff,0x3ffffff,0x3ffffff};
smult.c: ^
smult.c: smult.c:7:1: warning: (near initialization for 'mask26') [enabled by default]
smult.c: smult.c:7:1: warning: excess elements in scalar initializer [enabled by default]
smult.c: smult.c:7:1: warning: (near initialization for 'mask26') [enabled by default]
smult.c: smult.c:7:1: warning: excess elements in scalar initializer [enabled by default]
smult.c: smult.c:7:1: warning: (near initialization for 'mask26') [enabled by default]
smult.c: smult.c:304:3: warning: braces around scalar initializer [enabled by default]
smult.c: init(0xffffffc+1,0xffffffc+1,0xffffffc+2,0xffffffc+2,0xffffffc+2,0xffffffc+2,0xffffffc+1,0xffffffc+1)
smult.c: ^
smult.c: smult.c:304:3: warning: (near initialization for 'hadamardoffset[0]') [enabled by default]
smult.c: smult.c:304:3: warning: overflow in implicit constant conversion [-Woverflow]
smult.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler Implementations
cc avx2int
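
The `unknown type name '__m256i'` error above is what a compiler reports when `gfe4x.h` reaches `typedef __m256i vec;` without the AVX vector types having been declared, which is a plausible outcome when plain `cc` runs with no AVX2 option. The warnings that follow in the log appear to cascade from that one error. Below is a minimal sketch of a header that degrades predictably instead, assuming a GCC- or Clang-compatible compiler that defines `__AVX2__` when AVX2 code generation is enabled. Only the name `vec` and the mask value `0x1ffffff` come from the log; the scalar stand-in struct and the helpers `vec_set4`, `vec_lane` and `vec_and_mask25` are hypothetical and are not part of the kummer sources.

```c
#include <stdint.h>

#if defined(__AVX2__)
#include <immintrin.h>

typedef __m256i vec;  /* same typedef the log shows at gfe4x.h:7 */

/* Hypothetical helper: build a vector from four 64-bit lanes (lane 0 first). */
static vec vec_set4(uint64_t a, uint64_t b, uint64_t c, uint64_t d) {
    /* _mm256_set_epi64x takes the highest lane as its first argument. */
    return _mm256_set_epi64x((long long)d, (long long)c,
                             (long long)b, (long long)a);
}

/* Hypothetical helper: read back lane i (0..3). */
static uint64_t vec_lane(vec v, int i) {
    uint64_t out[4];
    _mm256_storeu_si256((__m256i *)out, v);
    return out[i];
}

/* Keep the low 25 bits of every lane, like the mask25 constant in smult.c. */
static vec vec_and_mask25(vec v) {
    return _mm256_and_si256(v, _mm256_set1_epi64x(0x1ffffff));
}

#else

/* Assumption: a scalar stand-in so the file still compiles without AVX2. */
typedef struct { uint64_t lane[4]; } vec;

static vec vec_set4(uint64_t a, uint64_t b, uint64_t c, uint64_t d) {
    vec v = {{a, b, c, d}};
    return v;
}

static uint64_t vec_lane(vec v, int i) {
    return v.lane[i];
}

static vec vec_and_mask25(vec v) {
    for (int i = 0; i < 4; i++)
        v.lane[i] &= 0x1ffffff;
    return v;
}

#endif
```

A benchmark harness would more likely want an `#error` in the fallback branch so that a non-AVX2 build fails with one clear message; the scalar branch is shown here only to keep the sketch compilable everywhere.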

Compiler output

Implementation: crypto_scalarmult/kummer/avx
Compiler: cc
try.c: /home_nfs_robin_ib/bdolbeaur/supercop-20171020/supercop-data/genji202/amd64/lib/knownrandombytes.o: In function `randombytes':
try.c: knownrandombytes.c:(.text+0x...): undefined reference to `_intel_fast_memcpy'
try.c: knownrandombytes.c:(.text+0x...): undefined reference to `_intel_fast_memset'
try.c: /home_nfs_robin_ib/bdolbeaur/supercop-20171020/supercop-data/genji202/amd64/lib/libsupercop.a(crypto_stream_chacha20_dolbeau_amd64_avx2-api.o): In function `crypto_stream_chacha20_dolbeau_amd64_avx2':
try.c: api.c:(.text+0x...): undefined reference to `__intel_avx_rep_memset'
try.c: /home_nfs_robin_ib/bdolbeaur/supercop-20171020/supercop-data/genji202/amd64/lib/libsupercop.a(crypto_stream_chacha20_dolbeau_amd64_avx2-chacha.o): In function `crypto_stream_chacha20_dolbeau_amd64_avx2_ECRYPT_keystream_bytes':
try.c: chacha.c:(.text+0x...): undefined reference to `__intel_avx_rep_memset'
try.c: collect2: error: ld returned 1 exit status

Number of similar (compiler,implementation) pairs: 440, namely:
Compiler Implementations
cc avx avx2 ref5 ref5u
gcc avx avx2 ref5 ref5u
gcc -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -fno-schedule-insns -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -fno-schedule-insns -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -fno-schedule-insns -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -fno-schedule-insns -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops avx avx2 ref5 ref5u
gcc -funroll-loops -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -m64 -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -m64 -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -m64 -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -m64 -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -m64 -march=barcelona -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -m64 -march=barcelona -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -m64 -march=barcelona -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -m64 -march=barcelona -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -m64 -march=k8 -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -m64 -march=k8 -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -m64 -march=k8 -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -m64 -march=k8 -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -m64 -march=nocona -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -m64 -march=nocona -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -m64 -march=nocona -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -m64 -march=nocona -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -march=barcelona -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -march=barcelona -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -march=barcelona -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -march=barcelona -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -march=k8 -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -march=k8 -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -march=k8 -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -march=k8 -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -march=nocona -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -march=nocona -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -march=nocona -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -funroll-loops -march=nocona -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=core-avx-i -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=core-avx-i -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=core-avx-i -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=core-avx-i -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=core-avx2 -O2 -fomit-frame-pointer avx avx2 avx2int ref5 ref5u
gcc -m64 -march=core-avx2 -O3 -fomit-frame-pointer avx avx2 avx2int ref5 ref5u
gcc -m64 -march=core-avx2 -O -fomit-frame-pointer avx avx2 avx2int ref5 ref5u
gcc -m64 -march=core-avx2 -Os -fomit-frame-pointer avx avx2 avx2int ref5 ref5u
gcc -m64 -march=core2 -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=core2 -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=core2 -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=core2 -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=core2 -msse4.1 -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=core2 -msse4.1 -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=core2 -msse4.1 -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=core2 -msse4.1 -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=core2 -msse4 -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=core2 -msse4 -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=core2 -msse4 -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=core2 -msse4 -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=corei7-avx -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=corei7-avx -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=corei7-avx -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=corei7-avx -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=corei7 -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=corei7 -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=corei7 -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=corei7 -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=k8 -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=k8 -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=k8 -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=k8 -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=native -mtune=native -O2 -fomit-frame-pointer avx avx2 avx2int ref5 ref5u
gcc -m64 -march=native -mtune=native -O3 -fomit-frame-pointer avx avx2 avx2int ref5 ref5u
gcc -m64 -march=native -mtune=native -O -fomit-frame-pointer avx avx2 avx2int ref5 ref5u
gcc -m64 -march=native -mtune=native -Os -fomit-frame-pointer avx avx2 avx2int ref5 ref5u
gcc -m64 -march=nocona -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=nocona -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=nocona -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=nocona -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -march=barcelona -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -march=barcelona -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -march=barcelona -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -march=barcelona -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -march=k8 -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -march=k8 -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -march=k8 -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -march=k8 -Os -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv avx avx2 avx2int ref5 ref5u
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv avx avx2 avx2int ref5 ref5u
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv avx avx2 avx2int ref5 ref5u
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv avx avx2 avx2int ref5 ref5u
gcc -march=nocona -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -march=nocona -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -march=nocona -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -march=nocona -Os -fomit-frame-pointer avx avx2 ref5 ref5u
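
The undefined references in this group name `_intel_fast_memcpy`, `_intel_fast_memset` and `__intel_avx_rep_memset`, all reached through `knownrandombytes.o` and `libsupercop.a`. A likely reading is that those support objects were built with icc and are now being linked by cc or gcc without the Intel runtime that provides these helpers. The usual remedy is to link that runtime or to rebuild the support library with the same compiler. Purely to illustrate what the linker is missing, the sketch below defines forwarding shims. The signatures are an assumption (they mirror `memcpy` and `memset`) and are not taken from Intel documentation.

```c
#include <stddef.h>
#include <string.h>

/* Assumption: each helper has the same contract as its libc counterpart.
 * The symbol names are copied from the linker errors in the log. */

void *_intel_fast_memcpy(void *dst, const void *src, size_t n) {
    return memcpy(dst, src, n);
}

void *_intel_fast_memset(void *dst, int c, size_t n) {
    return memset(dst, c, n);
}

void *__intel_avx_rep_memset(void *dst, int c, size_t n) {
    return memset(dst, c, n);
}
```

Shims like these hide the compiler mismatch rather than resolve it, so they are better suited to diagnosing a broken link step than to producing benchmark numbers.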

Compiler output

Implementation: crypto_scalarmult/kummer/avx2int
Compiler: gcc
smult.c: smult.c: In function 'gfe4x_mulconst':
smult.c: smult.c:36:6: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
smult.c: t0 = _mm256_mul_epi32(a->v[0],*b);
smult.c: ~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
smult.c: In file included from /home_nfs/bdolbeaur/gcc-7.2.0-full+isl/lib/gcc/x86_64-pc-linux-gnu/7.2.0/include/immintrin.h:43:0,
smult.c: from gfe4x.h:5,
smult.c: from smult.c:3:
smult.c: /home_nfs/bdolbeaur/gcc-7.2.0-full+isl/lib/gcc/x86_64-pc-linux-gnu/7.2.0/include/avx2intrin.h:126:1: error: inlining failed in call to always_inline '_mm256_add_epi64': target specific option mismatch
smult.c: _mm256_add_epi64 (__m256i __A, __m256i __B)
smult.c: ^~~~~~~~~~~~~~~~
smult.c: smult.c:45:8: note: called from here
smult.c: t2 = _mm256_add_epi64(t2,_mm256_srli_epi64(t1,25)); t1 &= mask25;
smult.c: ~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
smult.c: In file included from /home_nfs/bdolbeaur/gcc-7.2.0-full+isl/lib/gcc/x86_64-pc-linux-gnu/7.2.0/include/immintrin.h:43:0,
smult.c: from gfe4x.h:5,
smult.c: from smult.c:3:
smult.c: /home_nfs/bdolbeaur/gcc-7.2.0-full+isl/lib/gcc/x86_64-pc-linux-gnu/7.2.0/include/avx2intrin.h:787:1: error: inlining failed in call to always_inline '_mm256_srli_epi64': target specific option mismatch
smult.c: _mm256_srli_epi64 (__m256i __A, int __B)
smult.c: ^~~~~~~~~~~~~~~~~
smult.c: smult.c:45:10: note: called from here
smult.c: t2 = _mm256_add_epi64(t2,_mm256_srli_epi64(t1,25)); t1 &= mask25;
smult.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
smult.c: In file included from /home_nfs/bdolbeaur/gcc-7.2.0-full+isl/lib/gcc/x86_64-pc-linux-gnu/7.2.0/include/immintrin.h:43:0,
smult.c: from gfe4x.h:5,
smult.c: from smult.c:3:
smult.c: ...

Number of similar (compiler,implementation) pairs: 86, namely:
Compiler Implementations
gcc avx2int
gcc -O2 -fomit-frame-pointer avx2int
gcc -O3 -fomit-frame-pointer avx2int
gcc -O -fomit-frame-pointer avx2int
gcc -Os -fomit-frame-pointer avx2int
gcc -fno-schedule-insns -O2 -fomit-frame-pointer avx2int
gcc -fno-schedule-insns -O3 -fomit-frame-pointer avx2int
gcc -fno-schedule-insns -O -fomit-frame-pointer avx2int
gcc -fno-schedule-insns -Os -fomit-frame-pointer avx2int
gcc -funroll-loops avx2int
gcc -funroll-loops -O2 -fomit-frame-pointer avx2int
gcc -funroll-loops -O3 -fomit-frame-pointer avx2int
gcc -funroll-loops -O -fomit-frame-pointer avx2int
gcc -funroll-loops -Os -fomit-frame-pointer avx2int
gcc -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer avx2int
gcc -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer avx2int
gcc -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer avx2int
gcc -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer avx2int
gcc -funroll-loops -m64 -O2 -fomit-frame-pointer avx2int
gcc -funroll-loops -m64 -O3 -fomit-frame-pointer avx2int
gcc -funroll-loops -m64 -O -fomit-frame-pointer avx2int
gcc -funroll-loops -m64 -Os -fomit-frame-pointer avx2int
gcc -funroll-loops -m64 -march=barcelona -O2 -fomit-frame-pointer avx2int
gcc -funroll-loops -m64 -march=barcelona -O3 -fomit-frame-pointer avx2int
gcc -funroll-loops -m64 -march=barcelona -O -fomit-frame-pointer avx2int
gcc -funroll-loops -m64 -march=barcelona -Os -fomit-frame-pointer avx2int
gcc -funroll-loops -m64 -march=k8 -O2 -fomit-frame-pointer avx2int
gcc -funroll-loops -m64 -march=k8 -O3 -fomit-frame-pointer avx2int
gcc -funroll-loops -m64 -march=k8 -O -fomit-frame-pointer avx2int
gcc -funroll-loops -m64 -march=k8 -Os -fomit-frame-pointer avx2int
gcc -funroll-loops -m64 -march=nocona -O2 -fomit-frame-pointer avx2int
gcc -funroll-loops -m64 -march=nocona -O3 -fomit-frame-pointer avx2int
gcc -funroll-loops -m64 -march=nocona -O -fomit-frame-pointer avx2int
gcc -funroll-loops -m64 -march=nocona -Os -fomit-frame-pointer avx2int
gcc -funroll-loops -march=barcelona -O2 -fomit-frame-pointer avx2int
gcc -funroll-loops -march=barcelona -O3 -fomit-frame-pointer avx2int
gcc -funroll-loops -march=barcelona -O -fomit-frame-pointer avx2int
gcc -funroll-loops -march=barcelona -Os -fomit-frame-pointer avx2int
gcc -funroll-loops -march=k8 -O2 -fomit-frame-pointer avx2int
gcc -funroll-loops -march=k8 -O3 -fomit-frame-pointer avx2int
gcc -funroll-loops -march=k8 -O -fomit-frame-pointer avx2int
gcc -funroll-loops -march=k8 -Os -fomit-frame-pointer avx2int
gcc -funroll-loops -march=nocona -O2 -fomit-frame-pointer avx2int
gcc -funroll-loops -march=nocona -O3 -fomit-frame-pointer avx2int
gcc -funroll-loops -march=nocona -O -fomit-frame-pointer avx2int
gcc -funroll-loops -march=nocona -Os -fomit-frame-pointer avx2int
gcc -m64 -O2 -fomit-frame-pointer avx2int
gcc -m64 -O3 -fomit-frame-pointer avx2int
gcc -m64 -O -fomit-frame-pointer avx2int
gcc -m64 -Os -fomit-frame-pointer avx2int
gcc -m64 -march=core2 -O2 -fomit-frame-pointer avx2int
gcc -m64 -march=core2 -O3 -fomit-frame-pointer avx2int
gcc -m64 -march=core2 -O -fomit-frame-pointer avx2int
gcc -m64 -march=core2 -Os -fomit-frame-pointer avx2int
gcc -m64 -march=core2 -msse4.1 -O2 -fomit-frame-pointer avx2int
gcc -m64 -march=core2 -msse4.1 -O3 -fomit-frame-pointer avx2int
gcc -m64 -march=core2 -msse4.1 -O -fomit-frame-pointer avx2int
gcc -m64 -march=core2 -msse4.1 -Os -fomit-frame-pointer avx2int
gcc -m64 -march=core2 -msse4 -O2 -fomit-frame-pointer avx2int
gcc -m64 -march=core2 -msse4 -O3 -fomit-frame-pointer avx2int
gcc -m64 -march=core2 -msse4 -O -fomit-frame-pointer avx2int
gcc -m64 -march=core2 -msse4 -Os -fomit-frame-pointer avx2int
gcc -m64 -march=corei7 -O2 -fomit-frame-pointer avx2int
gcc -m64 -march=corei7 -O3 -fomit-frame-pointer avx2int
gcc -m64 -march=corei7 -O -fomit-frame-pointer avx2int
gcc -m64 -march=corei7 -Os -fomit-frame-pointer avx2int
gcc -m64 -march=k8 -O2 -fomit-frame-pointer avx2int
gcc -m64 -march=k8 -O3 -fomit-frame-pointer avx2int
gcc -m64 -march=k8 -O -fomit-frame-pointer avx2int
gcc -m64 -march=k8 -Os -fomit-frame-pointer avx2int
gcc -m64 -march=nocona -O2 -fomit-frame-pointer avx2int
gcc -m64 -march=nocona -O3 -fomit-frame-pointer avx2int
gcc -m64 -march=nocona -O -fomit-frame-pointer avx2int
gcc -m64 -march=nocona -Os -fomit-frame-pointer avx2int
gcc -march=barcelona -O2 -fomit-frame-pointer avx2int
gcc -march=barcelona -O3 -fomit-frame-pointer avx2int
gcc -march=barcelona -O -fomit-frame-pointer avx2int
gcc -march=barcelona -Os -fomit-frame-pointer avx2int
gcc -march=k8 -O2 -fomit-frame-pointer avx2int
gcc -march=k8 -O3 -fomit-frame-pointer avx2int
gcc -march=k8 -O -fomit-frame-pointer avx2int
gcc -march=k8 -Os -fomit-frame-pointer avx2int
gcc -march=nocona -O2 -fomit-frame-pointer avx2int
gcc -march=nocona -O3 -fomit-frame-pointer avx2int
gcc -march=nocona -O -fomit-frame-pointer avx2int
gcc -march=nocona -Os -fomit-frame-pointer avx2int
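
`target specific option mismatch` is the diagnostic GCC gives when an `always_inline` intrinsic such as `_mm256_add_epi64` is called from a function that is compiled without the matching instruction-set extension. None of the option sets listed in this group turns AVX2 on. The sketch below shows one way to compile a single function for AVX2 regardless of the command line and to select it at run time. It assumes GCC (such as the 7.2.0 seen in the log paths) or Clang. The function names are hypothetical; only the expression `t2 + (t1 >> 25)` mirrors `smult.c:45`.

```c
#include <stdint.h>

/* Scalar reference: t2[i] += t1[i] >> 25 on four 64-bit lanes. */
static void add_carry25_scalar(uint64_t t2[4], const uint64_t t1[4]) {
    for (int i = 0; i < 4; i++)
        t2[i] += t1[i] >> 25;
}

#if defined(__x86_64__) || defined(__i386__)
#include <immintrin.h>

/* The target attribute enables AVX2 for this function only, so the
 * intrinsics can be inlined even when -mavx2 is not on the command line. */
__attribute__((target("avx2")))
static void add_carry25_avx2(uint64_t t2[4], const uint64_t t1[4]) {
    __m256i a = _mm256_loadu_si256((const __m256i *)t2);
    __m256i b = _mm256_loadu_si256((const __m256i *)t1);
    a = _mm256_add_epi64(a, _mm256_srli_epi64(b, 25));
    _mm256_storeu_si256((__m256i *)t2, a);
}

/* Hypothetical dispatcher: use AVX2 only if the running CPU reports it. */
void add_carry25(uint64_t t2[4], const uint64_t t1[4]) {
    __builtin_cpu_init();
    if (__builtin_cpu_supports("avx2"))
        add_carry25_avx2(t2, t1);
    else
        add_carry25_scalar(t2, t1);
}

#else

/* Non-x86 builds fall back to the scalar path. */
void add_carry25(uint64_t t2[4], const uint64_t t1[4]) {
    add_carry25_scalar(t2, t1);
}

#endif
```

The vector values stay inside the AVX2 function and only pointers cross the call boundary, which also avoids the `-Wpsabi` warning about AVX vector returns shown in the log. For a benchmark, the simpler route is to build this implementation only with option sets that enable AVX2, such as the `-march=core-avx2` entries listed elsewhere in this report.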

Compiler output

Implementation: crypto_scalarmult/kummer/avx2int
Compiler: gcc -m64 -march=barcelona -O2 -fomit-frame-pointer
smult.c: smult.c: In function 'gfe4x_mulconst':
smult.c: smult.c:36:6: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
smult.c: t0 = _mm256_mul_epi32(a->v[0],*b);
smult.c: ~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
smult.c: In file included from /home_nfs/bdolbeaur/gcc-7.2.0-full+isl/lib/gcc/x86_64-pc-linux-gnu/7.2.0/include/immintrin.h:43:0,
smult.c: from gfe4x.h:5,
smult.c: from smult.c:3:
smult.c: /home_nfs/bdolbeaur/gcc-7.2.0-full+isl/lib/gcc/x86_64-pc-linux-gnu/7.2.0/include/avx2intrin.h:126:1: error: inlining failed in call to always_inline '_mm256_add_epi64': target specific option mismatch
smult.c: _mm256_add_epi64 (__m256i __A, __m256i __B)
smult.c: ^~~~~~~~~~~~~~~~
smult.c: smult.c:45:8: note: called from here
smult.c: t2 = _mm256_add_epi64(t2,_mm256_srli_epi64(t1,25)); t1 &= mask25;
smult.c: ~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
smult.c: In file included from /home_nfs/bdolbeaur/gcc-7.2.0-full+isl/lib/gcc/x86_64-pc-linux-gnu/7.2.0/include/immintrin.h:43:0,
smult.c: from gfe4x.h:5,
smult.c: from smult.c:3:
smult.c: /home_nfs/bdolbeaur/gcc-7.2.0-full+isl/lib/gcc/x86_64-pc-linux-gnu/7.2.0/include/avx2intrin.h:787:1: error: inlining failed in call to always_inline '_mm256_srli_epi64': target specific option mismatch
smult.c: _mm256_srli_epi64 (__m256i __A, int __B)
smult.c: ^~~~~~~~~~~~~~~~~
smult.c: smult.c:45:10: note: called from here
smult.c: t2 = _mm256_add_epi64(t2,_mm256_srli_epi64(t1,25)); t1 &= mask25;
smult.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
smult.c: In file included from /home_nfs/bdolbeaur/gcc-7.2.0-full+isl/lib/gcc/x86_64-pc-linux-gnu/7.2.0/include/immintrin.h:43:0,
smult.c: from gfe4x.h:5,
smult.c: from smult.c:3:
smult.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
Compiler Implementations
gcc -m64 -march=barcelona -O2 -fomit-frame-pointer avx2int
gcc -m64 -march=barcelona -O3 -fomit-frame-pointer avx2int
gcc -m64 -march=barcelona -O -fomit-frame-pointer avx2int
gcc -m64 -march=barcelona -Os -fomit-frame-pointer avx2int

Compiler output

Implementation: crypto_scalarmult/kummer/avx
Compiler: gcc -m64 -march=barcelona -O2 -fomit-frame-pointer
try.c: /home_nfs_robin_ib/bdolbeaur/supercop-20171020/supercop-data/genji202/amd64/lib/knownrandombytes.o: In function `randombytes':
try.c: knownrandombytes.c:(.text+0x...): undefined reference to `_intel_fast_memcpy'
try.c: knownrandombytes.c:(.text+0x...): undefined reference to `_intel_fast_memset'
try.c: /home_nfs_robin_ib/bdolbeaur/supercop-20171020/supercop-data/genji202/amd64/lib/libsupercop.a(crypto_stream_chacha20_dolbeau_amd64_avx2-api.o): In function `crypto_stream_chacha20_dolbeau_amd64_avx2':
try.c: api.c:(.text+0x...): undefined reference to `__intel_avx_rep_memset'
try.c: /home_nfs_robin_ib/bdolbeaur/supercop-20171020/supercop-data/genji202/amd64/lib/libsupercop.a(crypto_stream_chacha20_dolbeau_amd64_avx2-chacha.o): In function `crypto_stream_chacha20_dolbeau_amd64_avx2_ECRYPT_keystream_bytes':
try.c: chacha.c:(.text+0x...): undefined reference to `__intel_avx_rep_memset'
try.c: collect2: error: ld returned 1 exit status

Number of similar (compiler,implementation) pairs: 16, namely:
Compiler Implementations
gcc -m64 -march=barcelona -O2 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=barcelona -O3 -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=barcelona -O -fomit-frame-pointer avx avx2 ref5 ref5u
gcc -m64 -march=barcelona -Os -fomit-frame-pointer avx avx2 ref5 ref5u

Compiler output

Implementation: crypto_scalarmult/kummer/avx2int
Compiler: gcc -m64 -march=core-avx-i -O2 -fomit-frame-pointer
smult.c: In file included from /home_nfs/bdolbeaur/gcc-7.2.0-full+isl/lib/gcc/x86_64-pc-linux-gnu/7.2.0/include/immintrin.h:43:0,
smult.c: from gfe4x.h:5,
smult.c: from smult.c:3:
smult.c: smult.c: In function 'gfe4x_mulconst':
smult.c: /home_nfs/bdolbeaur/gcc-7.2.0-full+isl/lib/gcc/x86_64-pc-linux-gnu/7.2.0/include/avx2intrin.h:126:1: error: inlining failed in call to always_inline '_mm256_add_epi64': target specific option mismatch
smult.c: _mm256_add_epi64 (__m256i __A, __m256i __B)
smult.c: ^~~~~~~~~~~~~~~~
smult.c: smult.c:45:8: note: called from here
smult.c: t2 = _mm256_add_epi64(t2,_mm256_srli_epi64(t1,25)); t1 &= mask25;
smult.c: ~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
smult.c: In file included from /home_nfs/bdolbeaur/gcc-7.2.0-full+isl/lib/gcc/x86_64-pc-linux-gnu/7.2.0/include/immintrin.h:43:0,
smult.c: from gfe4x.h:5,
smult.c: from smult.c:3:
smult.c: /home_nfs/bdolbeaur/gcc-7.2.0-full+isl/lib/gcc/x86_64-pc-linux-gnu/7.2.0/include/avx2intrin.h:787:1: error: inlining failed in call to always_inline '_mm256_srli_epi64': target specific option mismatch
smult.c: _mm256_srli_epi64 (__m256i __A, int __B)
smult.c: ^~~~~~~~~~~~~~~~~
smult.c: smult.c:45:10: note: called from here
smult.c: t2 = _mm256_add_epi64(t2,_mm256_srli_epi64(t1,25)); t1 &= mask25;
smult.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
smult.c: In file included from /home_nfs/bdolbeaur/gcc-7.2.0-full+isl/lib/gcc/x86_64-pc-linux-gnu/7.2.0/include/immintrin.h:43:0,
smult.c: from gfe4x.h:5,
smult.c: from smult.c:3:
smult.c: /home_nfs/bdolbeaur/gcc-7.2.0-full+isl/lib/gcc/x86_64-pc-linux-gnu/7.2.0/include/avx2intrin.h:126:1: error: inlining failed in call to always_inline '_mm256_add_epi64': target specific option mismatch
smult.c: _mm256_add_epi64 (__m256i __A, __m256i __B)
smult.c: ^~~~~~~~~~~~~~~~
smult.c: ...

Number of similar (compiler,implementation) pairs: 8, namely:
Compiler Implementations
gcc -m64 -march=core-avx-i -O2 -fomit-frame-pointer avx2int
gcc -m64 -march=core-avx-i -O3 -fomit-frame-pointer avx2int
gcc -m64 -march=core-avx-i -O -fomit-frame-pointer avx2int
gcc -m64 -march=core-avx-i -Os -fomit-frame-pointer avx2int
gcc -m64 -march=corei7-avx -O2 -fomit-frame-pointer avx2int
gcc -m64 -march=corei7-avx -O3 -fomit-frame-pointer avx2int
gcc -m64 -march=corei7-avx -O -fomit-frame-pointer avx2int
gcc -m64 -march=corei7-avx -Os -fomit-frame-pointer avx2int