Implementation notes: amd64, manny1024, crypto_sign/lattisigns512

Computer: manny1024
Architecture: amd64
CPU ID: GenuineIntel-00050654-bfebfbff
SUPERCOP version: 20170904
Operation: crypto_sign
Primitive: lattisigns512
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
485794? ? ?? ? ?avxicc_-xCORE-AVX512_-O3_-fomit-frame-pointer2017091820170904
487822? ? ?? ? ?avxicc_-xCORE-AVX512_-O2_-fomit-frame-pointer2017091820170904
493998? ? ?? ? ?avxicc_-xCOMMON-AVX512_-O2_-fomit-frame-pointer2017091820170904
494316? ? ?? ? ?avxicc_-xCORE-AVX2_-O2_-fomit-frame-pointer2017091820170904
496010? ? ?? ? ?avxicc_-xCOMMON-AVX512_-O3_-fomit-frame-pointer2017091820170904
500930? ? ?? ? ?avxicc_-xCORE-AVX2_-O3_-fomit-frame-pointer2017091820170904
556570? ? ?? ? ?avxicc2017091820170904
572880? ? ?? ? ?avxicc_-xCORE-AVX-I_-O3_-fomit-frame-pointer2017091820170904
579748? ? ?? ? ?avxicc_-no-vec2017091820170904
597506? ? ?? ? ?avxicc_-xCORE-AVX-I_-O2_-fomit-frame-pointer2017091820170904

Compiler output

Implementation: crypto_sign/lattisigns512/avx
Compiler: cc
ntt_transform.c: ntt_transform.c: In function 'ntt_transform':
ntt_transform.c: ntt_transform.c:24:3: error: unknown type name '__m256d'
ntt_transform.c: __m256d vt,vo0,vo10,vo11,vo20,vo21,vo22,vo23,vc,vp,vpinv,neg2,neg4;
ntt_transform.c: ^
ntt_transform.c: ntt_transform.c:25:3: error: unknown type name '__m256d'
ntt_transform.c: __m256d vx0,vx1,vx2,vx3,vx4,vx5,vx6,vx7;
ntt_transform.c: ^

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
cc avx

Compiler output

Implementation: crypto_sign/lattisigns512/avx
Compiler: gcc
ntt_transform.c: ntt_transform.c: In function 'ntt_transform':
ntt_transform.c: ntt_transform.c:27:9: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
ntt_transform.c: vpinv = _mm256_set_pd(PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE);
ntt_transform.c: ~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
ntt_transform.c: In file included from /home_nfs/bdolbeaur/gcc-6.3.0-full+isl-inca/lib/gcc/x86_64-pc-linux-gnu/6.3.0/include/immintrin.h:41:0,
ntt_transform.c: from ntt_transform.c:9:
ntt_transform.c: /home_nfs/bdolbeaur/gcc-6.3.0-full+isl-inca/lib/gcc/x86_64-pc-linux-gnu/6.3.0/include/avxintrin.h:834:1: error: inlining failed in call to always_inline '_mm256_load_pd': target specific option mismatch
ntt_transform.c: _mm256_load_pd (double const *__P)
ntt_transform.c: ^~~~~~~~~~~~~~
ntt_transform.c: ntt_transform.c:35:8: note: called from here
ntt_transform.c: neg4 = _mm256_load_pd(_neg4);
ntt_transform.c: ~~~~~^~~~~~~~~~~~~~~~~~~~~~~
ntt_transform.c: In file included from /home_nfs/bdolbeaur/gcc-6.3.0-full+isl-inca/lib/gcc/x86_64-pc-linux-gnu/6.3.0/include/immintrin.h:41:0,
ntt_transform.c: from ntt_transform.c:9:
ntt_transform.c: /home_nfs/bdolbeaur/gcc-6.3.0-full+isl-inca/lib/gcc/x86_64-pc-linux-gnu/6.3.0/include/avxintrin.h:834:1: error: inlining failed in call to always_inline '_mm256_load_pd': target specific option mismatch
ntt_transform.c: _mm256_load_pd (double const *__P)
ntt_transform.c: ^~~~~~~~~~~~~~
ntt_transform.c: ntt_transform.c:34:8: note: called from here
ntt_transform.c: neg2 = _mm256_load_pd(_neg2);
ntt_transform.c: ~~~~~^~~~~~~~~~~~~~~~~~~~~~~
ntt_transform.c: In file included from /home_nfs/bdolbeaur/gcc-6.3.0-full+isl-inca/lib/gcc/x86_64-pc-linux-gnu/6.3.0/include/immintrin.h:41:0,
ntt_transform.c: from ntt_transform.c:9:
ntt_transform.c: /home_nfs/bdolbeaur/gcc-6.3.0-full+isl-inca/lib/gcc/x86_64-pc-linux-gnu/6.3.0/include/avxintrin.h:834:1: error: inlining failed in call to always_inline '_mm256_load_pd': target specific option mismatch
ntt_transform.c: _mm256_load_pd (double const *__P)
ntt_transform.c: ^~~~~~~~~~~~~~
ntt_transform.c: ...

Number of similar (compiler,implementation) pairs: 86, namely:
CompilerImplementations
gcc avx
gcc -O2 -fomit-frame-pointer avx
gcc -O3 -fomit-frame-pointer avx
gcc -O -fomit-frame-pointer avx
gcc -Os -fomit-frame-pointer avx
gcc -fno-schedule-insns -O2 -fomit-frame-pointer avx
gcc -fno-schedule-insns -O3 -fomit-frame-pointer avx
gcc -fno-schedule-insns -O -fomit-frame-pointer avx
gcc -fno-schedule-insns -Os -fomit-frame-pointer avx
gcc -funroll-loops avx
gcc -funroll-loops -O2 -fomit-frame-pointer avx
gcc -funroll-loops -O3 -fomit-frame-pointer avx
gcc -funroll-loops -O -fomit-frame-pointer avx
gcc -funroll-loops -Os -fomit-frame-pointer avx
gcc -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer avx
gcc -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer avx
gcc -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer avx
gcc -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer avx
gcc -funroll-loops -m64 -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m64 -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m64 -O -fomit-frame-pointer avx
gcc -funroll-loops -m64 -Os -fomit-frame-pointer avx
gcc -funroll-loops -m64 -march=barcelona -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m64 -march=barcelona -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m64 -march=barcelona -O -fomit-frame-pointer avx
gcc -funroll-loops -m64 -march=barcelona -Os -fomit-frame-pointer avx
gcc -funroll-loops -m64 -march=k8 -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m64 -march=k8 -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m64 -march=k8 -O -fomit-frame-pointer avx
gcc -funroll-loops -m64 -march=k8 -Os -fomit-frame-pointer avx
gcc -funroll-loops -m64 -march=nocona -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m64 -march=nocona -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m64 -march=nocona -O -fomit-frame-pointer avx
gcc -funroll-loops -m64 -march=nocona -Os -fomit-frame-pointer avx
gcc -funroll-loops -march=barcelona -O2 -fomit-frame-pointer avx
gcc -funroll-loops -march=barcelona -O3 -fomit-frame-pointer avx
gcc -funroll-loops -march=barcelona -O -fomit-frame-pointer avx
gcc -funroll-loops -march=barcelona -Os -fomit-frame-pointer avx
gcc -funroll-loops -march=k8 -O2 -fomit-frame-pointer avx
gcc -funroll-loops -march=k8 -O3 -fomit-frame-pointer avx
gcc -funroll-loops -march=k8 -O -fomit-frame-pointer avx
gcc -funroll-loops -march=k8 -Os -fomit-frame-pointer avx
gcc -funroll-loops -march=nocona -O2 -fomit-frame-pointer avx
gcc -funroll-loops -march=nocona -O3 -fomit-frame-pointer avx
gcc -funroll-loops -march=nocona -O -fomit-frame-pointer avx
gcc -funroll-loops -march=nocona -Os -fomit-frame-pointer avx
gcc -m64 -O2 -fomit-frame-pointer avx
gcc -m64 -O3 -fomit-frame-pointer avx
gcc -m64 -O -fomit-frame-pointer avx
gcc -m64 -Os -fomit-frame-pointer avx
gcc -m64 -march=core2 -O2 -fomit-frame-pointer avx
gcc -m64 -march=core2 -O3 -fomit-frame-pointer avx
gcc -m64 -march=core2 -O -fomit-frame-pointer avx
gcc -m64 -march=core2 -Os -fomit-frame-pointer avx
gcc -m64 -march=core2 -msse4.1 -O2 -fomit-frame-pointer avx
gcc -m64 -march=core2 -msse4.1 -O3 -fomit-frame-pointer avx
gcc -m64 -march=core2 -msse4.1 -O -fomit-frame-pointer avx
gcc -m64 -march=core2 -msse4.1 -Os -fomit-frame-pointer avx
gcc -m64 -march=core2 -msse4 -O2 -fomit-frame-pointer avx
gcc -m64 -march=core2 -msse4 -O3 -fomit-frame-pointer avx
gcc -m64 -march=core2 -msse4 -O -fomit-frame-pointer avx
gcc -m64 -march=core2 -msse4 -Os -fomit-frame-pointer avx
gcc -m64 -march=corei7 -O2 -fomit-frame-pointer avx
gcc -m64 -march=corei7 -O3 -fomit-frame-pointer avx
gcc -m64 -march=corei7 -O -fomit-frame-pointer avx
gcc -m64 -march=corei7 -Os -fomit-frame-pointer avx
gcc -m64 -march=k8 -O2 -fomit-frame-pointer avx
gcc -m64 -march=k8 -O3 -fomit-frame-pointer avx
gcc -m64 -march=k8 -O -fomit-frame-pointer avx
gcc -m64 -march=k8 -Os -fomit-frame-pointer avx
gcc -m64 -march=nocona -O2 -fomit-frame-pointer avx
gcc -m64 -march=nocona -O3 -fomit-frame-pointer avx
gcc -m64 -march=nocona -O -fomit-frame-pointer avx
gcc -m64 -march=nocona -Os -fomit-frame-pointer avx
gcc -march=barcelona -O2 -fomit-frame-pointer avx
gcc -march=barcelona -O3 -fomit-frame-pointer avx
gcc -march=barcelona -O -fomit-frame-pointer avx
gcc -march=barcelona -Os -fomit-frame-pointer avx
gcc -march=k8 -O2 -fomit-frame-pointer avx
gcc -march=k8 -O3 -fomit-frame-pointer avx
gcc -march=k8 -O -fomit-frame-pointer avx
gcc -march=k8 -Os -fomit-frame-pointer avx
gcc -march=nocona -O2 -fomit-frame-pointer avx
gcc -march=nocona -O3 -fomit-frame-pointer avx
gcc -march=nocona -O -fomit-frame-pointer avx
gcc -march=nocona -Os -fomit-frame-pointer avx

Compiler output

Implementation: crypto_sign/lattisigns512/avx
Compiler: gcc -m64 -march=barcelona -O2 -fomit-frame-pointer
ntt_transform.c: ntt_transform.c: In function 'ntt_transform':
ntt_transform.c: ntt_transform.c:27:9: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
ntt_transform.c: vpinv = _mm256_set_pd(PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE);
ntt_transform.c: ~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
ntt_transform.c: In file included from /home_nfs/bdolbeaur/gcc-6.3.0-full+isl-inca/lib/gcc/x86_64-pc-linux-gnu/6.3.0/include/immintrin.h:41:0,
ntt_transform.c: from ntt_transform.c:9:
ntt_transform.c: /home_nfs/bdolbeaur/gcc-6.3.0-full+isl-inca/lib/gcc/x86_64-pc-linux-gnu/6.3.0/include/avxintrin.h:834:1: error: inlining failed in call to always_inline '_mm256_load_pd': target specific option mismatch
ntt_transform.c: _mm256_load_pd (double const *__P)
ntt_transform.c: ^~~~~~~~~~~~~~
ntt_transform.c: ntt_transform.c:35:8: note: called from here
ntt_transform.c: neg4 = _mm256_load_pd(_neg4);
ntt_transform.c: ~~~~~^~~~~~~~~~~~~~~~~~~~~~~
ntt_transform.c: In file included from /home_nfs/bdolbeaur/gcc-6.3.0-full+isl-inca/lib/gcc/x86_64-pc-linux-gnu/6.3.0/include/immintrin.h:41:0,
ntt_transform.c: from ntt_transform.c:9:
ntt_transform.c: /home_nfs/bdolbeaur/gcc-6.3.0-full+isl-inca/lib/gcc/x86_64-pc-linux-gnu/6.3.0/include/avxintrin.h:834:1: error: inlining failed in call to always_inline '_mm256_load_pd': target specific option mismatch
ntt_transform.c: _mm256_load_pd (double const *__P)
ntt_transform.c: ^~~~~~~~~~~~~~
ntt_transform.c: ntt_transform.c:34:8: note: called from here
ntt_transform.c: neg2 = _mm256_load_pd(_neg2);
ntt_transform.c: ~~~~~^~~~~~~~~~~~~~~~~~~~~~~
ntt_transform.c: In file included from /home_nfs/bdolbeaur/gcc-6.3.0-full+isl-inca/lib/gcc/x86_64-pc-linux-gnu/6.3.0/include/immintrin.h:41:0,
ntt_transform.c: from ntt_transform.c:9:
ntt_transform.c: /home_nfs/bdolbeaur/gcc-6.3.0-full+isl-inca/lib/gcc/x86_64-pc-linux-gnu/6.3.0/include/avxintrin.h:834:1: error: inlining failed in call to always_inline '_mm256_load_pd': target specific option mismatch
ntt_transform.c: _mm256_load_pd (double const *__P)
ntt_transform.c: ^~~~~~~~~~~~~~
ntt_transform.c: ...
ntt_transform.c: ntt_transform.c: In function 'ntt_transform':
ntt_transform.c: ntt_transform.c:27:9: warning: AVX vector return without AVX enabled changes the ABI [-Wpsabi]
ntt_transform.c: vpinv = _mm256_set_pd(PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE);
ntt_transform.c: ~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
ntt_transform.c: In file included from /home_nfs/bdolbeaur/gcc-6.3.0-full+isl-inca/lib/gcc/x86_64-pc-linux-gnu/6.3.0/include/immintrin.h:41:0,
ntt_transform.c: from ntt_transform.c:9:
ntt_transform.c: /home_nfs/bdolbeaur/gcc-6.3.0-full+isl-inca/lib/gcc/x86_64-pc-linux-gnu/6.3.0/include/avxintrin.h:834:1: error: inlining failed in call to always_inline '_mm256_load_pd': target specific option mismatch
ntt_transform.c: _mm256_load_pd (double const *__P)
ntt_transform.c: ^~~~~~~~~~~~~~
ntt_transform.c: ntt_transform.c:35:8: note: called from here
ntt_transform.c: neg4 = _mm256_load_pd(_neg4);
ntt_transform.c: ~~~~~^~~~~~~~~~~~~~~~~~~~~~~
ntt_transform.c: In file included from /home_nfs/bdolbeaur/gcc-6.3.0-full+isl-inca/lib/gcc/x86_64-pc-linux-gnu/6.3.0/include/immintrin.h:41:0,
ntt_transform.c: from ntt_transform.c:9:
ntt_transform.c: /home_nfs/bdolbeaur/gcc-6.3.0-full+isl-inca/lib/gcc/x86_64-pc-linux-gnu/6.3.0/include/avxintrin.h:834:1: error: inlining failed in call to always_inline '_mm256_load_pd': target specific option mismatch
ntt_transform.c: _mm256_load_pd (double const *__P)
ntt_transform.c: ^~~~~~~~~~~~~~
ntt_transform.c: ntt_transform.c:34:8: note: called from here
ntt_transform.c: neg2 = _mm256_load_pd(_neg2);
ntt_transform.c: ~~~~~^~~~~~~~~~~~~~~~~~~~~~~
ntt_transform.c: In file included from /home_nfs/bdolbeaur/gcc-6.3.0-full+isl-inca/lib/gcc/x86_64-pc-linux-gnu/6.3.0/include/immintrin.h:41:0,
ntt_transform.c: from ntt_transform.c:9:
ntt_transform.c: /home_nfs/bdolbeaur/gcc-6.3.0-full+isl-inca/lib/gcc/x86_64-pc-linux-gnu/6.3.0/include/avxintrin.h:834:1: error: inlining failed in call to always_inline '_mm256_load_pd': target specific option mismatch
ntt_transform.c: _mm256_load_pd (double const *__P)
ntt_transform.c: ^~~~~~~~~~~~~~
ntt_transform.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -m64 -march=barcelona -O2 -fomit-frame-pointer avx
gcc -m64 -march=barcelona -O3 -fomit-frame-pointer avx
gcc -m64 -march=barcelona -O -fomit-frame-pointer avx
gcc -m64 -march=barcelona -Os -fomit-frame-pointer avx

Compiler output

Implementation: crypto_sign/lattisigns512/avx
Compiler: gcc -m64 -march=core-avx-i -O2 -fomit-frame-pointer
try.c: /home_nfs/bdolbeaur/scratch/TEMP/supercop-20170904/supercop-data/manny1024/amd64/lib/knownrandombytes.o: In function `randombytes':
try.c: knownrandombytes.c:(.text+0x...): undefined reference to `_intel_fast_memcpy'
try.c: knownrandombytes.c:(.text+0x...): undefined reference to `_intel_fast_memset'
try.c: /home_nfs/bdolbeaur/scratch/TEMP/supercop-20170904/supercop-data/manny1024/amd64/lib/libsupercop.a(crypto_stream_salsa20_dolbeau_amd64_xmm6int-api.o): In function `crypto_stream_salsa20_dolbeau_amd64_xmm6int':
try.c: api.c:(.text+0x...): undefined reference to `__intel_avx_rep_memset'
try.c: /home_nfs/bdolbeaur/scratch/TEMP/supercop-20170904/supercop-data/manny1024/amd64/lib/libsupercop.a(crypto_stream_salsa20_dolbeau_amd64_xmm6int-salsa20-xmm6int.o): In function `crypto_stream_salsa20_dolbeau_amd64_xmm6int_ECRYPT_keystream_bytes':
try.c: salsa20-xmm6int.c:(.text+0x...): undefined reference to `__intel_avx_rep_memset'
try.c: /home_nfs/bdolbeaur/scratch/TEMP/supercop-20170904/supercop-data/manny1024/amd64/lib/libsupercop.a(crypto_stream_chacha20_dolbeau_amd64_avx2-api.o): In function `crypto_stream_chacha20_dolbeau_amd64_avx2':
try.c: api.c:(.text+0x...): undefined reference to `__intel_avx_rep_memset'
try.c: /home_nfs/bdolbeaur/scratch/TEMP/supercop-20170904/supercop-data/manny1024/amd64/lib/libsupercop.a(crypto_stream_chacha20_dolbeau_amd64_avx2-chacha.o): In function `crypto_stream_chacha20_dolbeau_amd64_avx2_ECRYPT_keystream_bytes':
try.c: chacha.c:(.text+0x...): undefined reference to `__intel_avx_rep_memset'
try.c: collect2: error: ld returned 1 exit status

Number of similar (compiler,implementation) pairs: 20, namely:
CompilerImplementations
gcc -m64 -march=core-avx-i -O2 -fomit-frame-pointer avx
gcc -m64 -march=core-avx-i -O3 -fomit-frame-pointer avx
gcc -m64 -march=core-avx-i -O -fomit-frame-pointer avx
gcc -m64 -march=core-avx-i -Os -fomit-frame-pointer avx
gcc -m64 -march=core-avx2 -O2 -fomit-frame-pointer avx
gcc -m64 -march=core-avx2 -O3 -fomit-frame-pointer avx
gcc -m64 -march=core-avx2 -O -fomit-frame-pointer avx
gcc -m64 -march=core-avx2 -Os -fomit-frame-pointer avx
gcc -m64 -march=corei7-avx -O2 -fomit-frame-pointer avx
gcc -m64 -march=corei7-avx -O3 -fomit-frame-pointer avx
gcc -m64 -march=corei7-avx -O -fomit-frame-pointer avx
gcc -m64 -march=corei7-avx -Os -fomit-frame-pointer avx
gcc -m64 -march=native -mtune=native -O2 -fomit-frame-pointer avx
gcc -m64 -march=native -mtune=native -O3 -fomit-frame-pointer avx
gcc -m64 -march=native -mtune=native -O -fomit-frame-pointer avx
gcc -m64 -march=native -mtune=native -Os -fomit-frame-pointer avx
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv avx
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv avx
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv avx
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv avx