Implementation notes: amd64, h4e350, crypto_sign/lattisigns512

Computer: h4e350
Architecture: amd64
CPU ID: AuthenticAMD-00500f10-178bfbff
SUPERCOP version: 20141014
Operation: crypto_sign
Primitive: lattisigns512

Compiler output

Implementation: crypto_sign/lattisigns512/avx
Compiler: cc
ntt_transform.c: ntt_transform.c: In function 'ntt_transform':
ntt_transform.c: ntt_transform.c:24:3: error: unknown type name '__m256d'
ntt_transform.c: ntt_transform.c:25:3: error: unknown type name '__m256d'

Number of similar (compiler,implementation) pairs: 88, namely:
CompilerImplementations
cc avx
gcc avx
gcc -O2 -fomit-frame-pointer avx
gcc -O3 -fomit-frame-pointer avx
gcc -O -fomit-frame-pointer avx
gcc -Os -fomit-frame-pointer avx
gcc -fno-schedule-insns -O2 -fomit-frame-pointer avx
gcc -fno-schedule-insns -O3 -fomit-frame-pointer avx
gcc -fno-schedule-insns -O -fomit-frame-pointer avx
gcc -fno-schedule-insns -Os -fomit-frame-pointer avx
gcc -funroll-loops avx
gcc -funroll-loops -O2 -fomit-frame-pointer avx
gcc -funroll-loops -O3 -fomit-frame-pointer avx
gcc -funroll-loops -O -fomit-frame-pointer avx
gcc -funroll-loops -Os -fomit-frame-pointer avx
gcc -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer avx
gcc -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer avx
gcc -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer avx
gcc -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer avx
gcc -funroll-loops -m64 -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m64 -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m64 -O -fomit-frame-pointer avx
gcc -funroll-loops -m64 -Os -fomit-frame-pointer avx
gcc -funroll-loops -m64 -march=barcelona -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m64 -march=barcelona -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m64 -march=barcelona -O -fomit-frame-pointer avx
gcc -funroll-loops -m64 -march=barcelona -Os -fomit-frame-pointer avx
gcc -funroll-loops -m64 -march=k8 -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m64 -march=k8 -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m64 -march=k8 -O -fomit-frame-pointer avx
gcc -funroll-loops -m64 -march=k8 -Os -fomit-frame-pointer avx
gcc -funroll-loops -m64 -march=nocona -O2 -fomit-frame-pointer avx
gcc -funroll-loops -m64 -march=nocona -O3 -fomit-frame-pointer avx
gcc -funroll-loops -m64 -march=nocona -O -fomit-frame-pointer avx
gcc -funroll-loops -m64 -march=nocona -Os -fomit-frame-pointer avx
gcc -funroll-loops -march=barcelona -O2 -fomit-frame-pointer avx
gcc -funroll-loops -march=barcelona -O3 -fomit-frame-pointer avx
gcc -funroll-loops -march=barcelona -O -fomit-frame-pointer avx
gcc -funroll-loops -march=barcelona -Os -fomit-frame-pointer avx
gcc -funroll-loops -march=k8 -O2 -fomit-frame-pointer avx
gcc -funroll-loops -march=k8 -O3 -fomit-frame-pointer avx
gcc -funroll-loops -march=k8 -O -fomit-frame-pointer avx
gcc -funroll-loops -march=k8 -Os -fomit-frame-pointer avx
gcc -funroll-loops -march=nocona -O2 -fomit-frame-pointer avx
gcc -funroll-loops -march=nocona -O3 -fomit-frame-pointer avx
gcc -funroll-loops -march=nocona -O -fomit-frame-pointer avx
gcc -funroll-loops -march=nocona -Os -fomit-frame-pointer avx
gcc -m64 -O2 -fomit-frame-pointer avx
gcc -m64 -O3 -fomit-frame-pointer avx
gcc -m64 -O -fomit-frame-pointer avx
gcc -m64 -Os -fomit-frame-pointer avx
gcc -m64 -march=core2 -O2 -fomit-frame-pointer avx
gcc -m64 -march=core2 -O3 -fomit-frame-pointer avx
gcc -m64 -march=core2 -O -fomit-frame-pointer avx
gcc -m64 -march=core2 -Os -fomit-frame-pointer avx
gcc -m64 -march=core2 -msse4.1 -O2 -fomit-frame-pointer avx
gcc -m64 -march=core2 -msse4.1 -O -fomit-frame-pointer avx
gcc -m64 -march=core2 -msse4.1 -Os -fomit-frame-pointer avx
gcc -m64 -march=core2 -msse4 -O2 -fomit-frame-pointer avx
gcc -m64 -march=core2 -msse4 -O -fomit-frame-pointer avx
gcc -m64 -march=core2 -msse4 -Os -fomit-frame-pointer avx
gcc -m64 -march=corei7 -O2 -fomit-frame-pointer avx
gcc -m64 -march=corei7 -O -fomit-frame-pointer avx
gcc -m64 -march=corei7 -Os -fomit-frame-pointer avx
gcc -m64 -march=k8 -O2 -fomit-frame-pointer avx
gcc -m64 -march=k8 -O3 -fomit-frame-pointer avx
gcc -m64 -march=k8 -O -fomit-frame-pointer avx
gcc -m64 -march=k8 -Os -fomit-frame-pointer avx
gcc -m64 -march=native -mtune=native -O2 -fomit-frame-pointer avx
gcc -m64 -march=native -mtune=native -O3 -fomit-frame-pointer avx
gcc -m64 -march=native -mtune=native -O -fomit-frame-pointer avx
gcc -m64 -march=native -mtune=native -Os -fomit-frame-pointer avx
gcc -m64 -march=nocona -O2 -fomit-frame-pointer avx
gcc -m64 -march=nocona -O3 -fomit-frame-pointer avx
gcc -m64 -march=nocona -O -fomit-frame-pointer avx
gcc -m64 -march=nocona -Os -fomit-frame-pointer avx
gcc -march=barcelona -O2 -fomit-frame-pointer avx
gcc -march=barcelona -O3 -fomit-frame-pointer avx
gcc -march=barcelona -O -fomit-frame-pointer avx
gcc -march=barcelona -Os -fomit-frame-pointer avx
gcc -march=k8 -O2 -fomit-frame-pointer avx
gcc -march=k8 -O3 -fomit-frame-pointer avx
gcc -march=k8 -O -fomit-frame-pointer avx
gcc -march=k8 -Os -fomit-frame-pointer avx
gcc -march=nocona -O2 -fomit-frame-pointer avx
gcc -march=nocona -O3 -fomit-frame-pointer avx
gcc -march=nocona -O -fomit-frame-pointer avx
gcc -march=nocona -Os -fomit-frame-pointer avx

Compiler output

Implementation: crypto_sign/lattisigns512/avx
Compiler: clang -O3 -fomit-frame-pointer
ntt_transform.c: ntt_transform.c:24:3: error: use of undeclared identifier '__m256d'
ntt_transform.c: __m256d vt,vo0,vo10,vo11,vo20,vo21,vo22,vo23,vc,vp,vpinv,neg2,neg4;
ntt_transform.c: ^
ntt_transform.c: ntt_transform.c:25:3: error: use of undeclared identifier '__m256d'
ntt_transform.c: __m256d vx0,vx1,vx2,vx3,vx4,vx5,vx6,vx7;
ntt_transform.c: ^
ntt_transform.c: ntt_transform.c:27:3: error: use of undeclared identifier 'vpinv'
ntt_transform.c: vpinv = _mm256_set_pd(PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE);
ntt_transform.c: ^
ntt_transform.c: ntt_transform.c:27:11: warning: implicit declaration of function '_mm256_set_pd' is invalid in C99 [-Wimplicit-function-declaration]
ntt_transform.c: vpinv = _mm256_set_pd(PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE);
ntt_transform.c: ^
ntt_transform.c: ntt_transform.c:28:3: error: use of undeclared identifier 'vp'
ntt_transform.c: vp = _mm256_set_pd(8383489., 8383489., 8383489., 8383489.);
ntt_transform.c: ^
ntt_transform.c: ntt_transform.c:32:3: error: use of undeclared identifier 'vo10'
ntt_transform.c: vo10 = _mm256_load_pd(o+pos);
ntt_transform.c: ^
ntt_transform.c: ntt_transform.c:32:10: warning: implicit declaration of function '_mm256_load_pd' is invalid in C99 [-Wimplicit-function-declaration]
ntt_transform.c: vo10 = _mm256_load_pd(o+pos);
ntt_transform.c: ^
ntt_transform.c: ntt_transform.c:33:3: error: use of undeclared identifier 'vo20'
ntt_transform.c: vo20 = _mm256_load_pd(o+pos+4);
ntt_transform.c: ^
ntt_transform.c: ntt_transform.c:34:3: error: use of undeclared identifier 'neg2'; did you mean '_neg2'?
ntt_transform.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -O3 -fomit-frame-pointer avx

Compiler output

Implementation: crypto_sign/lattisigns512/avx
Compiler: clang -O3 -fwrapv -march=x86-64 -mcpu=core-avx2 -mavx2 -maes -mpclmul -fomit-frame-pointer
fastrandombytes.c: clang: warning: argument unused during compilation: '-mcpu=core-avx2'
fastrandombytes.c: clang: warning: argument unused during compilation: '-mavx2'
fastrandombytes.c: clang: warning: argument unused during compilation: '-mpclmul'
ntt_transform.c: clang: warning: argument unused during compilation: '-mcpu=core-avx2'
ntt_transform.c: clang: warning: argument unused during compilation: '-mavx2'
ntt_transform.c: clang: warning: argument unused during compilation: '-mpclmul'
ntt_transform.c: In file included from ntt_transform.c:9:
ntt_transform.c: In file included from /usr/include/clang/3.0/include/immintrin.h:52:
ntt_transform.c: In file included from /usr/include/clang/3.0/include/wmmintrin.h:31:
ntt_transform.c: /usr/include/clang/3.0/include/smmintrin.h:28:2: error: #error "SSE4.1 instruction set not enabled"
ntt_transform.c: #error "SSE4.1 instruction set not enabled"
ntt_transform.c: ^
ntt_transform.c: ntt_transform.c:24:3: error: use of undeclared identifier '__m256d'
ntt_transform.c: __m256d vt,vo0,vo10,vo11,vo20,vo21,vo22,vo23,vc,vp,vpinv,neg2,neg4;
ntt_transform.c: ^
ntt_transform.c: ntt_transform.c:25:3: error: use of undeclared identifier '__m256d'
ntt_transform.c: __m256d vx0,vx1,vx2,vx3,vx4,vx5,vx6,vx7;
ntt_transform.c: ^
ntt_transform.c: ntt_transform.c:27:3: error: use of undeclared identifier 'vpinv'
ntt_transform.c: vpinv = _mm256_set_pd(PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE);
ntt_transform.c: ^
ntt_transform.c: ntt_transform.c:27:11: warning: implicit declaration of function '_mm256_set_pd' is invalid in C99 [-Wimplicit-function-declaration]
ntt_transform.c: vpinv = _mm256_set_pd(PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE);
ntt_transform.c: ^
ntt_transform.c: ntt_transform.c:28:3: error: use of undeclared identifier 'vp'
ntt_transform.c: vp = _mm256_set_pd(8383489., 8383489., 8383489., 8383489.);
ntt_transform.c: ^
ntt_transform.c: ntt_transform.c:32:3: error: use of undeclared identifier 'vo10'
ntt_transform.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -O3 -fwrapv -march=x86-64 -mcpu=core-avx2 -mavx2 -maes -mpclmul -fomit-frame-pointer avx

Compiler output

Implementation: crypto_sign/lattisigns512/avx
Compiler: clang -O3 -fwrapv -march=x86-64 -mcpu=core-avx2 -mavx2 -maes -mpclmul -fpolly -funroll-loops -fvectorize -fslp-vectorize -fslp-vectorize-aggressive -fomit-frame-pointer
fastrandombytes.c: clang: warning: argument unused during compilation: '-mcpu=core-avx2'
fastrandombytes.c: clang: warning: argument unused during compilation: '-mavx2'
fastrandombytes.c: clang: warning: argument unused during compilation: '-mpclmul'
fastrandombytes.c: clang: warning: argument unused during compilation: '-fpolly'
fastrandombytes.c: clang: warning: argument unused during compilation: '-fvectorize'
fastrandombytes.c: clang: warning: argument unused during compilation: '-fslp-vectorize'
fastrandombytes.c: clang: warning: argument unused during compilation: '-fslp-vectorize-aggressive'
ntt_transform.c: clang: warning: argument unused during compilation: '-mcpu=core-avx2'
ntt_transform.c: clang: warning: argument unused during compilation: '-mavx2'
ntt_transform.c: clang: warning: argument unused during compilation: '-mpclmul'
ntt_transform.c: clang: warning: argument unused during compilation: '-fpolly'
ntt_transform.c: clang: warning: argument unused during compilation: '-fvectorize'
ntt_transform.c: clang: warning: argument unused during compilation: '-fslp-vectorize'
ntt_transform.c: clang: warning: argument unused during compilation: '-fslp-vectorize-aggressive'
ntt_transform.c: In file included from ntt_transform.c:9:
ntt_transform.c: In file included from /usr/include/clang/3.0/include/immintrin.h:52:
ntt_transform.c: In file included from /usr/include/clang/3.0/include/wmmintrin.h:31:
ntt_transform.c: /usr/include/clang/3.0/include/smmintrin.h:28:2: error: #error "SSE4.1 instruction set not enabled"
ntt_transform.c: #error "SSE4.1 instruction set not enabled"
ntt_transform.c: ^
ntt_transform.c: ntt_transform.c:24:3: error: use of undeclared identifier '__m256d'
ntt_transform.c: __m256d vt,vo0,vo10,vo11,vo20,vo21,vo22,vo23,vc,vp,vpinv,neg2,neg4;
ntt_transform.c: ^
ntt_transform.c: ntt_transform.c:25:3: error: use of undeclared identifier '__m256d'
ntt_transform.c: __m256d vx0,vx1,vx2,vx3,vx4,vx5,vx6,vx7;
ntt_transform.c: ^
ntt_transform.c: ntt_transform.c:27:3: error: use of undeclared identifier 'vpinv'
ntt_transform.c: vpinv = _mm256_set_pd(PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE);
ntt_transform.c: ^
ntt_transform.c: ntt_transform.c:27:11: warning: implicit declaration of function '_mm256_set_pd' is invalid in C99 [-Wimplicit-function-declaration]
ntt_transform.c: vpinv = _mm256_set_pd(PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE);
ntt_transform.c: ^
ntt_transform.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -O3 -fwrapv -march=x86-64 -mcpu=core-avx2 -mavx2 -maes -mpclmul -fpolly -funroll-loops -fvectorize -fslp-vectorize -fslp-vectorize-aggressive -fomit-frame-pointer avx

Compiler output

Implementation: crypto_sign/lattisigns512/avx
Compiler: clang -O3 -fwrapv -mavx2 -fomit-frame-pointer
fastrandombytes.c: clang: warning: argument unused during compilation: '-mavx2'
ntt_transform.c: clang: warning: argument unused during compilation: '-mavx2'
ntt_transform.c: ntt_transform.c:24:3: error: use of undeclared identifier '__m256d'
ntt_transform.c: __m256d vt,vo0,vo10,vo11,vo20,vo21,vo22,vo23,vc,vp,vpinv,neg2,neg4;
ntt_transform.c: ^
ntt_transform.c: ntt_transform.c:25:3: error: use of undeclared identifier '__m256d'
ntt_transform.c: __m256d vx0,vx1,vx2,vx3,vx4,vx5,vx6,vx7;
ntt_transform.c: ^
ntt_transform.c: ntt_transform.c:27:3: error: use of undeclared identifier 'vpinv'
ntt_transform.c: vpinv = _mm256_set_pd(PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE);
ntt_transform.c: ^
ntt_transform.c: ntt_transform.c:27:11: warning: implicit declaration of function '_mm256_set_pd' is invalid in C99 [-Wimplicit-function-declaration]
ntt_transform.c: vpinv = _mm256_set_pd(PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE);
ntt_transform.c: ^
ntt_transform.c: ntt_transform.c:28:3: error: use of undeclared identifier 'vp'
ntt_transform.c: vp = _mm256_set_pd(8383489., 8383489., 8383489., 8383489.);
ntt_transform.c: ^
ntt_transform.c: ntt_transform.c:32:3: error: use of undeclared identifier 'vo10'
ntt_transform.c: vo10 = _mm256_load_pd(o+pos);
ntt_transform.c: ^
ntt_transform.c: ntt_transform.c:32:10: warning: implicit declaration of function '_mm256_load_pd' is invalid in C99 [-Wimplicit-function-declaration]
ntt_transform.c: vo10 = _mm256_load_pd(o+pos);
ntt_transform.c: ^
ntt_transform.c: ntt_transform.c:33:3: error: use of undeclared identifier 'vo20'
ntt_transform.c: vo20 = _mm256_load_pd(o+pos+4);
ntt_transform.c: ^
ntt_transform.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -O3 -fwrapv -mavx2 -fomit-frame-pointer avx

Compiler output

Implementation: crypto_sign/lattisigns512/avx
Compiler: clang -O3 -fwrapv -mavx2 -fpolly -funroll-loops -fvectorize -fslp-vectorize -fslp-vectorize-aggressive -fomit-frame-pointer
fastrandombytes.c: clang: warning: argument unused during compilation: '-mavx2'
fastrandombytes.c: clang: warning: argument unused during compilation: '-fpolly'
fastrandombytes.c: clang: warning: argument unused during compilation: '-fvectorize'
fastrandombytes.c: clang: warning: argument unused during compilation: '-fslp-vectorize'
fastrandombytes.c: clang: warning: argument unused during compilation: '-fslp-vectorize-aggressive'
ntt_transform.c: clang: warning: argument unused during compilation: '-mavx2'
ntt_transform.c: clang: warning: argument unused during compilation: '-fpolly'
ntt_transform.c: clang: warning: argument unused during compilation: '-fvectorize'
ntt_transform.c: clang: warning: argument unused during compilation: '-fslp-vectorize'
ntt_transform.c: clang: warning: argument unused during compilation: '-fslp-vectorize-aggressive'
ntt_transform.c: ntt_transform.c:24:3: error: use of undeclared identifier '__m256d'
ntt_transform.c: __m256d vt,vo0,vo10,vo11,vo20,vo21,vo22,vo23,vc,vp,vpinv,neg2,neg4;
ntt_transform.c: ^
ntt_transform.c: ntt_transform.c:25:3: error: use of undeclared identifier '__m256d'
ntt_transform.c: __m256d vx0,vx1,vx2,vx3,vx4,vx5,vx6,vx7;
ntt_transform.c: ^
ntt_transform.c: ntt_transform.c:27:3: error: use of undeclared identifier 'vpinv'
ntt_transform.c: vpinv = _mm256_set_pd(PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE);
ntt_transform.c: ^
ntt_transform.c: ntt_transform.c:27:11: warning: implicit declaration of function '_mm256_set_pd' is invalid in C99 [-Wimplicit-function-declaration]
ntt_transform.c: vpinv = _mm256_set_pd(PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE, PARAM_APPROX_P_INVERSE);
ntt_transform.c: ^
ntt_transform.c: ntt_transform.c:28:3: error: use of undeclared identifier 'vp'
ntt_transform.c: vp = _mm256_set_pd(8383489., 8383489., 8383489., 8383489.);
ntt_transform.c: ^
ntt_transform.c: ntt_transform.c:32:3: error: use of undeclared identifier 'vo10'
ntt_transform.c: vo10 = _mm256_load_pd(o+pos);
ntt_transform.c: ^
ntt_transform.c: ntt_transform.c:32:10: warning: implicit declaration of function '_mm256_load_pd' is invalid in C99 [-Wimplicit-function-declaration]
ntt_transform.c: vo10 = _mm256_load_pd(o+pos);
ntt_transform.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -O3 -fwrapv -mavx2 -fpolly -funroll-loops -fvectorize -fslp-vectorize -fslp-vectorize-aggressive -fomit-frame-pointer avx

Compiler output

Implementation: crypto_sign/lattisigns512/avx
Compiler: gcc -m64 -march=barcelona -O2 -fomit-frame-pointer
ntt_transform.c: ntt_transform.c: In function 'ntt_transform':
ntt_transform.c: ntt_transform.c:24:3: error: unknown type name '__m256d'
ntt_transform.c: ntt_transform.c:25:3: error: unknown type name '__m256d'
ntt_transform.c: ntt_transform.c: In function 'ntt_transform':
ntt_transform.c: ntt_transform.c:24:3: error: unknown type name '__m256d'
ntt_transform.c: ntt_transform.c:25:3: error: unknown type name '__m256d'

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -m64 -march=barcelona -O2 -fomit-frame-pointer avx
gcc -m64 -march=barcelona -O3 -fomit-frame-pointer avx
gcc -m64 -march=barcelona -O -fomit-frame-pointer avx
gcc -m64 -march=barcelona -Os -fomit-frame-pointer avx