Implementation notes: amd64, colossus6, crypto_sign/luov863256

Computer: colossus6
Architecture: amd64
CPU ID: AuthenticAMD-00830f10-178bfbff
SUPERCOP version: 20210125
Operation: crypto_sign
Primitive: luov863256
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
643941060727 0 090325 832 1592T:avx2clang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021031020210125
644296548513 0 076721 816 1592T:avx2clang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021031020210125
647588260727 0 090325 832 1592T:avx2clang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021031020210125
647725532760 32768 064721 33584 1624T:avx2gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021031020210125
649287064591 0 094525 832 1592T:avx2clang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021031020210125
649944020964 32768 050793 33584 1624T:avx2gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021031020210125
657362220695 32768 050113 33584 1624T:avx2gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021031020210125
682877215863 32768 044449 33576 1592T:avx2gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021031020210125
9591750550714 0 049796 832 1592T:portableclang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021031020210125
9632025550714 0 049796 832 1592T:portableclang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021031020210125
9761602554198 0 051836 832 1592T:portableclang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021031020210125
9828428552511 0 050548 832 1592T:portableclang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021031020210125
10023075558832 0 055200 808 1624T:portablegcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021031020210125
10177852552906 0 049440 808 1624T:portablegcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021031020210125
10329030546338 0 044160 816 1592T:portableclang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021031020210125
11583855548363 0 045416 800 1592T:portablegcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021031020210125
12088237550695 0 047544 808 1624T:portablegcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021031020210125
81242100418414 36 0176528 808 1624T:refgcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021031020210125
81948780413480 36 0172552 808 1624T:refgcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021031020210125
81956723409614 36 0169360 800 1592T:refgcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021031020210125
82160843415682 36 0174596 832 1592T:refclang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021031020210125
82319895411806 36 0171496 808 1624T:refgcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE2021031020210125
82780222410587 36 0171988 832 1592T:refclang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021031020210125
82961910413859 36 0173508 832 1592T:refclang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021031020210125
83095650407325 36 0167616 816 1592T:refclang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021031020210125
83117497410587 36 0171988 832 1592T:refclang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE2021031020210125

Compiler output

Implementation: T:avx2
Security model: timingleaks
Compiler: clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
LUOV.c: LUOV.c:38:44: error: always_inline function '_mm256_loadu_si256' requires target feature 'sse4.2', but would be inlined into function 'calculateQ2' that is compiled without support for 'sse4.2'
LUOV.c: __m256i rrrr = _mm256_permute4x64_epi64(_mm256_loadu_si256((__m256i *)&Q1[col++]),0);
LUOV.c: ^
LUOV.c: LUOV.c:38:19: error: always_inline function '_mm256_undefined_si256' requires target feature 'sse4.2', but would be inlined into function 'calculateQ2' that is compiled without support for 'sse4.2'
LUOV.c: __m256i rrrr = _mm256_permute4x64_epi64(_mm256_loadu_si256((__m256i *)&Q1[col++]),0);
LUOV.c: ^
LUOV.c: /usr/lib/llvm-6.0/lib/clang/6.0.0/include/avx2intrin.h:968:44: note: expanded from macro '_mm256_permute4x64_epi64'
LUOV.c: (__v4di)_mm256_undefined_si256(), \
LUOV.c: ^
LUOV.c: LUOV.c:43:52: error: always_inline function '_mm256_blendv_pd' requires target feature 'sse4.2', but would be inlined into function 'calculateQ2' that is compiled without support for 'sse4.2'
LUOV.c: *((__m256i *)&TempMat[i][k*8+4]) ^= (__m256i) _mm256_blendv_pd(_mm256_setzero_pd(),(__m256d) rrrr,(__m256d)TJ);
LUOV.c: ^
LUOV.c: LUOV.c:43:69: error: always_inline function '_mm256_setzero_pd' requires target feature 'sse4.2', but would be inlined into function 'calculateQ2' that is compiled without support for 'sse4.2'
LUOV.c: *((__m256i *)&TempMat[i][k*8+4]) ^= (__m256i) _mm256_blendv_pd(_mm256_setzero_pd(),(__m256d) rrrr,(__m256d)TJ);
LUOV.c: ^
LUOV.c: LUOV.c:44:10: error: always_inline function '_mm256_slli_epi64' requires target feature 'avx2', but would be inlined into function 'calculateQ2' that is compiled without support for 'avx2'
LUOV.c: TJ = _mm256_slli_epi64(TJ,4);
LUOV.c: ^
LUOV.c: LUOV.c:46:49: error: always_inline function '_mm256_blendv_pd' requires target feature 'sse4.2', but would be inlined into function 'calculateQ2' that is compiled without support for 'sse4.2'
LUOV.c: *((__m256i *)&TempMat[i][k*8]) ^= (__m256i) _mm256_blendv_pd(_mm256_setzero_pd(),(__m256d) rrrr,(__m256d)TJ);
LUOV.c: ^
LUOV.c: LUOV.c:46:66: error: always_inline function '_mm256_setzero_pd' requires target feature 'sse4.2', but would be inlined into function 'calculateQ2' that is compiled without support for 'sse4.2'
LUOV.c: *((__m256i *)&TempMat[i][k*8]) ^= (__m256i) _mm256_blendv_pd(_mm256_setzero_pd(),(__m256d) rrrr,(__m256d)TJ);
LUOV.c: ^
LUOV.c: LUOV.c:47:10: error: always_inline function '_mm256_slli_epi64' requires target feature 'avx2', but would be inlined into function 'calculateQ2' that is compiled without support for 'avx2'
LUOV.c: ...

Number of similar (compiler,implementation) pairs: 1, namely:
CompilerImplementations
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:avx2

Namespace violations

Implementation: T:avx2
Security model: timingleaks
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
LUOV.o BuildAugmentedMatrixFromP1L T
LUOV.o BuildAugmentedMatrixFromQ1 T
LUOV.o EvaluateVinegarPart T
LUOV.o TransformQ1 T
LUOV.o _generateKeyPair T
LUOV.o _sign T
LUOV.o calculateQ2 T
LUOV.o computeTarget T
LUOV.o deserializeQ2 T
LUOV.o evaluatePublicMap T
LUOV.o extractMessage T
LUOV.o generateBigKeyPair T
LUOV.o generateKeyPair T
LUOV.o initializeVinegarSponge T
LUOV.o multiplyByT T
LUOV.o precompute_sign T
LUOV.o precompute_verify T
LUOV.o serializeQ2 T
LUOV.o solvePrivateUOVSystem T
LUOV.o verify T
LUOV.o verify_fast T
LinearAlgebra.o destroy_matrix T
LinearAlgebra.o getUniqueSolution T
LinearAlgebra.o newMatrix T
LinearAlgebra.o printMatrix T
LinearAlgebra.o rowEchelonAugmented T
LinearAlgebra.o rowOp T
LinearAlgebra.o scalarMul2 T
LinearAlgebra.o scaleRow T
LinearAlgebra.o swapRows T
intermediateValues.o printAugmentedMatrix T
intermediateValues.o printEvaluation T
intermediateValues.o printPrivateSolution T
intermediateValues.o printVinegarValues T
intermediateValues.o reportSolutionFound T
keccakrng.o calculateQ1 T
keccakrng.o initializeAndAbsorb T
keccakrng.o squeezeCols T
keccakrng.o squeezeVector T
keccakrng.o squeezeuint64_t T

Number of similar (compiler,implementation) pairs: 8, namely:
CompilerImplementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:avx2
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:avx2
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:avx2
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:avx2
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx2
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx2
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx2
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE T:avx2

Namespace violations

Implementation: T:portable
Security model: timingleaks
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
Bitcontainer.o squeezeBitcontainerArray T
F16Field.o f16EucildeanDivision T
F16Field.o f16ExtendedEuclideanAlgorithm T
F16Field.o f16antilog T
F16Field.o f16deserialize_FELT T
F16Field.o f16inverse T
F16Field.o f16log T
F16Field.o f16multiply T
F16Field.o f16polyAdd T
F16Field.o f16polyCopy T
F16Field.o f16polyMult T
F16Field.o f16polyOne T
F16Field.o f16polyZero T
F16Field.o f16printFELT T
F16Field.o f16scalarMultiply T
F16Field.o f16serialize_FELT T
F16Field.o isArrayOfZeros T
F32Field.o f32add T
F32Field.o f32addInPlace T
F32Field.o f32deserialize_FELT T
F32Field.o f32inverse T
F32Field.o f32isEqual T
F32Field.o f32multiply T
F32Field.o f32multiplyOld T
F32Field.o f32printFELT T
F32Field.o f32scalarMultiply T
F32Field.o f32serialize_FELT T
F32Field.o newF32FELT T
F48Field.o f48add T
F48Field.o f48addInPlace T
F48Field.o f48deserialize_FELT T
F48Field.o f48inverse T
F48Field.o f48isEqual T
F48Field.o f48multiply T
F48Field.o f48printFELT T
F48Field.o f48scalarMultiply T
F48Field.o f48serialize_FELT T
F48Field.o newF48FELT T
F64Field.o f64add T
F64Field.o f64addInPlace T
F64Field.o f64deserialize_FELT T
F64Field.o f64inverse T
F64Field.o f64isEqual T
F64Field.o f64multiply T
F64Field.o f64printFELT T
F64Field.o f64scalarMultiply T
F64Field.o f64serialize_FELT T
F64Field.o newF64FELT T
F80Field.o f80Scalarmultiply T
F80Field.o f80add T
F80Field.o f80addInPlace T
F80Field.o f80deserialize_FELT T
F80Field.o f80inverse T
F80Field.o f80isEqual T
F80Field.o f80multiply T
F80Field.o f80printFELT T
F80Field.o f80serialize_FELT T
F80Field.o newF80FELT T
F8Field.o f8antilog T
F8Field.o f8deserialize_FELT T
F8Field.o f8inverse T
F8Field.o f8log T
F8Field.o f8multiply T
F8Field.o f8printFELT T
F8Field.o f8serialize_FELT T
LUOV.o BuildAugmentedMatrix T
LUOV.o _addScalarProduct1 T
LUOV.o _addScalarProduct3 T
LUOV.o addScalarProduct T
LUOV.o addScalarProduct3 T
LUOV.o calculateQ2 T
LUOV.o computeTarget T
LUOV.o deserialize_PublicKey T
LUOV.o deserialize_SecretKey T
LUOV.o deserialize_signature T
LUOV.o destroy_PublicKey T
LUOV.o destroy_SecretKey T
LUOV.o destroy_signature T
LUOV.o evaluatePublicMap T
LUOV.o expandTable T
LUOV.o extractMessage T
LUOV.o generateKeyPair T
LUOV.o repeatTable T
LUOV.o serialize_PublicKey T
LUOV.o serialize_SecretKey T
LUOV.o serialize_signature T
LUOV.o signDocument T
LUOV.o solvePrivateUOVSystem T
LUOV.o verify T
LinearAlgebra.o destroy_matrix T
LinearAlgebra.o getUniqueSolution T
LinearAlgebra.o newMatrix T
LinearAlgebra.o printMatrix T
LinearAlgebra.o rowEchelonAugmented T
LinearAlgebra.o rowOp T
LinearAlgebra.o scaleRow T
LinearAlgebra.o swapRows T
LinearAlgebra.o zeroMatrix T
buffer.o deserialize_uint64_t T
buffer.o newReader T
buffer.o newWriter T
buffer.o readBit T
buffer.o serialize_uint64_t T
buffer.o transcribe T
buffer.o writeBit T
intermediateValues.o printAugmentedMatrix T
intermediateValues.o printEvaluation T
intermediateValues.o printPrivateSolution T
intermediateValues.o printVinegarValues T
intermediateValues.o reportSolutionFound T
keccakrng.o initializeAndAbsorb T
keccakrng.o squeezeVector T
keccakrng.o squeezeuint64_t T

Number of similar (compiler,implementation) pairs: 9, namely:
CompilerImplementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:portable
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:portable
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:portable
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:portable
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:portable
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:portable
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:portable
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE T:portable
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE T:portable

Namespace violations

Implementation: T:ref
Security model: timingleaks
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
Bitcontainer.o squeezeBitcontainerArray T
F16Field.o f16EucildeanDivision T
F16Field.o f16ExtendedEuclideanAlgorithm T
F16Field.o f16antilog T
F16Field.o f16deserialize_FELT T
F16Field.o f16inverse T
F16Field.o f16log T
F16Field.o f16multiply T
F16Field.o f16polyAdd T
F16Field.o f16polyCopy T
F16Field.o f16polyMult T
F16Field.o f16polyOne T
F16Field.o f16polyZero T
F16Field.o f16printFELT T
F16Field.o f16scalarMultiply T
F16Field.o f16serialize_FELT T
F16Field.o isArrayOfZeros T
F32Field.o f32add T
F32Field.o f32deserialize_FELT T
F32Field.o f32inverse T
F32Field.o f32isEqual T
F32Field.o f32multiply T
F32Field.o f32multiplyOld T
F32Field.o f32printFELT T
F32Field.o f32scalarMultiply T
F32Field.o f32serialize_FELT T
F32Field.o newF32FELT T
F48Field.o f48add T
F48Field.o f48deserialize_FELT T
F48Field.o f48inverse T
F48Field.o f48isEqual T
F48Field.o f48multiply T
F48Field.o f48printFELT T
F48Field.o f48scalarMultiply T
F48Field.o f48serialize_FELT T
F48Field.o newF48FELT T
F64Field.o f64add T
F64Field.o f64deserialize_FELT T
F64Field.o f64inverse T
F64Field.o f64isEqual T
F64Field.o f64multiply T
F64Field.o f64printFELT T
F64Field.o f64scalarMultiply T
F64Field.o f64serialize_FELT T
F64Field.o newF64FELT T
F80Field.o f80Scalarmultiply T
F80Field.o f80add T
F80Field.o f80deserialize_FELT T
F80Field.o f80inverse T
F80Field.o f80isEqual T
F80Field.o f80multiply T
F80Field.o f80printFELT T
F80Field.o f80serialize_FELT T
F80Field.o newF80FELT T
F8Field.o f8antilog T
F8Field.o f8deserialize_FELT T
F8Field.o f8inverse T
F8Field.o f8log T
F8Field.o f8multiply T
F8Field.o f8printFELT T
F8Field.o f8serialize_FELT T
LUOV.o BuildAugmentedMatrix T
LUOV.o calculateQ2 T
LUOV.o computeTarget T
LUOV.o deserialize_PublicKey T
LUOV.o deserialize_SecretKey T
LUOV.o deserialize_signature T
LUOV.o destroy_PublicKey T
LUOV.o destroy_SecretKey T
LUOV.o destroy_signature T
LUOV.o evaluatePublicMap T
LUOV.o extractMessage T
LUOV.o generateKeyPair T
LUOV.o serialize_PublicKey T
LUOV.o serialize_SecretKey T
LUOV.o serialize_signature T
LUOV.o signDocument T
LUOV.o solvePrivateUOVSystem T
LUOV.o verify T
LinearAlgebra.o destroy_matrix T
LinearAlgebra.o getUniqueSolution T
LinearAlgebra.o newMatrix T
LinearAlgebra.o printMatrix T
LinearAlgebra.o rowEchelonAugmented T
LinearAlgebra.o rowOp T
LinearAlgebra.o scaleRow T
LinearAlgebra.o swapRows T
LinearAlgebra.o zeroMatrix T
buffer.o deserialize_uint64_t T
buffer.o newReader T
buffer.o newWriter T
buffer.o readBit T
buffer.o serialize_uint64_t T
buffer.o transcribe T
buffer.o writeBit T
intermediateValues.o printAugmentedMatrix T
intermediateValues.o printEvaluation T
intermediateValues.o printPrivateSolution T
intermediateValues.o printVinegarValues T
intermediateValues.o reportSolutionFound T
keccakrng.o initializeAndAbsorb T
keccakrng.o squeezeVector T
keccakrng.o squeezeuint64_t T

Number of similar (compiler,implementation) pairs: 9, namely:
CompilerImplementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:ref
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:ref
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:ref
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:ref
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE T:ref
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:ref
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE T:ref
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE T:ref
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE T:ref