Implementation notes: aarch64, rockpi4, crypto_kem/kyber768

Computer: rockpi4
Architecture: aarch64
CPU ID: unknown CPU ID
SUPERCOP version: 20190816
Operation: crypto_kem
Primitive: kyber768
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
48321035580 512 054583 1424 1584refgcc_-funroll-loops_-fno-schedule-insns_-O3_-fomit-frame-pointer2019121320190816
50138230448 512 048103 1424 1584refgcc_-fno-schedule-insns_-O3_-fomit-frame-pointer2019121320190816
50262131620 512 049644 1320 1584refclang_-O3_-fwrapv_-mavx_-fomit-frame-pointer_-Qunused-arguments2019121320190816
50922930952 512 048743 1424 1584refgcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv2019121320190816
52533631700 512 049724 1320 1584refclang_-mcpu=native_-mfpu=neon_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments2019121320190816
53359616356 512 036079 1416 1568refgcc_-funroll-loops_-O_-fomit-frame-pointer2019121320190816
6351949492 512 025119 1400 1552refgcc_-funroll-loops_-fno-schedule-insns_-Os_-fomit-frame-pointer2019121320190816
6356079492 512 025119 1400 1552refgcc_-funroll-loops_-Os_-fomit-frame-pointer2019121320190816
63973710380 512 026879 1416 1568refgcc_-O_-fomit-frame-pointer2019121320190816
139181031620 512 049580 1320 1584refclang_-O3_-fomit-frame-pointer_-Qunused-arguments2019121320190816
139304931620 512 049644 1320 1584refclang_-O3_-fwrapv_-mavx_-maes_-mpclmul_-fomit-frame-pointer_-Qunused-arguments2019121320190816
139676631620 512 049644 1320 1584refclang_-O3_-fwrapv_-mavx2_-fomit-frame-pointer_-Qunused-arguments2019121320190816
140791735880 512 054887 1424 1584refgcc_-funroll-loops_-O3_-fomit-frame-pointer2019121320190816
142361130812 512 048487 1424 1584refgcc_-O3_-fomit-frame-pointer2019121320190816
144839117208 512 035839 1416 1568refgcc_-funroll-loops_-fno-schedule-insns_-O2_-fomit-frame-pointer2019121320190816
146615017260 512 035815 1416 1568refgcc_-funroll-loops_-O2_-fomit-frame-pointer2019121320190816
162474210192 512 026615 1416 1568refgcc_-fno-schedule-insns_-O2_-fomit-frame-pointer2019121320190816
165778210244 512 026711 1416 1568refgcc_-O2_-fomit-frame-pointer2019121320190816
167678010404 512 026999 1416 1568refgcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv2019121320190816
16776069500 512 025071 1400 1552refgcc_-Os_-fomit-frame-pointer2019121320190816
16800849500 512 025071 1400 1552refgcc_-fno-schedule-insns_-Os_-fomit-frame-pointer2019121320190816
16966049548 512 025135 1400 1552refgcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv2019121320190816
176351016356 512 036079 1416 1568refgcc_-funroll-loops_-fno-schedule-insns_-O_-fomit-frame-pointer2019121320190816
188121510380 512 026879 1416 1568refgcc_-fno-schedule-insns_-O_-fomit-frame-pointer2019121320190816
190103910384 512 026879 1416 1568refgcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv2019121320190816
336099418904 512 038319 1400 1568refcc2019121320190816
768716918904 512 038319 1400 1568refgcc_-funroll-loops2019121320190816
772351318904 512 038319 1400 1568refgcc2019121320190816

Compiler output

Implementation: avx2
Security model: unknown
Compiler: cc
KeccakP-1600-times4-SIMD256.c: KeccakP-1600-times4-SIMD256.c:19:10: fatal error: smmintrin.h: No such file or directory
KeccakP-1600-times4-SIMD256.c: #include <smmintrin.h>
KeccakP-1600-times4-SIMD256.c: ^~~~~~~~~~~~~
KeccakP-1600-times4-SIMD256.c: compilation terminated.

Number of similar (compiler,implementation) pairs: 23, namely:
CompilerImplementations
cc avx2
gcc avx2
gcc -O2 -fomit-frame-pointer avx2
gcc -O3 -fomit-frame-pointer avx2
gcc -O -fomit-frame-pointer avx2
gcc -Os -fomit-frame-pointer avx2
gcc -fno-schedule-insns -O2 -fomit-frame-pointer avx2
gcc -fno-schedule-insns -O3 -fomit-frame-pointer avx2
gcc -fno-schedule-insns -O -fomit-frame-pointer avx2
gcc -fno-schedule-insns -Os -fomit-frame-pointer avx2
gcc -funroll-loops avx2
gcc -funroll-loops -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -O -fomit-frame-pointer avx2
gcc -funroll-loops -Os -fomit-frame-pointer avx2
gcc -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer avx2
gcc -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer avx2
gcc -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer avx2
gcc -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer avx2
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv avx2
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv avx2
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv avx2
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv avx2

Compiler output

Implementation: avx2
Security model: unknown
Compiler: clang -O3 -fomit-frame-pointer -Qunused-arguments
KeccakP-1600-times4-SIMD256.c: In file included from KeccakP-1600-times4-SIMD256.c:19:
KeccakP-1600-times4-SIMD256.c: In file included from /usr/lib/llvm-7/lib/clang/7.0.1/include/smmintrin.h:27:
KeccakP-1600-times4-SIMD256.c: In file included from /usr/lib/llvm-7/lib/clang/7.0.1/include/tmmintrin.h:27:
KeccakP-1600-times4-SIMD256.c: In file included from /usr/lib/llvm-7/lib/clang/7.0.1/include/pmmintrin.h:27:
KeccakP-1600-times4-SIMD256.c: In file included from /usr/lib/llvm-7/lib/clang/7.0.1/include/emmintrin.h:27:
KeccakP-1600-times4-SIMD256.c: In file included from /usr/lib/llvm-7/lib/clang/7.0.1/include/xmmintrin.h:27:
KeccakP-1600-times4-SIMD256.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:64:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
KeccakP-1600-times4-SIMD256.c: return (__m64)__builtin_ia32_vec_init_v2si(__i, 0);
KeccakP-1600-times4-SIMD256.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
KeccakP-1600-times4-SIMD256.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:143:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
KeccakP-1600-times4-SIMD256.c: return (__m64)__builtin_ia32_packsswb((__v4hi)__m1, (__v4hi)__m2);
KeccakP-1600-times4-SIMD256.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
KeccakP-1600-times4-SIMD256.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:173:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
KeccakP-1600-times4-SIMD256.c: return (__m64)__builtin_ia32_packssdw((__v2si)__m1, (__v2si)__m2);
KeccakP-1600-times4-SIMD256.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
KeccakP-1600-times4-SIMD256.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:203:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
KeccakP-1600-times4-SIMD256.c: return (__m64)__builtin_ia32_packuswb((__v4hi)__m1, (__v4hi)__m2);
KeccakP-1600-times4-SIMD256.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
KeccakP-1600-times4-SIMD256.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:230:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
KeccakP-1600-times4-SIMD256.c: return (__m64)__builtin_ia32_punpckhbw((__v8qi)__m1, (__v8qi)__m2);
KeccakP-1600-times4-SIMD256.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
KeccakP-1600-times4-SIMD256.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:253:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
KeccakP-1600-times4-SIMD256.c: return (__m64)__builtin_ia32_punpckhwd((__v4hi)__m1, (__v4hi)__m2);
KeccakP-1600-times4-SIMD256.c: ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
KeccakP-1600-times4-SIMD256.c: /usr/lib/llvm-7/lib/clang/7.0.1/include/mmintrin.h:274:12: error: invalid conversion between vector type '__m64' (vector of 1 'long long' value) and integer type 'int' of different size
KeccakP-1600-times4-SIMD256.c: ...

Number of similar (compiler,implementation) pairs: 5, namely:
CompilerImplementations
clang -O3 -fomit-frame-pointer -Qunused-arguments avx2
clang -O3 -fwrapv -mavx2 -fomit-frame-pointer -Qunused-arguments avx2
clang -O3 -fwrapv -mavx -fomit-frame-pointer -Qunused-arguments avx2
clang -O3 -fwrapv -mavx -maes -mpclmul -fomit-frame-pointer -Qunused-arguments avx2
clang -mcpu=native -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments avx2