Implementation notes: amd64, gpu, crypto_stream/cryptmtv3

Computer: gpu
Architecture: amd64
CPU ID: GenuineIntel-000206d7-bfebfbff
SUPERCOP version: 20190816
Operation: crypto_stream
Primitive: cryptmtv3
TimeObject sizeTest sizeImplementationCompilerBenchmark dateSUPERCOP version
767210362 4 127005 764 1160e/v3clang_-O3_-fwrapv_-march=native_-fomit-frame-pointer_-Qunused-arguments2019082420190816
801910449 4 128085 764 824e/v3clang_-O3_-fomit-frame-pointer_-Qunused-arguments2019082420190816
815616282 4 431005 772 864e/v3gcc_-funroll-loops_-m64_-march=nocona_-O2_-fomit-frame-pointer2019082420190816
840616352 4 431453 772 864e/v3gcc_-funroll-loops_-m64_-O2_-fomit-frame-pointer2019082420190816
847627424 4 444790 780 864e/v3gcc_-funroll-loops_-m64_-march=barcelona_-O3_-fomit-frame-pointer2019082420190816
858818880 4 435470 780 864e/v3gcc_-funroll-loops_-O3_-fomit-frame-pointer2019082420190816
861220992 4 435590 780 864e/v3gcc_-m64_-march=k8_-O3_-fomit-frame-pointer2019082420190816
864810314 4 126301 764 1160e/v3clang_-O3_-fwrapv_-mavx_-maes_-mpclmul_-fomit-frame-pointer_-Qunused-arguments2019082420190816
873211160 4 423037 772 864e/v3gcc_-m64_-march=core2_-msse4.1_-O2_-fomit-frame-pointer2019082420190816
873617286 4 432445 772 864e/v3gcc_-funroll-loops_-m64_-march=nocona_-O_-fomit-frame-pointer2019082420190816
877216352 4 431453 772 864e/v3gcc_-funroll-loops_-O2_-fomit-frame-pointer2019082420190816
885618880 4 435470 780 864e/v3gcc_-funroll-loops_-fno-schedule-insns_-O3_-fomit-frame-pointer2019082420190816
886818166 4 434438 780 864e/v3gcc_-funroll-loops_-march=nocona_-O3_-fomit-frame-pointer2019082420190816
888427104 4 444366 780 864e/v3gcc_-funroll-loops_-m64_-march=k8_-O3_-fomit-frame-pointer2019082420190816
890816032 4 431261 772 864e/v3gcc_-funroll-loops_-march=barcelona_-O2_-fomit-frame-pointer2019082420190816
893218880 4 435470 780 864e/v3gcc_-funroll-loops_-m64_-O3_-fomit-frame-pointer2019082420190816
893217286 4 432445 772 864e/v3gcc_-funroll-loops_-march=nocona_-O_-fomit-frame-pointer2019082420190816
896017301 4 432541 772 864e/v3gcc_-funroll-loops_-m64_-O_-fomit-frame-pointer2019082420190816
897615776 4 430837 772 864e/v3gcc_-funroll-loops_-m64_-march=k8_-O2_-fomit-frame-pointer2019082420190816
898816032 4 431261 772 864e/v3gcc_-funroll-loops_-m64_-march=barcelona_-O2_-fomit-frame-pointer2019082420190816
900410888 4 422709 772 864e/v3gcc_-m64_-march=corei7-avx_-O2_-fomit-frame-pointer2019082420190816
902416282 4 431005 772 864e/v3gcc_-funroll-loops_-march=nocona_-O2_-fomit-frame-pointer2019082420190816
903617308 4 432317 772 864e/v3gcc_-funroll-loops_-m64_-march=barcelona_-O_-fomit-frame-pointer2019082420190816
904810976 4 422797 772 864e/v3gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv2019082420190816
908011160 4 423037 772 864e/v3gcc_-m64_-march=core2_-msse4_-O2_-fomit-frame-pointer2019082420190816
908412410 4 424205 772 864e/v3gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv2019082420190816
908811104 4 422981 772 864e/v3gcc_-march=k8_-O2_-fomit-frame-pointer2019082420190816
911210362 4 127005 764 1160e/v3clang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments2019082420190816
912011160 4 423037 772 864e/v3gcc_-m64_-march=core2_-O2_-fomit-frame-pointer2019082420190816
913221344 4 436110 780 864e/v3gcc_-march=barcelona_-O3_-fomit-frame-pointer2019082420190816
916411144 4 422965 772 864e/v3gcc_-m64_-march=corei7_-O2_-fomit-frame-pointer2019082420190816
917212688 4 426998 780 864e/v3gcc_-m64_-march=corei7-avx_-O3_-fomit-frame-pointer2019082420190816
918016352 4 431453 772 864e/v3gcc_-funroll-loops_-fno-schedule-insns_-O2_-fomit-frame-pointer2019082420190816
918017301 4 432541 772 864e/v3gcc_-funroll-loops_-fno-schedule-insns_-O_-fomit-frame-pointer2019082420190816
92125883 4 416665 756 832e/v3gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv2019082420190816
922012688 4 426998 780 864e/v3gcc_-m64_-march=native_-mtune=native_-O3_-fomit-frame-pointer2019082420190816
922817308 4 432317 772 864e/v3gcc_-funroll-loops_-march=barcelona_-O_-fomit-frame-pointer2019082420190816
927212568 4 426822 780 864e/v3gcc_-fno-schedule-insns_-O3_-fomit-frame-pointer2019082420190816
927212688 4 426998 780 864e/v3gcc_-m64_-march=core-avx-i_-O3_-fomit-frame-pointer2019082420190816
929217301 4 432541 772 864e/v3gcc_-funroll-loops_-O_-fomit-frame-pointer2019082420190816
932012568 4 426878 780 864e/v3gcc_-m64_-march=core2_-O3_-fomit-frame-pointer2019082420190816
934010977 4 129941 764 824e/v3clang_-mcpu=cortex-a8_-mfpu=neon_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments2019082420190816
936815776 4 430837 772 864e/v3gcc_-funroll-loops_-march=k8_-O2_-fomit-frame-pointer2019082420190816
936812568 4 426798 780 864e/v3gcc_-m64_-march=core2_-msse4.1_-O3_-fomit-frame-pointer2019082420190816
93685795 4 416577 756 832e/v3gcc_-m64_-march=native_-mtune=native_-Os_-fomit-frame-pointer2019082420190816
938442688 8 836110 780 864e/v3gcc_-m64_-march=barcelona_-O3_-fomit-frame-pointer2019082420190816
938810314 4 126301 764 1160e/v3clang_-O3_-fwrapv_-mavx_-fomit-frame-pointer_-Qunused-arguments2019082420190816
938827424 4 444790 780 864e/v3gcc_-funroll-loops_-march=barcelona_-O3_-fomit-frame-pointer2019082420190816
943610888 4 422709 772 864e/v3gcc_-m64_-march=native_-mtune=native_-O2_-fomit-frame-pointer2019082420190816
944412944 4 427158 780 864e/v3gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv2019082420190816
949212711 4 424509 772 864e/v3gcc_-fno-schedule-insns_-O_-fomit-frame-pointer2019082420190816
951620992 4 435590 780 864e/v3gcc_-march=k8_-O3_-fomit-frame-pointer2019082420190816
952812568 4 426862 780 864e/v3gcc_-m64_-march=core2_-msse4_-O3_-fomit-frame-pointer2019082420190816
954412568 4 426822 780 864e/v3gcc_-O3_-fomit-frame-pointer2019082420190816
955210888 4 422709 772 864e/v3gcc_-m64_-march=core-avx-i_-O2_-fomit-frame-pointer2019082420190816
95565913 4 416713 756 832e/v3gcc_-funroll-loops_-march=nocona_-Os_-fomit-frame-pointer2019082420190816
95605798 4 416561 756 832e/v3gcc_-m64_-march=corei7_-Os_-fomit-frame-pointer2019082420190816
956812388 4 424173 772 864e/v3gcc_-m64_-march=corei7-avx_-O_-fomit-frame-pointer2019082420190816
957610977 4 129941 764 824e/v3clang_-mcpu=cortex-a9_-mfpu=neon_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments2019082420190816
957612388 4 424173 772 864e/v3gcc_-m64_-march=native_-mtune=native_-O_-fomit-frame-pointer2019082420190816
958418166 4 434438 780 864e/v3gcc_-funroll-loops_-m64_-march=nocona_-O3_-fomit-frame-pointer2019082420190816
963611144 4 422965 772 864e/v3gcc_-fno-schedule-insns_-O2_-fomit-frame-pointer2019082420190816
963627104 4 444366 780 864e/v3gcc_-funroll-loops_-march=k8_-O3_-fomit-frame-pointer2019082420190816
964010882 4 422397 772 864e/v3gcc_-m64_-march=nocona_-O2_-fomit-frame-pointer2019082420190816
964411384 4 423525 772 864e/v3gcc_-march=barcelona_-O2_-fomit-frame-pointer2019082420190816
972410977 4 129941 764 824e/v3clang_-mcpu=native_-mfpu=neon_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments2019082420190816
977612651 4 424445 772 864e/v3gcc_-m64_-march=corei7_-O_-fomit-frame-pointer2019082420190816
97805795 4 416577 756 832e/v3gcc_-m64_-march=core-avx-i_-Os_-fomit-frame-pointer2019082420190816
983617254 4 432485 772 864e/v3gcc_-funroll-loops_-march=k8_-O_-fomit-frame-pointer2019082420190816
986417254 4 432485 772 864e/v3gcc_-funroll-loops_-m64_-march=k8_-O_-fomit-frame-pointer2019082420190816
98725780 4 416545 756 832e/v3gcc_-m64_-march=core2_-msse4_-Os_-fomit-frame-pointer2019082420190816
990411104 4 422981 772 864e/v3gcc_-m64_-march=k8_-O2_-fomit-frame-pointer2019082420190816
992811144 4 422965 772 864e/v3gcc_-m64_-O2_-fomit-frame-pointer2019082420190816
994811144 4 422965 772 864e/v3gcc_-O2_-fomit-frame-pointer2019082420190816
99605780 4 416545 756 832e/v3gcc_-m64_-march=core2_-msse4.1_-Os_-fomit-frame-pointer2019082420190816
100165900 4 416689 756 832e/v3gcc_-funroll-loops_-march=barcelona_-Os_-fomit-frame-pointer2019082420190816
1006812348 4 426334 780 864e/v3gcc_-m64_-march=nocona_-O3_-fomit-frame-pointer2019082420190816
1008012568 4 426822 780 864e/v3gcc_-m64_-O3_-fomit-frame-pointer2019082420190816
1010022768 8 823525 772 864e/v3gcc_-m64_-march=barcelona_-O2_-fomit-frame-pointer2019082420190816
1010012348 4 426334 780 864e/v3gcc_-march=nocona_-O3_-fomit-frame-pointer2019082420190816
1010812710 4 424493 772 864e/v3gcc_-m64_-march=core2_-msse4_-O_-fomit-frame-pointer2019082420190816
1013212600 4 426678 780 864e/v3gcc_-m64_-march=corei7_-O3_-fomit-frame-pointer2019082420190816
101965795 4 416577 756 832e/v3gcc_-m64_-march=corei7-avx_-Os_-fomit-frame-pointer2019082420190816
1019612743 4 424541 772 864e/v3gcc_-march=nocona_-O_-fomit-frame-pointer2019082420190816
102165900 4 416689 756 832e/v3gcc_-funroll-loops_-march=k8_-Os_-fomit-frame-pointer2019082420190816
1028012710 4 424493 772 864e/v3gcc_-m64_-march=core2_-O_-fomit-frame-pointer2019082420190816
1028412388 4 424173 772 864e/v3gcc_-m64_-march=core-avx-i_-O_-fomit-frame-pointer2019082420190816
102885900 4 416689 756 832e/v3gcc_-funroll-loops_-m64_-Os_-fomit-frame-pointer2019082420190816
1029610882 4 422397 772 864e/v3gcc_-march=nocona_-O2_-fomit-frame-pointer2019082420190816
1034812713 4 424549 772 864e/v3gcc_-march=k8_-O_-fomit-frame-pointer2019082420190816
1036412713 4 424549 772 864e/v3gcc_-m64_-march=k8_-O_-fomit-frame-pointer2019082420190816
103725780 4 416545 756 832e/v3gcc_-m64_-march=core2_-Os_-fomit-frame-pointer2019082420190816
1037612710 4 424493 772 864e/v3gcc_-m64_-march=core2_-msse4.1_-O_-fomit-frame-pointer2019082420190816
104045780 4 416545 756 832e/v3gcc_-m64_-Os_-fomit-frame-pointer2019082420190816
104125900 4 416689 756 832e/v3gcc_-funroll-loops_-m64_-march=barcelona_-Os_-fomit-frame-pointer2019082420190816
105085780 4 416545 756 832e/v3gcc_-march=k8_-Os_-fomit-frame-pointer2019082420190816
105365780 4 416545 756 832e/v3gcc_-Os_-fomit-frame-pointer2019082420190816
105605900 4 416689 756 832e/v3gcc_-funroll-loops_-fno-schedule-insns_-Os_-fomit-frame-pointer2019082420190816
1062412707 4 424461 772 864e/v3gcc_-march=barcelona_-O_-fomit-frame-pointer2019082420190816
1064012711 4 424509 772 864e/v3gcc_-O_-fomit-frame-pointer2019082420190816
107085795 4 416553 756 832e/v3gcc_-march=nocona_-Os_-fomit-frame-pointer2019082420190816
107565780 4 416545 756 832e/v3gcc_-fno-schedule-insns_-Os_-fomit-frame-pointer2019082420190816
107685913 4 416713 756 832e/v3gcc_-funroll-loops_-m64_-march=nocona_-Os_-fomit-frame-pointer2019082420190816
1078412743 4 424541 772 864e/v3gcc_-m64_-march=nocona_-O_-fomit-frame-pointer2019082420190816
1084811560 8 816545 756 832e/v3gcc_-m64_-march=barcelona_-Os_-fomit-frame-pointer2019082420190816
108685900 4 416689 756 832e/v3gcc_-funroll-loops_-Os_-fomit-frame-pointer2019082420190816
1092025414 8 824461 772 864e/v3gcc_-m64_-march=barcelona_-O_-fomit-frame-pointer2019082420190816
109245780 4 416545 756 832e/v3gcc_-m64_-march=k8_-Os_-fomit-frame-pointer2019082420190816
109525780 4 416545 756 832e/v3gcc_-march=barcelona_-Os_-fomit-frame-pointer2019082420190816
1102812711 4 424509 772 864e/v3gcc_-m64_-O_-fomit-frame-pointer2019082420190816
110405795 4 416553 756 832e/v3gcc_-m64_-march=nocona_-Os_-fomit-frame-pointer2019082420190816
111885900 4 416689 756 832e/v3gcc_-funroll-loops_-m64_-march=k8_-Os_-fomit-frame-pointer2019082420190816
7142414603 4 428063 772 824e/v3gcc2019082420190816
7304814603 4 428063 772 824e/v3gcc_-funroll-loops2019082420190816
7494814603 4 428063 772 824e/v3cc2019082420190816

Test failure

Implementation: e/v3
Security model: unknown
Compiler: gcc -m64 -march=core-avx2 -O2 -fomit-frame-pointer
error 111

Number of similar (compiler,implementation) pairs: 3, namely:
CompilerImplementations
gcc -m64 -march=core-avx2 -O2 -fomit-frame-pointer e/v3
gcc -m64 -march=core-avx2 -O -fomit-frame-pointer e/v3
gcc -m64 -march=core-avx2 -Os -fomit-frame-pointer e/v3

Compiler output

Implementation: e/v3
Security model: unknown
Compiler: clang -O3 -fomit-frame-pointer -Qunused-arguments
cryptmt-v3.c: In file included from cryptmt-v3.c:2:
cryptmt-v3.c: ./e/cryptmt-v3.c:189:18: warning: array index 156 is past the end of the array (which contains 156 elements) [-Warray-bounds]
cryptmt-v3.c: do_recursion(ps->sfmt[N], ps->sfmt[POS1], ps->sfmt[N - 1]);
cryptmt-v3.c: ^ ~
cryptmt-v3.c: ./e/cryptmt-v3.c:25:5: note: array 'sfmt' declared here
cryptmt-v3.c: u32 sfmt[N][4];
cryptmt-v3.c: ^
cryptmt-v3.c: ./e/cryptmt-v3.c:403:27: warning: 'memset' call operates on objects of type 'crypto_stream_cryptmtv3_e_v3_ECRYPT_ctx' while the size is based on a different type 'crypto_stream_cryptmtv3_e_v3_ECRYPT_ctx *' [-Wsizeof-pointer-memaccess]
cryptmt-v3.c: memset(ctx, 0, sizeof(ctx));
cryptmt-v3.c: ~~~ ^~~
cryptmt-v3.c: ./e/cryptmt-v3.c:403:27: note: did you mean to dereference the argument to 'sizeof' (and multiply it by the number of elements)?
cryptmt-v3.c: memset(ctx, 0, sizeof(ctx));
cryptmt-v3.c: ^~~
cryptmt-v3.c: 2 warnings generated.

Number of similar (compiler,implementation) pairs: 8, namely:
CompilerImplementations
clang -O3 -fomit-frame-pointer -Qunused-arguments e/v3
clang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments e/v3
clang -O3 -fwrapv -mavx -fomit-frame-pointer -Qunused-arguments e/v3
clang -O3 -fwrapv -mavx -maes -mpclmul -fomit-frame-pointer -Qunused-arguments e/v3
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments e/v3
clang -mcpu=cortex-a8 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments e/v3
clang -mcpu=cortex-a9 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments e/v3
clang -mcpu=native -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments e/v3