| Time | Implementation | Compiler | Benchmark date | SUPERCOP version |
| 235344 | unrolled | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161214 | 20161026 |
| 236394 | unrolled | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161214 | 20161026 |
| 237549 | simple | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161214 | 20161026 |
| 237615 | unrolled | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161214 | 20161026 |
| 238866 | simple | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161214 | 20161026 |
| 243630 | unrolled | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161214 | 20161026 |
| 247131 | simple | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161214 | 20161026 |
| 251469 | unrolled | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161214 | 20161026 |
| 288208 | unrolled | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161214 | 20161026 |
| 726874 | spec | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161214 | 20161026 |
| 774916 | spec | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161214 | 20161026 |
| 781734 | spec | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161214 | 20161026 |
| 1123992 | simple | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161214 | 20161026 |
| 1134695 | simple | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161214 | 20161026 |
| 1158201 | simple | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161214 | 20161026 |
| 1207104 | spec | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161214 | 20161026 |
| 1331386 | spec | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161214 | 20161026 |
| 1414473 | spec | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161214 | 20161026 |