| Time | Implementation | Compiler | Benchmark date | SUPERCOP version |
| --- | --- | --- | --- | --- |
| 719503 | unrolled | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161214 | 20161026 |
| 722062 | simple | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161214 | 20161026 |
| 732313 | simple | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161214 | 20161026 |
| 738071 | unrolled | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161214 | 20161026 |
| 752050 | unrolled | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161214 | 20161026 |
| 760660 | unrolled | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161214 | 20161026 |
| 765857 | simple | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161214 | 20161026 |
| 772676 | unrolled | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161214 | 20161026 |
| 905774 | unrolled | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161214 | 20161026 |
| 1273244 | spec | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161214 | 20161026 |
| 1335848 | spec | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161214 | 20161026 |
| 1340574 | spec | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161214 | 20161026 |
| 2871724 | spec | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161214 | 20161026 |
| 3158594 | spec | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161214 | 20161026 |
| 3332565 | spec | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161214 | 20161026 |
| 4014230 | simple | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161214 | 20161026 |
| 4016689 | simple | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161214 | 20161026 |
| 4099063 | simple | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161214 | 20161026 |