| Time | Implementation | Compiler | Benchmark date | SUPERCOP version |
| --- | --- | --- | --- | --- |
| 1454013 | simple | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161210 | 20161026 |
| 1457934 | unrolled | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161210 | 20161026 |
| 1508457 | unrolled | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161210 | 20161026 |
| 1514713 | simple | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161210 | 20161026 |
| 1571872 | simple | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161210 | 20161026 |
| 1573047 | unrolled | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161210 | 20161026 |
| 1585345 | unrolled | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161210 | 20161026 |
| 1591775 | unrolled | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161210 | 20161026 |
| 1826858 | unrolled | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161210 | 20161026 |
| 2584965 | spec | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161210 | 20161026 |
| 2592325 | spec | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161210 | 20161026 |
| 2748332 | spec | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161210 | 20161026 |
| 6117838 | spec | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161210 | 20161026 |
| 6246520 | spec | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161210 | 20161026 |
| 6291478 | spec | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161210 | 20161026 |
| 8428395 | simple | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161210 | 20161026 |
| 8719464 | simple | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161210 | 20161026 |
| 8719527 | simple | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161210 | 20161026 |