| Time | Implementation | Compiler | Benchmark date | SUPERCOP version |
| 523787 | neon | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161219 | 20161026 |
| 529521 | neon | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161219 | 20161026 |
| 537232 | neon | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161219 | 20161026 |
| 578927 | neon | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161219 | 20161026 |
| 582384 | neon | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161219 | 20161026 |
| 590715 | neon | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161219 | 20161026 |
| 1248946 | ref | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161219 | 20161026 |
| 1262506 | ref | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161219 | 20161026 |
| 2144591 | ref | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161219 | 20161026 |
| 2742059 | ref | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161219 | 20161026 |
| 2894711 | ref | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161219 | 20161026 |
| 3088495 | ref | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161219 | 20161026 |