| Time | Implementation | Compiler | Benchmark date | SUPERCOP version |
| 4033807 | neon | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161226 | 20161026 |
| 4051795 | neon | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161226 | 20161026 |
| 4083747 | neon | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161226 | 20161026 |
| 4299679 | neon | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161226 | 20161026 |
| 4395559 | neon | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161226 | 20161026 |
| 4469428 | neon | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161226 | 20161026 |
| 9748068 | arm32 | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161226 | 20161026 |
| 9794817 | arm32 | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161226 | 20161026 |
| 9830485 | arm32 | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161226 | 20161026 |
| 10208952 | arm32 | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161226 | 20161026 |
| 11282662 | arm32 | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161226 | 20161026 |
| 11717543 | arm32 | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161226 | 20161026 |
| 11740894 | 32 | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161226 | 20161026 |
| 12525008 | 32 | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161226 | 20161026 |
| 12941359 | 32 | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161226 | 20161026 |
| 13177998 | 32 | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161226 | 20161026 |
| 14776407 | 32 | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161226 | 20161026 |
| 15940332 | 32 | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161226 | 20161026 |