| Time | Implementation | Compiler | Benchmark date | SUPERCOP version |
| 194667 | opt-32 | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161216 | 20161026 |
| 196653 | opt-32 | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161216 | 20161026 |
| 198029 | opt-32 | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161216 | 20161026 |
| 201459 | opt-32 | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161216 | 20161026 |
| 403116 | opt-32 | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161216 | 20161026 |
| 404439 | opt-32 | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161216 | 20161026 |
| 461760 | opt-64 | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161216 | 20161026 |
| 474462 | opt-64 | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161216 | 20161026 |
| 605202 | opt-64 | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161216 | 20161026 |
| 803825 | opt-64 | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161216 | 20161026 |
| 809880 | opt-64 | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161216 | 20161026 |
| 817782 | opt-64 | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161216 | 20161026 |
| 4539094 | ref | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161216 | 20161026 |
| 4559830 | ref | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161216 | 20161026 |
| 4570488 | ref | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161216 | 20161026 |
| 4709730 | ref | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161216 | 20161026 |
| 4715881 | ref | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161216 | 20161026 |
| 4841632 | ref | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161216 | 20161026 |