| Time | Implementation | Compiler | Benchmark date | SUPERCOP version |
| 263345 | opt | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161210 | 20161026 |
| 267971 | opt | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161210 | 20161026 |
| 269560 | opt | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161210 | 20161026 |
| 280334 | opt | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161210 | 20161026 |
| 281877 | opt | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161210 | 20161026 |
| 282319 | opt | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161210 | 20161026 |
| 407988 | ref | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161210 | 20161026 |
| 418656 | ref | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161210 | 20161026 |
| 430503 | ref | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161210 | 20161026 |
| 472988 | ref | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161210 | 20161026 |
| 493552 | ref | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161210 | 20161026 |
| 529967 | ref | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161210 | 20161026 |