| Time | Implementation | Compiler | Benchmark date | SUPERCOP version |
| 561153 | opt | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161210 | 20161026 |
| 563692 | opt | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161210 | 20161026 |
| 620966 | opt | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161210 | 20161026 |
| 643856 | opt | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161210 | 20161026 |
| 700402 | opt | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161210 | 20161026 |
| 715904 | opt | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161210 | 20161026 |
| 26577039 | ref | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161210 | 20161026 |
| 27041676 | ref | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161210 | 20161026 |
| 53926793 | ref | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161210 | 20161026 |
| 75229689 | ref | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161210 | 20161026 |
| 89222639 | ref | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161210 | 20161026 |
| 89535267 | ref | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161210 | 20161026 |