| Time (cycles) | Implementation | Compiler | Benchmark date | SUPERCOP version |
| --- | --- | --- | --- | --- |
| 118520 | opt-32 | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161214 | 20161026 |
| 119167 | opt-32 | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161214 | 20161026 |
| 120085 | opt-32 | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161214 | 20161026 |
| 122682 | opt-32 | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161214 | 20161026 |
| 133209 | opt-32 | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161214 | 20161026 |
| 136272 | opt-32 | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161214 | 20161026 |
| 326485 | opt-64 | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161214 | 20161026 |
| 330809 | opt-64 | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161214 | 20161026 |
| 462993 | opt-64 | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161214 | 20161026 |
| 574169 | opt-64 | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161214 | 20161026 |
| 578605 | opt-64 | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161214 | 20161026 |
| 580656 | opt-64 | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161214 | 20161026 |
| 3390105 | ref | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161214 | 20161026 |
| 3394418 | ref | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161214 | 20161026 |
| 3466704 | ref | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161214 | 20161026 |
| 3586277 | ref | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161214 | 20161026 |
| 3590295 | ref | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161214 | 20161026 |
| 3630106 | ref | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161214 | 20161026 |