| Time | Implementation | Compiler | Benchmark date | SUPERCOP version |
| --- | --- | --- | --- | --- |
| 2595228 | unrolled | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161214 | 20161026 |
| 2604125 | simple | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161214 | 20161026 |
| 2657804 | simple | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161214 | 20161026 |
| 2657860 | unrolled | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161214 | 20161026 |
| 2709788 | unrolled | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161214 | 20161026 |
| 2738910 | unrolled | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161214 | 20161026 |
| 2739493 | unrolled | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161214 | 20161026 |
| 2767282 | simple | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161214 | 20161026 |
| 3232275 | unrolled | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161214 | 20161026 |
| 3430574 | spec | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161214 | 20161026 |
| 3530928 | spec | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161214 | 20161026 |
| 3551590 | spec | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161214 | 20161026 |
| 9190038 | spec | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161214 | 20161026 |
| 10084552 | spec | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161214 | 20161026 |
| 10598481 | spec | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161214 | 20161026 |
| 14740588 | simple | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161214 | 20161026 |
| 14774316 | simple | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161214 | 20161026 |
| 15054940 | simple | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161214 | 20161026 |
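
The rows above are easier to compare as ratios against the fastest entry. The following is a minimal sketch under stated assumptions, not part of the benchmark tooling: the function names `best_per_implementation` and `slowdown` are hypothetical, and only a handful of rows are copied from the table to keep it short.

```python
# A subset of rows copied verbatim from the table above:
# (time, implementation, compiler flags).
ROWS = [
    (2595228, "unrolled", "gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2"),
    (2604125, "simple", "gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2"),
    (3430574, "spec", "gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3"),
    (14774316, "simple", "gcc -mcpu=native -mfpu=neon-vfpv4 -O2"),
    (15054940, "simple", "gcc -mcpu=native -mfpu=neon-vfpv4 -Os"),
]


def best_per_implementation(rows):
    """Return {implementation: (time, flags)}, keeping the fastest row for each."""
    best = {}
    for time, impl, flags in rows:
        if impl not in best or time < best[impl][0]:
            best[impl] = (time, flags)
    return best


def slowdown(rows):
    """Return (implementation, flags, ratio) with ratio relative to the fastest row."""
    fastest = min(time for time, _, _ in rows)
    return [(impl, flags, time / fastest) for time, impl, flags in rows]


if __name__ == "__main__":
    for impl, (time, flags) in best_per_implementation(ROWS).items():
        print(f"{impl}: {time} with {flags}")
    for impl, flags, ratio in slowdown(ROWS):
        print(f"{ratio:.2f}x  {impl}  {flags}")
```

Extending `ROWS` with the remaining table entries gives the full comparison; the ratios depend only on the Time column, so no assumption about its unit is needed.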