| Time (cycles) | Implementation | Compiler | Benchmark date | SUPERCOP version |
| --- | --- | --- | --- | --- |
| 1294875 | ref10 | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161217 | 20161026 |
| 1297737 | ref10 | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161217 | 20161026 |
| 1327231 | ref10 | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161217 | 20161026 |
| 1332248 | ref10 | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161217 | 20161026 |
| 1390597 | ref10 | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161217 | 20161026 |
| 1408262 | ref10 | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161217 | 20161026 |
| 10544321 | ref | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161217 | 20161026 |
| 10566879 | ref | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161217 | 20161026 |
| 15151070 | ref | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161217 | 20161026 |
| 22778816 | ref | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161217 | 20161026 |
| 25042738 | ref | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161217 | 20161026 |
| 25519893 | ref | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161217 | 20161026 |