| Time | Implementation | Compiler | Benchmark date | SUPERCOP version |
| ---: | --- | --- | --- | --- |
| 597923 | reference | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161215 | 20161026 |
| 683590 | reference | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161215 | 20161026 |
| 740670 | reference | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161215 | 20161026 |
| 985330 | compact | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161215 | 20161026 |
| 1223356 | compact | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161215 | 20161026 |
| 1263479 | compact | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161215 | 20161026 |
| 1338278 | ref | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161215 | 20161026 |
| 1488753 | ref | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161215 | 20161026 |
| 1497759 | ref | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161215 | 20161026 |
| 1705035 | compact | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161215 | 20161026 |
| 1764865 | compact | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161215 | 20161026 |
| 1795130 | compact | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161215 | 20161026 |
| 3490815 | reference | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161215 | 20161026 |
| 3882856 | reference | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161215 | 20161026 |
| 3928528 | ref | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161215 | 20161026 |
| 4076891 | reference | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161215 | 20161026 |
| 4176567 | ref | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161215 | 20161026 |
| 4192885 | ref | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161215 | 20161026 |