| Time | Implementation | Compiler | Benchmark date | SUPERCOP version |
| 790453 | reference | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161221 | 20161026 |
| 790459 | compact | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161221 | 20161026 |
| 884211 | reference | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161221 | 20161026 |
| 891830 | compact | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161221 | 20161026 |
| 913529 | reference | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161221 | 20161026 |
| 918111 | compact | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161221 | 20161026 |
| 1385097 | ref | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161221 | 20161026 |
| 1511995 | ref | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161221 | 20161026 |
| 1598357 | ref | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161221 | 20161026 |
| 3656042 | compact | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161221 | 20161026 |
| 3656410 | reference | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161221 | 20161026 |
| 4135295 | ref | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161221 | 20161026 |
| 4483855 | reference | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161221 | 20161026 |
| 4484705 | compact | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161221 | 20161026 |
| 4676840 | reference | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161221 | 20161026 |
| 4677078 | compact | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161221 | 20161026 |
| 4929700 | ref | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161221 | 20161026 |
| 5081823 | ref | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161221 | 20161026 |