| Time | Implementation | Compiler | Benchmark date | SUPERCOP version |
| 441351 | simple | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161210 | 20161026 |
| 448135 | unrolled | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161210 | 20161026 |
| 453069 | unrolled | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161210 | 20161026 |
| 466169 | unrolled | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161210 | 20161026 |
| 467858 | simple | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161210 | 20161026 |
| 475078 | unrolled | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161210 | 20161026 |
| 476550 | unrolled | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161210 | 20161026 |
| 493537 | simple | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161210 | 20161026 |
| 539223 | unrolled | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161210 | 20161026 |
| 1191333 | spec | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161210 | 20161026 |
| 1235407 | spec | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161210 | 20161026 |
| 1246097 | spec | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161210 | 20161026 |
| 2178575 | spec | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161210 | 20161026 |
| 2329402 | spec | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161210 | 20161026 |
| 2356216 | spec | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161210 | 20161026 |
| 2419903 | simple | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161210 | 20161026 |
| 2591153 | simple | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161210 | 20161026 |
| 2700272 | simple | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161210 | 20161026 |