| Time | Implementation | Compiler | Benchmark date | SUPERCOP version |
| 1341711 | unrolled | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161214 | 20161026 |
| 1346653 | simple | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161214 | 20161026 |
| 1370924 | simple | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161214 | 20161026 |
| 1374131 | unrolled | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161214 | 20161026 |
| 1400538 | unrolled | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161214 | 20161026 |
| 1416178 | unrolled | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161214 | 20161026 |
| 1424453 | unrolled | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161214 | 20161026 |
| 1428556 | simple | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161214 | 20161026 |
| 1677163 | unrolled | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161214 | 20161026 |
| 1992837 | spec | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161214 | 20161026 |
| 2065029 | spec | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161214 | 20161026 |
| 2072386 | spec | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161214 | 20161026 |
| 4993978 | spec | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161214 | 20161026 |
| 5462706 | spec | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161214 | 20161026 |
| 5748866 | spec | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161214 | 20161026 |
| 7595764 | simple | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161214 | 20161026 |
| 7603611 | simple | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161214 | 20161026 |
| 7759012 | simple | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161214 | 20161026 |