| Time | Implementation | Compiler | Benchmark date | SUPERCOP version |
| 1508570 | arm | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161225 | 20161026 |
| 1520306 | arm | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161225 | 20161026 |
| 1714468 | arm | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161225 | 20161026 |
| 1806706 | arm | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161225 | 20161026 |
| 1808103 | arm | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161225 | 20161026 |
| 1808257 | arm | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161225 | 20161026 |
| 2071980 | ref | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161225 | 20161026 |
| 2098866 | ref | gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | 20161225 | 20161026 |
| 2335201 | ref | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161225 | 20161026 |
| 2472438 | ref | gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | 20161225 | 20161026 |
| 2509830 | ref | gcc -mcpu=native -mfpu=neon-vfpv4 -Os | 20161225 | 20161026 |
| 2513040 | ref | gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | 20161225 | 20161026 |