| Time | Implementation | Compiler | Benchmark date | SUPERCOP version |
| 91843 | faster | gcc -funroll-loops -mcpu=marvell-pj4 -O3 | 20161211 | 20161026 |
| 92593 | faster | gcc -mcpu=marvell-pj4 -O3 | 20161211 | 20161026 |
| 98602 | ref | gcc -funroll-loops -mcpu=marvell-pj4 -O3 | 20161211 | 20161026 |
| 102079 | faster | gcc -funroll-loops -mcpu=marvell-pj4 -O2 | 20161211 | 20161026 |
| 106503 | ref | gcc -mcpu=marvell-pj4 -O3 | 20161211 | 20161026 |
| 109957 | faster | gcc -mcpu=marvell-pj4 -O2 | 20161211 | 20161026 |
| 114863 | ref | gcc -funroll-loops -mcpu=marvell-pj4 -O2 | 20161211 | 20161026 |
| 118268 | faster | gcc -mcpu=marvell-pj4 -Os | 20161211 | 20161026 |
| 118884 | faster | gcc -funroll-loops -mcpu=marvell-pj4 -Os | 20161211 | 20161026 |
| 143152 | ref | gcc -mcpu=marvell-pj4 -O2 | 20161211 | 20161026 |
| 156194 | ref | gcc -funroll-loops -mcpu=marvell-pj4 -Os | 20161211 | 20161026 |
| 157608 | ref | gcc -mcpu=marvell-pj4 -Os | 20161211 | 20161026 |