| Time | Implementation | Compiler | Benchmark date | SUPERCOP version |
| --- | --- | --- | --- | --- |
| 66184 | faster | gcc -march=loongson2f -mabi=n32 -O3 | 20161214 | 20161026 |
| 69370 | faster | gcc -funroll-loops -march=loongson2f -mabi=n32 -O3 | 20161214 | 20161026 |
| 69868 | faster | gcc -funroll-loops -march=loongson2f -mabi=n32 -O2 | 20161214 | 20161026 |
| 77192 | faster | gcc -march=loongson2f -mabi=n32 -O2 | 20161214 | 20161026 |
| 78032 | ref | gcc -march=loongson2f -mabi=n32 -O3 | 20161214 | 20161026 |
| 79912 | ref | gcc -funroll-loops -march=loongson2f -mabi=n32 -O3 | 20161214 | 20161026 |
| 93040 | faster | gcc -march=loongson2f -mabi=n32 -Os | 20161214 | 20161026 |
| 93362 | faster | gcc -funroll-loops -march=loongson2f -mabi=n32 -Os | 20161214 | 20161026 |
| 102496 | ref | gcc -funroll-loops -march=loongson2f -mabi=n32 -O2 | 20161214 | 20161026 |
| 175700 | ref | gcc -march=loongson2f -mabi=n32 -Os | 20161214 | 20161026 |
| 179994 | ref | gcc -funroll-loops -march=loongson2f -mabi=n32 -Os | 20161214 | 20161026 |
| 203700 | ref | gcc -march=loongson2f -mabi=n32 -O2 | 20161214 | 20161026 |