| Compiler | Implementations |
| gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | ref |
| gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | ref |
| gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | ref |
| gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | ref |
| gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | ref |
| gcc -mcpu=native -mfpu=neon-vfpv4 -Os | ref |
| Compiler | Implementations |
| gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | avx1 |
| Compiler | Implementations |
| gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O2 | sse4 |
| Compiler | Implementations |
| gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | avx1 |
| Compiler | Implementations |
| gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -O3 | sse4 |
| Compiler | Implementations |
| gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | sse4 |
| Compiler | Implementations |
| gcc -funroll-loops -mcpu=native -mfpu=neon-vfpv4 -Os | avx1 |
| Compiler | Implementations |
| gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | sse4 |
| Compiler | Implementations |
| gcc -mcpu=native -mfpu=neon-vfpv4 -O2 | avx1 |
| Compiler | Implementations |
| gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | avx1 |
| Compiler | Implementations |
| gcc -mcpu=native -mfpu=neon-vfpv4 -O3 | sse4 |
| Compiler | Implementations |
| gcc -mcpu=native -mfpu=neon-vfpv4 -Os | sse4 |
| Compiler | Implementations |
| gcc -mcpu=native -mfpu=neon-vfpv4 -Os | avx1 |