Implementation notes: aarch64, par3, crypto_aead/hs1sivlov2

Computer: par3
Architecture: aarch64
CPU ID: unknown CPU ID
SUPERCOP version: 20170718
Operation: crypto_aead
Primitive: hs1sivlov2
Time   Object size  Test size  Implementation  Compiler                                            Benchmark date  SUPERCOP version
38800  ? ? ?        ? ? ?      ref             gcc_-funroll-loops_-march=native_-mtune=native_-O3  20170720        20170718
45650  ? ? ?        ? ? ?      ref             gcc_-funroll-loops_-march=native_-mtune=native_-O2  20170720        20170718
46689  ? ? ?        ? ? ?      ref             gcc_-march=native_-mtune=native_-O3                 20170720        20170718
75464  ? ? ?        ? ? ?      ref             gcc_-march=native_-mtune=native_-O2                 20170720        20170718
79871  ? ? ?        ? ? ?      ref             gcc_-funroll-loops_-march=native_-mtune=native_-Os  20170720        20170718
85598  ? ? ?        ? ? ?      ref             gcc_-march=native_-mtune=native_-Os                 20170720        20170718

Test failure

Implementation: crypto_aead/hs1sivlov2/faster
Compiler: gcc -funroll-loops -march=native -mtune=native -O2
error 111
crypto_aead_decrypt returns nonzero

Number of similar (compiler,implementation) pairs: 5, namely:
Compiler                                            Implementations
gcc -funroll-loops -march=native -mtune=native -O2  faster
gcc -funroll-loops -march=native -mtune=native -O3  faster
gcc -funroll-loops -march=native -mtune=native -Os  faster
gcc -march=native -mtune=native -O2                 faster
gcc -march=native -mtune=native -O3                 faster

Compiler output

Implementation: crypto_aead/hs1sivlov2/faster
Compiler: gcc -funroll-loops -march=native -mtune=native -O2
hs1.c: hs1.c: In function 'hs1siv_decrypt':
hs1.c: hs1.c:1033:30: warning: implicit declaration of function 'malloc' [-Wimplicit-function-declaration]
hs1.c: unsigned char *tmp = malloc(abytes);
hs1.c: ^~~~~~
hs1.c: hs1.c:1033:30: warning: incompatible implicit declaration of built-in function 'malloc'
hs1.c: hs1.c:1033:30: note: include '<stdlib.h>' or provide a declaration of 'malloc'
hs1.c: hs1.c:1037:9: warning: implicit declaration of function 'free' [-Wimplicit-function-declaration]
hs1.c: free(tmp);
hs1.c: ^~~~
hs1.c: hs1.c:1037:9: warning: incompatible implicit declaration of built-in function 'free'
hs1.c: hs1.c:1037:9: note: include '<stdlib.h>' or provide a declaration of 'free'

Number of similar (compiler,implementation) pairs: 6, namely:
Compiler                                            Implementations
gcc -funroll-loops -march=native -mtune=native -O2  faster
gcc -funroll-loops -march=native -mtune=native -O3  faster
gcc -funroll-loops -march=native -mtune=native -Os  faster
gcc -march=native -mtune=native -O2                 faster
gcc -march=native -mtune=native -O3                 faster
gcc -march=native -mtune=native -Os                 faster
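
Note on the compiler warnings

The warnings above say that hs1.c calls malloc and free without a visible declaration, and gcc's own note suggests including <stdlib.h>. The following is a minimal sketch of that pattern with the header in place. The helper name copy_via_tmp and its body are hypothetical and are not taken from hs1.c; only the tmp/abytes allocation pattern mirrors the lines quoted in the log. The log does not establish whether the missing include is related to the "crypto_aead_decrypt returns nonzero" test failure.

```c
#include <stdlib.h>  /* declares malloc and free; the header the gcc note asks for */
#include <string.h>  /* declares memcpy */

/* Hypothetical helper, not from hs1.c: copies abytes bytes from src to dst
   through a heap-allocated temporary, mirroring the malloc/free pattern
   quoted in the warnings. Returns 0 on success, -1 if allocation fails. */
int copy_via_tmp(unsigned char *dst, const unsigned char *src, size_t abytes)
{
    /* Request at least one byte so a zero-length call still gets a valid pointer. */
    unsigned char *tmp = malloc(abytes ? abytes : 1);
    if (tmp == NULL)
        return -1;

    memcpy(tmp, src, abytes);   /* stage the data in the temporary buffer */
    memcpy(dst, tmp, abytes);   /* copy it out to the caller's buffer */

    free(tmp);
    return 0;
}
```

With <stdlib.h> included, gcc sees proper prototypes for both functions, so the implicit-declaration warnings quoted above should no longer be emitted for code of this shape.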