Implementation notes: x86, h4e450, crypto_aead/morus640128v2

Computer: h4e450
Architecture: x86
CPU ID: AuthenticAMD-00500f20-178bfbff
SUPERCOP version: 20171218
Operation: crypto_aead
Primitive: morus640128v2
Time  Implementation  Compiler  Benchmark date  SUPERCOP version
33554  sse2  gcc -funroll-loops -m32 -march=barcelona -O3 -fomit-frame-pointer  20171001  20170725
33968  sse2  gcc -funroll-loops -m32 -march=pentium-m -O3 -fomit-frame-pointer  20171001  20170725
33981  sse2  gcc -m32 -march=barcelona -O3 -fomit-frame-pointer  20171001  20170725
34257  sse2  gcc -m32 -march=native -mtune=native -O3 -fomit-frame-pointer  20171001  20170725
34291  sse2  gcc -funroll-loops -m32 -march=pentium4 -O3 -fomit-frame-pointer  20171001  20170725
34390  sse2  gcc -m32 -march=pentium-m -O3 -fomit-frame-pointer  20171001  20170725
34395  sse2  gcc -funroll-loops -m32 -march=nocona -O3 -fomit-frame-pointer  20171001  20170725
34400  sse2  gcc -funroll-loops -m32 -march=prescott -O3 -fomit-frame-pointer  20171001  20170725
34514  sse2  gcc -funroll-loops -m32 -march=k8 -O3 -fomit-frame-pointer  20171001  20170725
34528  sse2  gcc -m32 -march=pentium4 -O3 -fomit-frame-pointer  20171001  20170725
34604  sse2  gcc -m32 -march=core2 -O3 -fomit-frame-pointer  20171001  20170725
34623  sse2  gcc -m32 -march=nocona -O3 -fomit-frame-pointer  20171001  20170725
34623  sse2  gcc -m32 -march=prescott -O3 -fomit-frame-pointer  20171001  20170725
34671  sse2  gcc -m32 -march=k8 -O3 -fomit-frame-pointer  20171001  20170725
39335  sse2  gcc -funroll-loops -m32 -march=pentium-m -O2 -fomit-frame-pointer  20171001  20170725
41130  sse2  gcc -m32 -march=k8 -O2 -fomit-frame-pointer  20171001  20170725
41686  sse2  gcc -m32 -march=core2 -O -fomit-frame-pointer  20171001  20170725
41700  sse2  gcc -m32 -march=pentium-m -O2 -fomit-frame-pointer  20171001  20170725
41772  sse2  gcc -m32 -march=corei7 -O2 -fomit-frame-pointer  20171001  20170725
41786  sse2  gcc -funroll-loops -m32 -march=k8 -O2 -fomit-frame-pointer  20171001  20170725
41790  sse2  gcc -m32 -march=core2 -msse4.1 -O -fomit-frame-pointer  20171001  20170725
41871  sse2  gcc -m32 -march=core2 -msse4 -O -fomit-frame-pointer  20171001  20170725
41918  sse2  gcc -funroll-loops -m32 -march=k8 -O -fomit-frame-pointer  20171001  20170725
41976  sse2  gcc -funroll-loops -m32 -march=pentium-m -O -fomit-frame-pointer  20171001  20170725
41990  sse2  gcc -m32 -march=k8 -O -fomit-frame-pointer  20171001  20170725
42037  sse2  gcc -m32 -march=pentium-m -O -fomit-frame-pointer  20171001  20170725
42066  sse2  gcc -m32 -march=core2 -O2 -fomit-frame-pointer  20171001  20170725
42132  sse2  gcc -m32 -march=core2 -msse4 -O2 -fomit-frame-pointer  20171001  20170725
42270  sse2  gcc -m32 -march=corei7 -O -fomit-frame-pointer  20171001  20170725
42327  sse2  gcc -funroll-loops -m32 -march=barcelona -O -fomit-frame-pointer  20171001  20170725
42356  sse2  gcc -m32 -march=native -mtune=native -O -fomit-frame-pointer  20171001  20170725
42446  sse2  gcc -m32 -march=barcelona -O -fomit-frame-pointer  20171001  20170725
42465  sse2  gcc -funroll-loops -m32 -march=prescott -O -fomit-frame-pointer  20171001  20170725
42484  sse2  gcc -m32 -march=core2 -msse4.1 -O2 -fomit-frame-pointer  20171001  20170725
42527  sse2  gcc -funroll-loops -m32 -march=pentium4 -O -fomit-frame-pointer  20171001  20170725
42622  sse2  gcc -funroll-loops -m32 -march=nocona -O2 -fomit-frame-pointer  20171001  20170725
42650  sse2  gcc -funroll-loops -m32 -march=nocona -O -fomit-frame-pointer  20171001  20170725
42845  sse2  gcc -m32 -march=pentium4 -O -fomit-frame-pointer  20171001  20170725
43092  sse2  gcc -funroll-loops -m32 -march=prescott -O2 -fomit-frame-pointer  20171001  20170725
43101  sse2  gcc -m32 -march=nocona -O -fomit-frame-pointer  20171001  20170725
43106  sse2  gcc -m32 -march=prescott -O -fomit-frame-pointer  20171001  20170725
43149  sse2  gcc -funroll-loops -m32 -march=pentium4 -O2 -fomit-frame-pointer  20171001  20170725
43753  sse2  gcc -m32 -march=prescott -O2 -fomit-frame-pointer  20171001  20170725
43833  sse2  gcc -m32 -march=pentium4 -O2 -fomit-frame-pointer  20171001  20170725
43871  sse2  gcc -m32 -march=nocona -O2 -fomit-frame-pointer  20171001  20170725
44128  sse2  gcc -m32 -march=native -mtune=native -O2 -fomit-frame-pointer  20171001  20170725
44213  sse2  gcc -m32 -march=barcelona -O2 -fomit-frame-pointer  20171001  20170725
44308  sse2  gcc -funroll-loops -m32 -march=barcelona -O2 -fomit-frame-pointer  20171001  20170725
52397  ref  gcc -funroll-loops -m32 -march=athlon -O2 -fomit-frame-pointer  20171001  20170725
52478  ref  gcc -funroll-loops -m32 -march=barcelona -O2 -fomit-frame-pointer  20171001  20170725
52497  ref  gcc -funroll-loops -m32 -march=k8 -O2 -fomit-frame-pointer  20171001  20170725
52739  ref  gcc -funroll-loops -m32 -O2 -fomit-frame-pointer  20171001  20170725
52944  ref  gcc -m32 -march=k8 -O2 -fomit-frame-pointer  20171001  20170725
53090  ref  gcc -m32 -march=athlon -O2 -fomit-frame-pointer  20171001  20170725
53138  ref  gcc -m32 -march=native -mtune=native -O2 -fomit-frame-pointer  20171001  20170725
53196  ref  gcc -funroll-loops -m32 -march=k6-3 -O2 -fomit-frame-pointer  20171001  20170725
53209  ref  gcc -funroll-loops -m32 -march=k6-2 -O2 -fomit-frame-pointer  20171001  20170725
53238  ref  gcc -m32 -march=barcelona -O2 -fomit-frame-pointer  20171001  20170725
53314  ref  gcc -funroll-loops -m32 -march=k6 -O2 -fomit-frame-pointer  20171001  20170725
53428  ref  gcc -m32 -march=corei7 -O2 -fomit-frame-pointer  20171001  20170725
53433  ref  gcc -m32 -march=core2 -msse4 -O2 -fomit-frame-pointer  20171001  20170725
53442  ref  gcc -m32 -march=core2 -O2 -fomit-frame-pointer  20171001  20170725
53499  ref  gcc -m32 -march=k6-2 -O2 -fomit-frame-pointer  20171001  20170725
53523  ref  gcc -m32 -march=core2 -msse4.1 -O2 -fomit-frame-pointer  20171001  20170725
53528  ref  gcc -m32 -march=k6 -O2 -fomit-frame-pointer  20171001  20170725
53585  ref  gcc -m32 -march=k6-3 -O2 -fomit-frame-pointer  20171001  20170725
53623  ref  gcc -funroll-loops -m32 -march=prescott -O -fomit-frame-pointer  20171001  20170725
53665  ref  gcc -funroll-loops -m32 -march=nocona -O -fomit-frame-pointer  20171001  20170725
53832  ref  gcc -m32 -O2 -fomit-frame-pointer  20171001  20170725
53837  ref  gcc -funroll-loops -m32 -march=nocona -O2 -fomit-frame-pointer  20171001  20170725
53846  ref  gcc -funroll-loops -m32 -march=prescott -O2 -fomit-frame-pointer  20171001  20170725
53908  ref  gcc -funroll-loops -m32 -O -fomit-frame-pointer  20171001  20170725
53941  ref  gcc -funroll-loops -m32 -march=athlon -O -fomit-frame-pointer  20171001  20170725
53946  ref  gcc -funroll-loops -m32 -march=barcelona -O -fomit-frame-pointer  20171001  20170725
54093  ref  gcc -m32 -march=nocona -O -fomit-frame-pointer  20171001  20170725
54154  ref  gcc -m32 -march=native -mtune=native -O -fomit-frame-pointer  20171001  20170725
54193  ref  gcc -funroll-loops -m32 -march=k6 -O -fomit-frame-pointer  20171001  20170725
54193  ref  gcc -m32 -march=prescott -O -fomit-frame-pointer  20171001  20170725
54226  ref  gcc -funroll-loops -m32 -march=pentium4 -O2 -fomit-frame-pointer  20171001  20170725
54230  ref  gcc -funroll-loops -m32 -march=pentium-m -O -fomit-frame-pointer  20171001  20170725
54250  ref  gcc -funroll-loops -m32 -march=k6-3 -O -fomit-frame-pointer  20171001  20170725
54264  ref  gcc -funroll-loops -m32 -march=pentium3 -O -fomit-frame-pointer  20171001  20170725
54283  ref  gcc -funroll-loops -m32 -march=k6-2 -O -fomit-frame-pointer  20171001  20170725
54293  ref  gcc -funroll-loops -m32 -march=i486 -O2 -fomit-frame-pointer  20171001  20170725
54312  ref  gcc -m32 -march=core2 -O -fomit-frame-pointer  20171001  20170725
54325  ref  gcc -funroll-loops -m32 -march=pentium2 -O -fomit-frame-pointer  20171001  20170725
54345  ref  gcc -funroll-loops -m32 -march=pentiumpro -O -fomit-frame-pointer  20171001  20170725
54359  ref  gcc -m32 -march=core2 -msse4.1 -O -fomit-frame-pointer  20171001  20170725
54373  ref  gcc -m32 -march=corei7 -O -fomit-frame-pointer  20171001  20170725
54374  ref  gcc -funroll-loops -m32 -march=k8 -O -fomit-frame-pointer  20171001  20170725
54383  ref  gcc -m32 -O -fomit-frame-pointer  20171001  20170725
54435  ref  gcc -m32 -march=core2 -msse4 -O -fomit-frame-pointer  20171001  20170725
54454  ref  gcc -m32 -march=pentium2 -O -fomit-frame-pointer  20171001  20170725
54506  ref  gcc -m32 -march=prescott -O2 -fomit-frame-pointer  20171001  20170725
54516  ref  gcc -funroll-loops -m32 -march=pentium4 -O -fomit-frame-pointer  20171001  20170725
54525  ref  gcc -m32 -march=athlon -O -fomit-frame-pointer  20171001  20170725
54535  ref  gcc -m32 -march=barcelona -O -fomit-frame-pointer  20171001  20170725
54549  ref  gcc -m32 -march=nocona -O2 -fomit-frame-pointer  20171001  20170725
54568  ref  gcc -m32 -march=pentiumpro -O -fomit-frame-pointer  20171001  20170725
54592  ref  gcc -m32 -march=k8 -O -fomit-frame-pointer  20171001  20170725
54663  ref  gcc -funroll-loops -m32 -march=i386 -O2 -fomit-frame-pointer  20171001  20170725
54706  ref  gcc -m32 -march=pentium4 -O2 -fomit-frame-pointer  20171001  20170725
54744  ref  gcc -funroll-loops -m32 -march=pentium3 -Os -fomit-frame-pointer  20171001  20170725
54782  ref  gcc -m32 -march=pentium-m -O -fomit-frame-pointer  20171001  20170725
54824  ref  gcc -funroll-loops -m32 -march=pentium-m -Os -fomit-frame-pointer  20171001  20170725
54853  ref  gcc -m32 -march=i486 -O2 -fomit-frame-pointer  20171001  20170725
54862  ref  gcc -funroll-loops -m32 -march=pentiumpro -Os -fomit-frame-pointer  20171001  20170725
54882  ref  gcc -m32 -march=pentium4 -O -fomit-frame-pointer  20171001  20170725
54895  ref  gcc -funroll-loops -m32 -march=pentium2 -Os -fomit-frame-pointer  20171001  20170725
54900  ref  gcc -funroll-loops -m32 -march=pentium-mmx -O -fomit-frame-pointer  20171001  20170725
54915  ref  gcc -m32 -march=pentium3 -O -fomit-frame-pointer  20171001  20170725
55000  ref  gcc -funroll-loops -m32 -march=pentium -O -fomit-frame-pointer  20171001  20170725
55019  ref  gcc -m32 -march=pentium3 -Os -fomit-frame-pointer  20171001  20170725
55024  ref  gcc -m32 -march=i386 -O2 -fomit-frame-pointer  20171001  20170725
55028  ref  gcc -funroll-loops -m32 -march=i486 -O -fomit-frame-pointer  20171001  20170725
55033  ref  gcc -m32 -march=pentium-m -Os -fomit-frame-pointer  20171001  20170725
55085  ref  gcc -m32 -march=pentium -O -fomit-frame-pointer  20171001  20170725
55086  ref  gcc -m32 -march=pentium-mmx -O -fomit-frame-pointer  20171001  20170725
55276  ref  gcc -m32 -march=k6-2 -O -fomit-frame-pointer  20171001  20170725
55276  ref  gcc -m32 -march=k6-3 -O -fomit-frame-pointer  20171001  20170725
55280  ref  gcc -m32 -march=k6 -O -fomit-frame-pointer  20171001  20170725
55385  ref  gcc -funroll-loops -m32 -march=i386 -O -fomit-frame-pointer  20171001  20170725
55432  ref  gcc -m32 -march=k6-2 -Os -fomit-frame-pointer  20171001  20170725
55433  ref  gcc -m32 -march=i486 -O -fomit-frame-pointer  20171001  20170725
55594  ref  gcc -m32 -march=k6-3 -Os -fomit-frame-pointer  20171001  20170725
55598  ref  gcc -m32 -march=k6 -Os -fomit-frame-pointer  20171001  20170725
55665  ref  gcc -m32 -march=pentium2 -Os -fomit-frame-pointer  20171001  20170725
55698  ref  gcc -m32 -march=pentiumpro -Os -fomit-frame-pointer  20171001  20170725
55723  ref  gcc -m32 -march=native -mtune=native -Os -fomit-frame-pointer  20171001  20170725
55732  ref  gcc -m32 -march=k8 -Os -fomit-frame-pointer  20171001  20170725
55750  ref  gcc -funroll-loops -m32 -march=k6 -Os -fomit-frame-pointer  20171001  20170725
55751  ref  gcc -m32 -march=i386 -O -fomit-frame-pointer  20171001  20170725
55761  ref  gcc -m32 -march=barcelona -Os -fomit-frame-pointer  20171001  20170725
55765  ref  gcc -funroll-loops -m32 -march=k6-2 -Os -fomit-frame-pointer  20171001  20170725
55770  ref  gcc -funroll-loops -m32 -march=barcelona -Os -fomit-frame-pointer  20171001  20170725
55907  ref  gcc -funroll-loops -m32 -Os -fomit-frame-pointer  20171001  20170725
55912  ref  gcc -funroll-loops -m32 -march=athlon -Os -fomit-frame-pointer  20171001  20170725
55955  ref  gcc -m32 -march=core2 -Os -fomit-frame-pointer  20171001  20170725
56008  ref  gcc -m32 -march=i386 -Os -fomit-frame-pointer  20171001  20170725
56045  ref  gcc -m32 -march=nocona -Os -fomit-frame-pointer  20171001  20170725
56046  ref  gcc -m32 -march=pentium4 -Os -fomit-frame-pointer  20171001  20170725
56046  ref  gcc -m32 -march=prescott -Os -fomit-frame-pointer  20171001  20170725
56050  ref  gcc -m32 -march=corei7 -Os -fomit-frame-pointer  20171001  20170725
56060  ref  gcc -m32 -march=athlon -Os -fomit-frame-pointer  20171001  20170725
56064  ref  gcc -funroll-loops -m32 -march=i386 -Os -fomit-frame-pointer  20171001  20170725
56073  ref  gcc -funroll-loops -m32 -march=i486 -Os -fomit-frame-pointer  20171001  20170725
56074  ref  gcc -m32 -march=core2 -msse4.1 -Os -fomit-frame-pointer  20171001  20170725
56074  ref  gcc -m32 -march=i486 -Os -fomit-frame-pointer  20171001  20170725
56126  ref  gcc -m32 -march=core2 -msse4 -Os -fomit-frame-pointer  20171001  20170725
56169  ref  gcc -funroll-loops -m32 -march=pentium4 -Os -fomit-frame-pointer  20171001  20170725
56169  ref  gcc -funroll-loops -m32 -march=prescott -Os -fomit-frame-pointer  20171001  20170725
56187  ref  gcc -m32 -Os -fomit-frame-pointer  20171001  20170725
56197  ref  gcc -funroll-loops -m32 -march=nocona -Os -fomit-frame-pointer  20171001  20170725
56435  ref  gcc -funroll-loops -m32 -march=k8 -Os -fomit-frame-pointer  20171001  20170725
56468  ref  gcc -funroll-loops -m32 -march=k6-3 -Os -fomit-frame-pointer  20171001  20170725
56482  ref  gcc -m32 -march=pentium -Os -fomit-frame-pointer  20171001  20170725
56491  ref  gcc -m32 -march=pentium-mmx -Os -fomit-frame-pointer  20171001  20170725
56952  ref  gcc -funroll-loops -m32 -march=pentium-mmx -Os -fomit-frame-pointer  20171001  20170725
57000  ref  gcc -funroll-loops -m32 -march=pentium2 -O2 -fomit-frame-pointer  20171001  20170725
57057  ref  gcc -funroll-loops -m32 -march=pentiumpro -O2 -fomit-frame-pointer  20171001  20170725
57190  ref  gcc -funroll-loops -m32 -march=pentium3 -O2 -fomit-frame-pointer  20171001  20170725
57242  ref  gcc -funroll-loops -m32 -march=pentium -Os -fomit-frame-pointer  20171001  20170725
57257  ref  gcc -funroll-loops -m32 -march=pentium-m -O2 -fomit-frame-pointer  20171001  20170725
57413  ref  gcc -funroll-loops -m32 -march=pentium -O2 -fomit-frame-pointer  20171001  20170725
57546  ref  gcc -funroll-loops -m32 -march=pentium-mmx -O2 -fomit-frame-pointer  20171001  20170725
57964  ref  gcc -m32 -march=pentium2 -O2 -fomit-frame-pointer  20171001  20170725
58055  ref  gcc -m32 -march=pentium3 -O2 -fomit-frame-pointer  20171001  20170725
58059  ref  gcc -m32 -march=pentiumpro -O2 -fomit-frame-pointer  20171001  20170725
58064  ref  gcc -m32 -march=pentium-m -O2 -fomit-frame-pointer  20171001  20170725
58159  ref  gcc -m32 -march=pentium-mmx -O2 -fomit-frame-pointer  20171001  20170725
58292  ref  gcc -m32 -march=pentium -O2 -fomit-frame-pointer  20171001  20170725
61793  ref  gcc -funroll-loops -m32 -march=k6 -O3 -fomit-frame-pointer  20171001  20170725
62363  ref  gcc -funroll-loops -m32 -O3 -fomit-frame-pointer  20171001  20170725
62481  ref  gcc -funroll-loops -m32 -march=k6-3 -O3 -fomit-frame-pointer  20171001  20170725
62496  ref  gcc -funroll-loops -m32 -march=k6-2 -O3 -fomit-frame-pointer  20171001  20170725
62548  ref  gcc -m32 -O3 -fomit-frame-pointer  20171001  20170725
62653  ref  gcc -m32 -march=core2 -O3 -fomit-frame-pointer  20171001  20170725
62781  ref  gcc -m32 -march=k6 -O3 -fomit-frame-pointer  20171001  20170725
63071  ref  gcc -m32 -march=k6-2 -O3 -fomit-frame-pointer  20171001  20170725
63147  ref  gcc -m32 -march=k6-3 -O3 -fomit-frame-pointer  20171001  20170725
63351  ref  gcc -funroll-loops -m32 -march=pentium4 -O3 -fomit-frame-pointer  20171001  20170725
63451  ref  gcc -funroll-loops -m32 -march=nocona -O3 -fomit-frame-pointer  20171001  20170725
63555  ref  gcc -funroll-loops -m32 -march=prescott -O3 -fomit-frame-pointer  20171001  20170725
63693  ref  gcc -funroll-loops -m32 -march=athlon -O3 -fomit-frame-pointer  20171001  20170725
63921  ref  gcc -m32 -march=pentium4 -O3 -fomit-frame-pointer  20171001  20170725
63964  ref  gcc -funroll-loops -m32 -march=i486 -O3 -fomit-frame-pointer  20171001  20170725
64096  ref  gcc -m32 -march=athlon -O3 -fomit-frame-pointer  20171001  20170725
64172  ref  gcc -funroll-loops -m32 -march=i386 -O3 -fomit-frame-pointer  20171001  20170725
64239  ref  gcc -m32 -march=i486 -O3 -fomit-frame-pointer  20171001  20170725
64334  ref  gcc -m32 -march=nocona -O3 -fomit-frame-pointer  20171001  20170725
64376  ref  gcc -m32 -march=prescott -O3 -fomit-frame-pointer  20171001  20170725
64472  ref  gcc -m32 -march=i386 -O3 -fomit-frame-pointer  20171001  20170725
64951  ref  gcc -funroll-loops -m32 -march=k8 -O3 -fomit-frame-pointer  20171001  20170725
65479  ref  gcc -funroll-loops -m32 -march=barcelona -O3 -fomit-frame-pointer  20171001  20170725
65535  ref  gcc -m32 -march=barcelona -O3 -fomit-frame-pointer  20171001  20170725
65773  ref  gcc -m32 -march=native -mtune=native -O3 -fomit-frame-pointer  20171001  20170725
65835  ref  gcc -m32 -march=k8 -O3 -fomit-frame-pointer  20171001  20170725
65850  ref  gcc -funroll-loops -m32 -march=pentium -O3 -fomit-frame-pointer  20171001  20170725
65854  ref  gcc -funroll-loops -m32 -march=pentium-mmx -O3 -fomit-frame-pointer  20171001  20170725
66134  ref  gcc -funroll-loops -m32 -march=pentium3 -O3 -fomit-frame-pointer  20171001  20170725
66358  ref  gcc -funroll-loops -m32 -march=pentium-m -O3 -fomit-frame-pointer  20171001  20170725
66381  ref  gcc -funroll-loops -m32 -march=pentiumpro -O3 -fomit-frame-pointer  20171001  20170725
66448  ref  gcc -funroll-loops -m32 -march=pentium2 -O3 -fomit-frame-pointer  20171001  20170725
66619  ref  gcc -m32 -march=pentium3 -O3 -fomit-frame-pointer  20171001  20170725
67075  ref  gcc -m32 -march=pentium -O3 -fomit-frame-pointer  20171001  20170725
67099  ref  gcc -m32 -march=pentium-mmx -O3 -fomit-frame-pointer  20171001  20170725
67255  ref  gcc -m32 -march=pentium-m -O3 -fomit-frame-pointer  20171001  20170725
67635  ref  gcc -m32 -march=pentiumpro -O3 -fomit-frame-pointer  20171001  20170725
67725  ref  gcc -m32 -march=pentium2 -O3 -fomit-frame-pointer  20171001  20170725

Test failure

Implementation: crypto_aead/morus640128v2/sse2
Compiler: gcc -funroll-loops -m32 -march=barcelona -Os -fomit-frame-pointer
error 111

Number of similar (compiler,implementation) pairs: 34, namely:
Compiler Implementations
gcc -funroll-loops -m32 -march=barcelona -Os -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=k8 -Os -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=nocona -Os -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=pentium-m -Os -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=pentium4 -Os -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=prescott -Os -fomit-frame-pointer sse2
gcc -m32 -march=core-avx-i -O2 -fomit-frame-pointer ref sse2
gcc -m32 -march=core-avx-i -O -fomit-frame-pointer ref sse2
gcc -m32 -march=core-avx-i -Os -fomit-frame-pointer ref sse2
gcc -m32 -march=core-avx2 -O2 -fomit-frame-pointer ref sse2
gcc -m32 -march=core-avx2 -O -fomit-frame-pointer ref sse2
gcc -m32 -march=core-avx2 -Os -fomit-frame-pointer ref sse2
gcc -m32 -march=core2 -Os -fomit-frame-pointer sse2
gcc -m32 -march=core2 -msse4.1 -Os -fomit-frame-pointer sse2
gcc -m32 -march=core2 -msse4 -Os -fomit-frame-pointer sse2
gcc -m32 -march=corei7-avx -O2 -fomit-frame-pointer ref sse2
gcc -m32 -march=corei7-avx -O -fomit-frame-pointer ref sse2
gcc -m32 -march=corei7-avx -Os -fomit-frame-pointer ref sse2
gcc -m32 -march=corei7 -Os -fomit-frame-pointer sse2
gcc -m32 -march=k8 -Os -fomit-frame-pointer sse2
gcc -m32 -march=native -mtune=native -Os -fomit-frame-pointer sse2
gcc -m32 -march=nocona -Os -fomit-frame-pointer sse2
gcc -m32 -march=pentium-m -Os -fomit-frame-pointer sse2
gcc -m32 -march=pentium4 -Os -fomit-frame-pointer sse2
gcc -m32 -march=prescott -Os -fomit-frame-pointer sse2

Test failure

Implementation: crypto_aead/morus640128v2/sse2
Compiler: gcc -m32 -march=barcelona -Os -fomit-frame-pointer
error 111
error 111

Number of similar (compiler,implementation) pairs: 1, namely:
Compiler Implementations
gcc -m32 -march=barcelona -Os -fomit-frame-pointer sse2

Compiler output

Implementation: crypto_aead/morus640128v2/sse2
Compiler: gcc -funroll-loops -m32 -O2 -fomit-frame-pointer
encrypt.c: encrypt.c: In function 'morus_stateupdate':
encrypt.c: encrypt.c:39:11: warning: SSE vector return without SSE enabled changes the ABI [-Wpsabi]
encrypt.c: state[0] = XOR(state[0], state[3]);
encrypt.c: ^
encrypt.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/xmmintrin.h:1249:0,
encrypt.c: from /usr/lib/gcc/x86_64-linux-gnu/5/include/immintrin.h:29,
encrypt.c: from encrypt.c:2:
encrypt.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/emmintrin.h:1415:1: error: inlining failed in call to always_inline '_mm_shuffle_epi32': target specific option mismatch
encrypt.c: _mm_shuffle_epi32 (__m128i __A, const int __mask)
encrypt.c: ^
encrypt.c: encrypt.c:28:24: error: called from here
encrypt.c: #define ROTL32(x) _mm_shuffle_epi32((x),_MM_SHUFFLE(2,1,0,3)) /*Rotate x by 32-bit positions to the left*/
encrypt.c: ^
encrypt.c: encrypt.c:66:13: note: in expansion of macro 'ROTL32'
encrypt.c: state[2] = ROTL32(state[2]);
encrypt.c: ^
encrypt.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/xmmintrin.h:1249:0,
encrypt.c: from /usr/lib/gcc/x86_64-linux-gnu/5/include/immintrin.h:29,
encrypt.c: from encrypt.c:2:
encrypt.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/emmintrin.h:1286:1: error: inlining failed in call to always_inline '_mm_xor_si128': target specific option mismatch
encrypt.c: _mm_xor_si128 (__m128i __A, __m128i __B)
encrypt.c: ^
encrypt.c: encrypt.c:18:24: error: called from here
encrypt.c: #define XOR(x,y) _mm_xor_si128((x),(y)) /*XOR(x,y) = x ^ y, where x and y are two 128-bit word*/
encrypt.c: ^
encrypt.c: ...
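
Note on the output above: the "target specific option mismatch" errors appear with compiler lines that do not enable SSE2 (the log itself warns "SSE vector return without SSE enabled"), so GCC cannot inline the always_inline intrinsics from emmintrin.h. The following is a minimal sketch, not taken from encrypt.c, of how the two macros named in the log (XOR and ROTL32) could be guarded with `__SSE2__` and given a portable fallback. The names block128, block_xor and block_rotl32 are hypothetical and exist only for this illustration.

```c
#include <stdint.h>

#ifdef __SSE2__
#include <emmintrin.h>
#endif

/* Hypothetical 128-bit block type: four 32-bit words, w[0] is the lowest lane. */
typedef struct { uint32_t w[4]; } block128;

/* XOR of two 128-bit blocks; mirrors XOR(x,y) = _mm_xor_si128(x,y) from the log. */
static block128 block_xor(block128 a, block128 b)
{
    block128 r;
#ifdef __SSE2__
    __m128i x = _mm_loadu_si128((const __m128i *)a.w);
    __m128i y = _mm_loadu_si128((const __m128i *)b.w);
    _mm_storeu_si128((__m128i *)r.w, _mm_xor_si128(x, y));
#else
    /* Portable fallback used when the compiler line does not enable SSE2. */
    for (int i = 0; i < 4; i++)
        r.w[i] = a.w[i] ^ b.w[i];
#endif
    return r;
}

/* Rotate the block left by one 32-bit lane; mirrors
   ROTL32(x) = _mm_shuffle_epi32(x, _MM_SHUFFLE(2,1,0,3)) from the log. */
static block128 block_rotl32(block128 a)
{
    block128 r;
#ifdef __SSE2__
    __m128i x = _mm_loadu_si128((const __m128i *)a.w);
    _mm_storeu_si128((__m128i *)r.w,
                     _mm_shuffle_epi32(x, _MM_SHUFFLE(2, 1, 0, 3)));
#else
    /* Same lane permutation written out by hand. */
    r.w[0] = a.w[3];
    r.w[1] = a.w[0];
    r.w[2] = a.w[1];
    r.w[3] = a.w[2];
#endif
    return r;
}
```

Both branches compute the same result, so a build without SSE2 would compile and run the fallback instead of failing. SUPERCOP itself simply records the failure and moves on to the next (compiler, implementation) pair.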

Number of similar (compiler,implementation) pairs: 88, namely:
Compiler Implementations
gcc -funroll-loops -m32 -O2 -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -O3 -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -O -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -Os -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=athlon -O2 -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=athlon -O3 -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=athlon -O -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=athlon -Os -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=i386 -O2 -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=i386 -O3 -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=i386 -O -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=i386 -Os -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=i486 -O2 -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=i486 -O3 -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=i486 -O -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=i486 -Os -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=k6-2 -O2 -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=k6-2 -O3 -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=k6-2 -O -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=k6-2 -Os -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=k6-3 -O2 -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=k6-3 -O3 -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=k6-3 -O -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=k6-3 -Os -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=k6 -O2 -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=k6 -O3 -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=k6 -O -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=k6 -Os -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=pentium-mmx -O2 -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=pentium-mmx -O3 -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=pentium-mmx -O -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=pentium-mmx -Os -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=pentium2 -O2 -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=pentium2 -O3 -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=pentium2 -O -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=pentium2 -Os -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=pentium -O2 -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=pentium -O3 -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=pentium -O -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=pentium -Os -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=pentiumpro -O2 -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=pentiumpro -O3 -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=pentiumpro -O -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=pentiumpro -Os -fomit-frame-pointer sse2
gcc -m32 -O2 -fomit-frame-pointer sse2
gcc -m32 -O3 -fomit-frame-pointer sse2
gcc -m32 -O -fomit-frame-pointer sse2
gcc -m32 -Os -fomit-frame-pointer sse2
gcc -m32 -march=athlon -O2 -fomit-frame-pointer sse2
gcc -m32 -march=athlon -O3 -fomit-frame-pointer sse2
gcc -m32 -march=athlon -O -fomit-frame-pointer sse2
gcc -m32 -march=athlon -Os -fomit-frame-pointer sse2
gcc -m32 -march=i386 -O2 -fomit-frame-pointer sse2
gcc -m32 -march=i386 -O3 -fomit-frame-pointer sse2
gcc -m32 -march=i386 -O -fomit-frame-pointer sse2
gcc -m32 -march=i386 -Os -fomit-frame-pointer sse2
gcc -m32 -march=i486 -O2 -fomit-frame-pointer sse2
gcc -m32 -march=i486 -O3 -fomit-frame-pointer sse2
gcc -m32 -march=i486 -O -fomit-frame-pointer sse2
gcc -m32 -march=i486 -Os -fomit-frame-pointer sse2
gcc -m32 -march=k6-2 -O2 -fomit-frame-pointer sse2
gcc -m32 -march=k6-2 -O3 -fomit-frame-pointer sse2
gcc -m32 -march=k6-2 -O -fomit-frame-pointer sse2
gcc -m32 -march=k6-2 -Os -fomit-frame-pointer sse2
gcc -m32 -march=k6-3 -O2 -fomit-frame-pointer sse2
gcc -m32 -march=k6-3 -O3 -fomit-frame-pointer sse2
gcc -m32 -march=k6-3 -O -fomit-frame-pointer sse2
gcc -m32 -march=k6-3 -Os -fomit-frame-pointer sse2
gcc -m32 -march=k6 -O2 -fomit-frame-pointer sse2
gcc -m32 -march=k6 -O3 -fomit-frame-pointer sse2
gcc -m32 -march=k6 -O -fomit-frame-pointer sse2
gcc -m32 -march=k6 -Os -fomit-frame-pointer sse2
gcc -m32 -march=pentium-mmx -O2 -fomit-frame-pointer sse2
gcc -m32 -march=pentium-mmx -O3 -fomit-frame-pointer sse2
gcc -m32 -march=pentium-mmx -O -fomit-frame-pointer sse2
gcc -m32 -march=pentium-mmx -Os -fomit-frame-pointer sse2
gcc -m32 -march=pentium2 -O2 -fomit-frame-pointer sse2
gcc -m32 -march=pentium2 -O3 -fomit-frame-pointer sse2
gcc -m32 -march=pentium2 -O -fomit-frame-pointer sse2
gcc -m32 -march=pentium2 -Os -fomit-frame-pointer sse2
gcc -m32 -march=pentium -O2 -fomit-frame-pointer sse2
gcc -m32 -march=pentium -O3 -fomit-frame-pointer sse2
gcc -m32 -march=pentium -O -fomit-frame-pointer sse2
gcc -m32 -march=pentium -Os -fomit-frame-pointer sse2
gcc -m32 -march=pentiumpro -O2 -fomit-frame-pointer sse2
gcc -m32 -march=pentiumpro -O3 -fomit-frame-pointer sse2
gcc -m32 -march=pentiumpro -O -fomit-frame-pointer sse2
gcc -m32 -march=pentiumpro -Os -fomit-frame-pointer sse2

Compiler output

Implementation: crypto_aead/morus640128v2/neon
Compiler: gcc -funroll-loops -m32 -O2 -fomit-frame-pointer
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s: Assembler messages:
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:47: Error: unknown pseudo-op: `.fpu'
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:84: Error: too many memory references for `sub'
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:89: Error: invalid char '[' beginning operand 2 `[sp'
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:94: Error: invalid char '[' beginning operand 2 `[sp'
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:99: Error: no such instruction: `vld1.8 {d0-d1},[r2,:128]'
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:104: Error: too many memory references for `add'
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:109: Error: no such instruction: `vld1.8 {d2-d3},[r2,:128]'
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:114: Error: too many memory references for `add'
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:119: Error: no such instruction: `vld1.8 {d4-d5},[r2,:128]'
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:124: Error: too many memory references for `add'
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:129: Error: no such instruction: `vld1.8 {d6-d7},[r2,:128]'
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:134: Error: too many memory references for `add'
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:139: Error: no such instruction: `vld1.8 {d16-d17},[r2,:128]'
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:144: Error: no such instruction: `ldrd r4,[r3,'
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:152: Error: expecting operand after ','; got nothing
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:155: Error: no such instruction: `bne ._start'
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:160: Error: expecting operand after ','; got nothing
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:163: Error: no such instruction: `beq ._end'
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:171: Error: no such instruction: `vld1.8 {d18-d19},[r1,:128]'
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:176: Error: too many memory references for `add'
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:181: Error: no such instruction: `vand q10,q2,q3'
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:186: Error: no such instruction: `vext.32 q11,q1,q1,'
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:191: Error: no such instruction: `veor q10,q10,q11'
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:196: Error: no such instruction: `veor q10,q10,q0'
morus_dec_aut_step_looped.s: ...
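
Note on the output above: the neon implementation is ARM NEON assembly (`.fpu`, `vld1.8`, `veor`), which the x86 assembler rejects, so these errors are expected on this machine. As a hedged illustration only, a source tree that wanted to choose a backend at compile time might test the compiler's predefined macros as sketched below. The function name morus_backend is hypothetical and is not part of the benchmarked code.

```c
/* Hypothetical helper: names the MORUS implementation this build could use,
   based on macros that GCC predefines for the target. */
static const char *morus_backend(void)
{
#if defined(__ARM_NEON) || defined(__ARM_NEON__)
    return "neon";   /* ARM target with NEON enabled */
#elif defined(__SSE2__)
    return "sse2";   /* x86 target with SSE2 enabled */
#else
    return "ref";    /* portable reference code */
#endif
}
```

On a target where neither NEON nor SSE2 is enabled, such as the plain `-m32` lines listed in this report, the sketch selects "ref".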

Number of similar (compiler,implementation) pairs: 166, namely:
Compiler Implementations
gcc -funroll-loops -m32 -O2 -fomit-frame-pointer neon
gcc -funroll-loops -m32 -O3 -fomit-frame-pointer neon
gcc -funroll-loops -m32 -O -fomit-frame-pointer neon
gcc -funroll-loops -m32 -Os -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=athlon -O2 -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=athlon -O3 -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=athlon -O -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=athlon -Os -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=barcelona -O2 -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=barcelona -O3 -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=barcelona -O -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=barcelona -Os -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=i386 -O2 -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=i386 -O3 -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=i386 -O -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=i386 -Os -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=i486 -O2 -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=i486 -O3 -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=i486 -O -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=i486 -Os -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=k6-2 -O2 -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=k6-2 -O3 -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=k6-2 -O -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=k6-2 -Os -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=k6-3 -O2 -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=k6-3 -O3 -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=k6-3 -O -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=k6-3 -Os -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=k6 -O2 -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=k6 -O3 -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=k6 -O -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=k6 -Os -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=k8 -O2 -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=k8 -O3 -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=k8 -O -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=k8 -Os -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=nocona -O2 -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=nocona -O3 -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=nocona -O -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=nocona -Os -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=pentium-m -O2 -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=pentium-m -O3 -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=pentium-m -O -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=pentium-m -Os -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=pentium-mmx -O2 -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=pentium-mmx -O3 -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=pentium-mmx -O -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=pentium-mmx -Os -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=pentium2 -O2 -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=pentium2 -O3 -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=pentium2 -O -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=pentium2 -Os -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=pentium3 -O2 -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=pentium3 -O3 -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=pentium3 -O -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=pentium3 -Os -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=pentium4 -O2 -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=pentium4 -O3 -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=pentium4 -O -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=pentium4 -Os -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=pentium -O2 -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=pentium -O3 -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=pentium -O -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=pentium -Os -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=pentiumpro -O2 -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=pentiumpro -O3 -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=pentiumpro -O -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=pentiumpro -Os -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=prescott -O2 -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=prescott -O3 -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=prescott -O -fomit-frame-pointer neon
gcc -funroll-loops -m32 -march=prescott -Os -fomit-frame-pointer neon
gcc -m32 -O2 -fomit-frame-pointer neon
gcc -m32 -O3 -fomit-frame-pointer neon
gcc -m32 -O -fomit-frame-pointer neon
gcc -m32 -Os -fomit-frame-pointer neon
gcc -m32 -march=athlon -O2 -fomit-frame-pointer neon
gcc -m32 -march=athlon -O3 -fomit-frame-pointer neon
gcc -m32 -march=athlon -O -fomit-frame-pointer neon
gcc -m32 -march=athlon -Os -fomit-frame-pointer neon
gcc -m32 -march=core-avx-i -O2 -fomit-frame-pointer neon
gcc -m32 -march=core-avx-i -O -fomit-frame-pointer neon
gcc -m32 -march=core-avx-i -Os -fomit-frame-pointer neon
gcc -m32 -march=core-avx2 -O2 -fomit-frame-pointer neon
gcc -m32 -march=core-avx2 -O -fomit-frame-pointer neon
gcc -m32 -march=core-avx2 -Os -fomit-frame-pointer neon
gcc -m32 -march=core2 -O2 -fomit-frame-pointer neon
gcc -m32 -march=core2 -O3 -fomit-frame-pointer neon
gcc -m32 -march=core2 -O -fomit-frame-pointer neon
gcc -m32 -march=core2 -Os -fomit-frame-pointer neon
gcc -m32 -march=core2 -msse4.1 -O2 -fomit-frame-pointer neon
gcc -m32 -march=core2 -msse4.1 -O -fomit-frame-pointer neon
gcc -m32 -march=core2 -msse4.1 -Os -fomit-frame-pointer neon
gcc -m32 -march=core2 -msse4 -O2 -fomit-frame-pointer neon
gcc -m32 -march=core2 -msse4 -O -fomit-frame-pointer neon
gcc -m32 -march=core2 -msse4 -Os -fomit-frame-pointer neon
gcc -m32 -march=corei7-avx -O2 -fomit-frame-pointer neon
gcc -m32 -march=corei7-avx -O -fomit-frame-pointer neon
gcc -m32 -march=corei7-avx -Os -fomit-frame-pointer neon
gcc -m32 -march=corei7 -O2 -fomit-frame-pointer neon
gcc -m32 -march=corei7 -O -fomit-frame-pointer neon
gcc -m32 -march=corei7 -Os -fomit-frame-pointer neon
gcc -m32 -march=i386 -O2 -fomit-frame-pointer neon
gcc -m32 -march=i386 -O3 -fomit-frame-pointer neon
gcc -m32 -march=i386 -O -fomit-frame-pointer neon
gcc -m32 -march=i386 -Os -fomit-frame-pointer neon
gcc -m32 -march=i486 -O2 -fomit-frame-pointer neon
gcc -m32 -march=i486 -O3 -fomit-frame-pointer neon
gcc -m32 -march=i486 -O -fomit-frame-pointer neon
gcc -m32 -march=i486 -Os -fomit-frame-pointer neon
gcc -m32 -march=k6-2 -O2 -fomit-frame-pointer neon
gcc -m32 -march=k6-2 -O3 -fomit-frame-pointer neon
gcc -m32 -march=k6-2 -O -fomit-frame-pointer neon
gcc -m32 -march=k6-2 -Os -fomit-frame-pointer neon
gcc -m32 -march=k6-3 -O2 -fomit-frame-pointer neon
gcc -m32 -march=k6-3 -O3 -fomit-frame-pointer neon
gcc -m32 -march=k6-3 -O -fomit-frame-pointer neon
gcc -m32 -march=k6-3 -Os -fomit-frame-pointer neon
gcc -m32 -march=k6 -O2 -fomit-frame-pointer neon
gcc -m32 -march=k6 -O3 -fomit-frame-pointer neon
gcc -m32 -march=k6 -O -fomit-frame-pointer neon
gcc -m32 -march=k6 -Os -fomit-frame-pointer neon
gcc -m32 -march=k8 -O2 -fomit-frame-pointer neon
gcc -m32 -march=k8 -O3 -fomit-frame-pointer neon
gcc -m32 -march=k8 -O -fomit-frame-pointer neon
gcc -m32 -march=k8 -Os -fomit-frame-pointer neon
gcc -m32 -march=native -mtune=native -O2 -fomit-frame-pointer neon
gcc -m32 -march=native -mtune=native -O3 -fomit-frame-pointer neon
gcc -m32 -march=native -mtune=native -O -fomit-frame-pointer neon
gcc -m32 -march=native -mtune=native -Os -fomit-frame-pointer neon
gcc -m32 -march=nocona -O2 -fomit-frame-pointer neon
gcc -m32 -march=nocona -O3 -fomit-frame-pointer neon
gcc -m32 -march=nocona -O -fomit-frame-pointer neon
gcc -m32 -march=nocona -Os -fomit-frame-pointer neon
gcc -m32 -march=pentium-m -O2 -fomit-frame-pointer neon
gcc -m32 -march=pentium-m -O3 -fomit-frame-pointer neon
gcc -m32 -march=pentium-m -O -fomit-frame-pointer neon
gcc -m32 -march=pentium-m -Os -fomit-frame-pointer neon
gcc -m32 -march=pentium-mmx -O2 -fomit-frame-pointer neon
gcc -m32 -march=pentium-mmx -O3 -fomit-frame-pointer neon
gcc -m32 -march=pentium-mmx -O -fomit-frame-pointer neon
gcc -m32 -march=pentium-mmx -Os -fomit-frame-pointer neon
gcc -m32 -march=pentium2 -O2 -fomit-frame-pointer neon
gcc -m32 -march=pentium2 -O3 -fomit-frame-pointer neon
gcc -m32 -march=pentium2 -O -fomit-frame-pointer neon
gcc -m32 -march=pentium2 -Os -fomit-frame-pointer neon
gcc -m32 -march=pentium3 -O2 -fomit-frame-pointer neon
gcc -m32 -march=pentium3 -O3 -fomit-frame-pointer neon
gcc -m32 -march=pentium3 -O -fomit-frame-pointer neon
gcc -m32 -march=pentium3 -Os -fomit-frame-pointer neon
gcc -m32 -march=pentium4 -O2 -fomit-frame-pointer neon
gcc -m32 -march=pentium4 -O3 -fomit-frame-pointer neon
gcc -m32 -march=pentium4 -O -fomit-frame-pointer neon
gcc -m32 -march=pentium4 -Os -fomit-frame-pointer neon
gcc -m32 -march=pentium -O2 -fomit-frame-pointer neon
gcc -m32 -march=pentium -O3 -fomit-frame-pointer neon
gcc -m32 -march=pentium -O -fomit-frame-pointer neon
gcc -m32 -march=pentium -Os -fomit-frame-pointer neon
gcc -m32 -march=pentiumpro -O2 -fomit-frame-pointer neon
gcc -m32 -march=pentiumpro -O3 -fomit-frame-pointer neon
gcc -m32 -march=pentiumpro -O -fomit-frame-pointer neon
gcc -m32 -march=pentiumpro -Os -fomit-frame-pointer neon
gcc -m32 -march=prescott -O2 -fomit-frame-pointer neon
gcc -m32 -march=prescott -O3 -fomit-frame-pointer neon
gcc -m32 -march=prescott -O -fomit-frame-pointer neon
gcc -m32 -march=prescott -Os -fomit-frame-pointer neon

Compiler output

Implementation: crypto_aead/morus640128v2/sse2
Compiler: gcc -funroll-loops -m32 -march=pentium3 -O2 -fomit-frame-pointer
encrypt.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/xmmintrin.h:1249:0,
encrypt.c: from /usr/lib/gcc/x86_64-linux-gnu/5/include/immintrin.h:29,
encrypt.c: from encrypt.c:2:
encrypt.c: encrypt.c: In function 'morus_stateupdate':
encrypt.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/emmintrin.h:1415:1: error: inlining failed in call to always_inline '_mm_shuffle_epi32': target specific option mismatch
encrypt.c: _mm_shuffle_epi32 (__m128i __A, const int __mask)
encrypt.c: ^
encrypt.c: encrypt.c:28:24: error: called from here
encrypt.c: #define ROTL32(x) _mm_shuffle_epi32((x),_MM_SHUFFLE(2,1,0,3)) /*Rotate x by 32-bit positions to the left*/
encrypt.c: ^
encrypt.c: encrypt.c:66:13: note: in expansion of macro 'ROTL32'
encrypt.c: state[2] = ROTL32(state[2]);
encrypt.c: ^
encrypt.c: In file included from /usr/lib/gcc/x86_64-linux-gnu/5/include/xmmintrin.h:1249:0,
encrypt.c: from /usr/lib/gcc/x86_64-linux-gnu/5/include/immintrin.h:29,
encrypt.c: from encrypt.c:2:
encrypt.c: /usr/lib/gcc/x86_64-linux-gnu/5/include/emmintrin.h:1286:1: error: inlining failed in call to always_inline '_mm_xor_si128': target specific option mismatch
encrypt.c: _mm_xor_si128 (__m128i __A, __m128i __B)
encrypt.c: ^
encrypt.c: encrypt.c:18:24: error: called from here
encrypt.c: #define XOR(x,y) _mm_xor_si128((x),(y)) /*XOR(x,y) = x ^ y, where x and y are two 128-bit word*/
encrypt.c: ^
encrypt.c: encrypt.c:25:24: note: in expansion of macro 'XOR'
encrypt.c: #define ROTL(x,n) XOR(_mm_slli_epi32((x), (n)), _mm_srli_epi32((x),(32-n))) /*Rotate 4 32-bit unsigned integers in x to the left by n-bit positions*/
encrypt.c: ^
encrypt.c: ...

Number of similar (compiler,implementation) pairs: 8, namely:
Compiler Implementations
gcc -funroll-loops -m32 -march=pentium3 -O2 -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=pentium3 -O3 -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=pentium3 -O -fomit-frame-pointer sse2
gcc -funroll-loops -m32 -march=pentium3 -Os -fomit-frame-pointer sse2
gcc -m32 -march=pentium3 -O2 -fomit-frame-pointer sse2
gcc -m32 -march=pentium3 -O3 -fomit-frame-pointer sse2
gcc -m32 -march=pentium3 -O -fomit-frame-pointer sse2
gcc -m32 -march=pentium3 -Os -fomit-frame-pointer sse2
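
The eight failures above all use -march=pentium3, which enables SSE but not SSE2, so GCC refuses to inline the emmintrin.h intrinsics behind the ROTL32, XOR and ROTL macros. The following is a minimal sketch, not the code in encrypt.c, of how the same two rotations could be guarded so that a non-SSE2 target falls back to scalar code. The helper names rotl_lanes and rotl_words are hypothetical and chosen only for illustration. The intrinsics and the `__SSE2__` macro are the standard ones.

```c
#include <assert.h>
#include <stdint.h>

#ifdef __SSE2__
#include <emmintrin.h>   /* SSE2 intrinsics; only usable when SSE2 is enabled */
#endif

/* Hypothetical helper: rotate each of the four 32-bit lanes left by n bits.
   Same effect as the ROTL(x,n) macro quoted in the log. */
static void rotl_lanes(uint32_t v[4], int n)
{
    assert(n > 0 && n < 32);   /* a shift by 32 would be undefined in the scalar path */
#ifdef __SSE2__
    __m128i x = _mm_loadu_si128((const __m128i *)v);
    x = _mm_xor_si128(_mm_slli_epi32(x, n), _mm_srli_epi32(x, 32 - n));
    _mm_storeu_si128((__m128i *)v, x);
#else
    for (int i = 0; i < 4; i++)
        v[i] = (v[i] << n) | (v[i] >> (32 - n));
#endif
}

/* Hypothetical helper: move every 32-bit lane up one position, with the top
   lane wrapping to the bottom. Same effect as the ROTL32(x) macro in the log. */
static void rotl_words(uint32_t v[4])
{
#ifdef __SSE2__
    __m128i x = _mm_loadu_si128((const __m128i *)v);
    x = _mm_shuffle_epi32(x, _MM_SHUFFLE(2, 1, 0, 3));
    _mm_storeu_si128((__m128i *)v, x);
#else
    uint32_t top = v[3];
    v[3] = v[2];
    v[2] = v[1];
    v[1] = v[0];
    v[0] = top;
#endif
}
```

Built with -m32 -march=pentium3, GCC does not define `__SSE2__`, so the scalar branches are compiled and the "target specific option mismatch" error does not arise. With any SSE2-capable -march value from the timing table, the intrinsic branches are used instead.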

Compiler output

Implementation: crypto_aead/morus640128v2/neon
Compiler: gcc -m32 -march=barcelona -O2 -fomit-frame-pointer
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s: Assembler messages:
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:47: Error: unknown pseudo-op: `.fpu'
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:84: Error: too many memory references for `sub'
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:89: Error: invalid char '[' beginning operand 2 `[sp'
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:94: Error: invalid char '[' beginning operand 2 `[sp'
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:99: Error: no such instruction: `vld1.8 {d0-d1},[r2,:128]'
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:104: Error: too many memory references for `add'
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:109: Error: no such instruction: `vld1.8 {d2-d3},[r2,:128]'
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:114: Error: too many memory references for `add'
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:119: Error: no such instruction: `vld1.8 {d4-d5},[r2,:128]'
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:124: Error: too many memory references for `add'
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:129: Error: no such instruction: `vld1.8 {d6-d7},[r2,:128]'
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:134: Error: too many memory references for `add'
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:139: Error: no such instruction: `vld1.8 {d16-d17},[r2,:128]'
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:144: Error: no such instruction: `ldrd r4,[r3,'
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:152: Error: expecting operand after ','; got nothing
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:155: Error: no such instruction: `bne ._start'
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:160: Error: expecting operand after ','; got nothing
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:163: Error: no such instruction: `beq ._end'
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:171: Error: no such instruction: `vld1.8 {d18-d19},[r1,:128]'
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:176: Error: too many memory references for `add'
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:181: Error: no such instruction: `vand q10,q2,q3'
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:186: Error: no such instruction: `vext.32 q11,q1,q1,'
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:191: Error: no such instruction: `veor q10,q10,q11'
morus_dec_aut_step_looped.s: morus_dec_aut_step_looped.s:196: Error: no such instruction: `veor q10,q10,q0'
morus_dec_aut_step_looped.s: ...

Number of similar (compiler,implementation) pairs: 4, namely:
Compiler Implementations
gcc -m32 -march=barcelona -O2 -fomit-frame-pointer neon
gcc -m32 -march=barcelona -O3 -fomit-frame-pointer neon
gcc -m32 -march=barcelona -O -fomit-frame-pointer neon
gcc -m32 -march=barcelona -Os -fomit-frame-pointer neon