Implementation notes: amd64, glyme, crypto_aead/scream12v3

Computer: glyme
Architecture: amd64
CPU ID: GenuineIntel-00020652-bfebfbff
SUPERCOP version: 201720170105
Operation: crypto_aead
Primitive: scream12v3
TimeImplementationCompilerBenchmark dateSUPERCOP version
98384ssegcc -m64 -march=corei7 -O3 -fomit-frame-pointer2017020520170105
98724ssegcc -m64 -march=native -mtune=native -O3 -fomit-frame-pointer2017020520170105
98824ssegcc -m64 -march=core2 -msse4.1 -O3 -fomit-frame-pointer2017020520170105
98828ssegcc -m64 -march=core2 -O3 -fomit-frame-pointer2017020520170105
98840ssegcc -m64 -march=core2 -msse4 -O3 -fomit-frame-pointer2017020520170105
104284ssegcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv2017020520170105
106852ssegcc -m64 -march=corei7 -O2 -fomit-frame-pointer2017020520170105
107236ssegcc -m64 -march=core2 -msse4 -O2 -fomit-frame-pointer2017020520170105
107240ssegcc -m64 -march=core2 -O2 -fomit-frame-pointer2017020520170105
107600ssegcc -m64 -march=core2 -msse4.1 -O2 -fomit-frame-pointer2017020520170105
107696ssegcc -m64 -march=native -mtune=native -O2 -fomit-frame-pointer2017020520170105
111720ssegcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv2017020520170105
113972sseclang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments2017020520170105
114048sseclang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments2017020520170105
117160ssegcc -m64 -march=core2 -O -fomit-frame-pointer2017020520170105
117180ssegcc -m64 -march=native -mtune=native -O -fomit-frame-pointer2017020520170105
117220ssegcc -m64 -march=core2 -msse4.1 -O -fomit-frame-pointer2017020520170105
117500ssegcc -m64 -march=corei7 -O -fomit-frame-pointer2017020520170105
117788ssegcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv2017020520170105
118288ssegcc -m64 -march=core2 -msse4 -O -fomit-frame-pointer2017020520170105
159712ssegcc -m64 -march=corei7 -Os -fomit-frame-pointer2017020520170105
159912ssegcc -m64 -march=core2 -msse4 -Os -fomit-frame-pointer2017020520170105
159920ssegcc -m64 -march=core2 -msse4.1 -Os -fomit-frame-pointer2017020520170105
159924ssegcc -m64 -march=core2 -Os -fomit-frame-pointer2017020520170105
160176ssegcc -m64 -march=native -mtune=native -Os -fomit-frame-pointer2017020520170105
160432ssegcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv2017020520170105
648796refclang -O3 -fomit-frame-pointer -Qunused-arguments2017020520170105
649064refclang -mcpu=cortex-a8 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments2017020520170105
650444refclang -mcpu=cortex-a9 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments2017020520170105
652324refclang -mcpu=native -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments2017020520170105
654320refgcc -march=nocona -O3 -fomit-frame-pointer2017020520170105
655556refgcc -m64 -march=nocona -O3 -fomit-frame-pointer2017020520170105
660216refgcc -m64 -march=native -mtune=native -O3 -fomit-frame-pointer2017020520170105
660300refgcc -m64 -march=core2 -msse4 -O3 -fomit-frame-pointer2017020520170105
660600refgcc -m64 -march=core2 -O3 -fomit-frame-pointer2017020520170105
660856refgcc -m64 -march=core2 -msse4.1 -O3 -fomit-frame-pointer2017020520170105
661612refgcc -m64 -march=corei7 -O3 -fomit-frame-pointer2017020520170105
662540refgcc -march=barcelona -O3 -fomit-frame-pointer2017020520170105
662556refgcc -m64 -march=barcelona -O3 -fomit-frame-pointer2017020520170105
663428refgcc -march=k8 -O3 -fomit-frame-pointer2017020520170105
664680refgcc -m64 -march=k8 -O3 -fomit-frame-pointer2017020520170105
664924refgcc -funroll-loops -march=barcelona -O3 -fomit-frame-pointer2017020520170105
664928refgcc -funroll-loops -m64 -march=barcelona -O3 -fomit-frame-pointer2017020520170105
665776refgcc -funroll-loops -march=k8 -O3 -fomit-frame-pointer2017020520170105
665780refgcc -funroll-loops -m64 -march=k8 -O3 -fomit-frame-pointer2017020520170105
666940refgcc -funroll-loops -march=nocona -O3 -fomit-frame-pointer2017020520170105
666944refgcc -funroll-loops -m64 -march=nocona -O3 -fomit-frame-pointer2017020520170105
667924refgcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv2017020520170105
674984refclang -O3 -fwrapv -march=native -fomit-frame-pointer -Qunused-arguments2017020520170105
676568refgcc -funroll-loops -O3 -fomit-frame-pointer2017020520170105
676568refgcc -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer2017020520170105
676568refgcc -funroll-loops -m64 -O3 -fomit-frame-pointer2017020520170105
676616refclang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments2017020520170105
677808refgcc -O3 -fomit-frame-pointer2017020520170105
677808refgcc -fno-schedule-insns -O3 -fomit-frame-pointer2017020520170105
677808refgcc -m64 -O3 -fomit-frame-pointer2017020520170105
724148refgcc -funroll-loops -march=nocona -O2 -fomit-frame-pointer2017020520170105
724152refgcc -funroll-loops -m64 -march=nocona -O2 -fomit-frame-pointer2017020520170105
737488refgcc -funroll-loops -m64 -march=barcelona -O2 -fomit-frame-pointer2017020520170105
737488refgcc -funroll-loops -march=barcelona -O2 -fomit-frame-pointer2017020520170105
737864refgcc -funroll-loops -m64 -march=k8 -O2 -fomit-frame-pointer2017020520170105
737864refgcc -funroll-loops -march=k8 -O2 -fomit-frame-pointer2017020520170105
739120refgcc -funroll-loops -O2 -fomit-frame-pointer2017020520170105
739120refgcc -funroll-loops -m64 -O2 -fomit-frame-pointer2017020520170105
744432refgcc -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer2017020520170105
760552refgcc -funroll-loops -O -fomit-frame-pointer2017020520170105
760572refgcc -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer2017020520170105
761020refgcc -funroll-loops -m64 -march=nocona -O -fomit-frame-pointer2017020520170105
761132refgcc -funroll-loops -march=nocona -O -fomit-frame-pointer2017020520170105
762136refgcc -funroll-loops -m64 -O -fomit-frame-pointer2017020520170105
762844refgcc -funroll-loops -march=barcelona -O -fomit-frame-pointer2017020520170105
763128refgcc -funroll-loops -march=k8 -O -fomit-frame-pointer2017020520170105
764408refgcc -funroll-loops -m64 -march=k8 -O -fomit-frame-pointer2017020520170105
764460refgcc -funroll-loops -m64 -march=barcelona -O -fomit-frame-pointer2017020520170105
799796refgcc -m64 -march=nocona -O2 -fomit-frame-pointer2017020520170105
799864refgcc -march=nocona -O -fomit-frame-pointer2017020520170105
807292refgcc -m64 -march=nocona -O -fomit-frame-pointer2017020520170105
807836refgcc -m64 -march=k8 -O -fomit-frame-pointer2017020520170105
808988refgcc -march=nocona -O2 -fomit-frame-pointer2017020520170105
809716refgcc -m64 -march=core2 -msse4.1 -O -fomit-frame-pointer2017020520170105
810452refgcc -m64 -march=corei7 -O -fomit-frame-pointer2017020520170105
811228refgcc -m64 -march=core2 -msse4 -O -fomit-frame-pointer2017020520170105
811836refgcc -m64 -march=native -mtune=native -O -fomit-frame-pointer2017020520170105
813100refgcc -m64 -O -fomit-frame-pointer2017020520170105
815424refgcc -march=k8 -O2 -fomit-frame-pointer2017020520170105
815976refgcc -march=k8 -O -fomit-frame-pointer2017020520170105
817600refgcc -m64 -march=core2 -O -fomit-frame-pointer2017020520170105
819228refgcc -m64 -O2 -fomit-frame-pointer2017020520170105
821192refgcc -fno-schedule-insns -O -fomit-frame-pointer2017020520170105
823004refgcc -m64 -march=k8 -O2 -fomit-frame-pointer2017020520170105
823248refgcc -m64 -march=barcelona -O2 -fomit-frame-pointer2017020520170105
824876refgcc -fno-schedule-insns -O2 -fomit-frame-pointer2017020520170105
826676refgcc -O2 -fomit-frame-pointer2017020520170105
828012refgcc -O -fomit-frame-pointer2017020520170105
830136refgcc -m64 -march=barcelona -O -fomit-frame-pointer2017020520170105
830412refgcc -march=barcelona -O2 -fomit-frame-pointer2017020520170105
833956refgcc -m64 -march=core2 -msse4.1 -O2 -fomit-frame-pointer2017020520170105
838940refgcc -march=barcelona -O -fomit-frame-pointer2017020520170105
842040refgcc -m64 -march=native -mtune=native -O2 -fomit-frame-pointer2017020520170105
842600refgcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv2017020520170105
845032refgcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv2017020520170105
845268refgcc -m64 -march=core2 -O2 -fomit-frame-pointer2017020520170105
848556refgcc -m64 -march=core2 -msse4 -O2 -fomit-frame-pointer2017020520170105
849860refgcc -m64 -march=corei7 -O2 -fomit-frame-pointer2017020520170105
920104refgcc -m64 -march=corei7 -Os -fomit-frame-pointer2017020520170105
931744refgcc -march=barcelona -Os -fomit-frame-pointer2017020520170105
934660refgcc -m64 -Os -fomit-frame-pointer2017020520170105
936004refgcc -m64 -march=native -mtune=native -Os -fomit-frame-pointer2017020520170105
936552refgcc -m64 -march=core2 -msse4 -Os -fomit-frame-pointer2017020520170105
936684refgcc -m64 -march=core2 -msse4.1 -Os -fomit-frame-pointer2017020520170105
936688refgcc -m64 -march=core2 -Os -fomit-frame-pointer2017020520170105
937388refgcc -fno-schedule-insns -Os -fomit-frame-pointer2017020520170105
937936refgcc -march=k8 -Os -fomit-frame-pointer2017020520170105
942296refgcc -m64 -march=barcelona -Os -fomit-frame-pointer2017020520170105
943332refgcc -march=nocona -Os -fomit-frame-pointer2017020520170105
945152refgcc -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer2017020520170105
945248refgcc -m64 -march=nocona -Os -fomit-frame-pointer2017020520170105
947700refgcc -m64 -march=k8 -Os -fomit-frame-pointer2017020520170105
951092refgcc -funroll-loops -m64 -march=nocona -Os -fomit-frame-pointer2017020520170105
951284refgcc -funroll-loops -march=nocona -Os -fomit-frame-pointer2017020520170105
951596refgcc -Os -fomit-frame-pointer2017020520170105
952624refgcc -funroll-loops -march=barcelona -Os -fomit-frame-pointer2017020520170105
954248refgcc -funroll-loops -m64 -Os -fomit-frame-pointer2017020520170105
957284refgcc -funroll-loops -m64 -march=k8 -Os -fomit-frame-pointer2017020520170105
957980refgcc -funroll-loops -march=k8 -Os -fomit-frame-pointer2017020520170105
958096refgcc -funroll-loops -Os -fomit-frame-pointer2017020520170105
958124refgcc -funroll-loops -m64 -march=barcelona -Os -fomit-frame-pointer2017020520170105
962564refgcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv2017020520170105
2943696refcc2017020520170105
3224608refgcc -funroll-loops2017020520170105
3239672refgcc2017020520170105

Compiler output

Implementation: crypto_aead/scream12v3/sse
Compiler: cc
scream.c: scream.c:202:10: error: '__builtin_ia32_pshufb128' needs target feature ssse3
scream.c: A = __builtin_ia32_pshufb128(table, t0);
scream.c: ^
scream.c: scream.c:203:10: error: '__builtin_ia32_pshufb128' needs target feature ssse3
scream.c: C = __builtin_ia32_pshufb128(table, t1);
scream.c: ^
scream.c: scream.c:207:10: error: '__builtin_ia32_pshufb128' needs target feature ssse3
scream.c: B = __builtin_ia32_pshufb128(table, t0);
scream.c: ^
scream.c: scream.c:208:10: error: '__builtin_ia32_pshufb128' needs target feature ssse3
scream.c: D = __builtin_ia32_pshufb128(table, t1);
scream.c: ^
scream.c: scream.c:215:10: error: '__builtin_ia32_pshufb128' needs target feature ssse3
scream.c: A ^= __builtin_ia32_pshufb128(table, in[0]);
scream.c: ^
scream.c: scream.c:216:10: error: '__builtin_ia32_pshufb128' needs target feature ssse3
scream.c: C ^= __builtin_ia32_pshufb128(table, in[2]);
scream.c: ^
scream.c: scream.c:220:10: error: '__builtin_ia32_pshufb128' needs target feature ssse3
scream.c: B ^= __builtin_ia32_pshufb128(table, in[0]);
scream.c: ^
scream.c: scream.c:221:10: error: '__builtin_ia32_pshufb128' needs target feature ssse3
scream.c: D ^= __builtin_ia32_pshufb128(table, in[2]);
scream.c: ^
scream.c: scream.c:228:10: error: '__builtin_ia32_pshufb128' needs target feature ssse3
scream.c: ...

Number of similar (compiler,implementation) pairs: 5, namely:
CompilerImplementations
cc sse
clang -O3 -fomit-frame-pointer -Qunused-arguments sse
clang -mcpu=cortex-a8 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments sse
clang -mcpu=cortex-a9 -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments sse
clang -mcpu=native -mfpu=neon -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments sse

Compiler output

Implementation: crypto_aead/scream12v3/sse
Compiler: gcc
scream.c: scream.c: In function 'LBox16P':
scream.c: scream.c:202:10: warning: implicit declaration of function '__builtin_ia32_pshufb128' [-Wimplicit-function-declaration]
scream.c: A = __builtin_ia32_pshufb128(table, t0);
scream.c: ^~~~~~~~~~~~~~~~~~~~~~~~
scream.c: scream.c:202:8: error: incompatible types when assigning to type 'v16qi {aka __vector(16) char}' from type 'int'
scream.c: A = __builtin_ia32_pshufb128(table, t0);
scream.c: ^
scream.c: scream.c:203:8: error: incompatible types when assigning to type 'v16qi {aka __vector(16) char}' from type 'int'
scream.c: C = __builtin_ia32_pshufb128(table, t1);
scream.c: ^
scream.c: scream.c:207:8: error: incompatible types when assigning to type 'v16qi {aka __vector(16) char}' from type 'int'
scream.c: B = __builtin_ia32_pshufb128(table, t0);
scream.c: ^
scream.c: scream.c:208:8: error: incompatible types when assigning to type 'v16qi {aka __vector(16) char}' from type 'int'
scream.c: D = __builtin_ia32_pshufb128(table, t1);
scream.c: ^
scream.c: scream.c:215:7: error: conversion of scalar 'int' to vector 'v16qi {aka __vector(16) char}' involves truncation
scream.c: A ^= __builtin_ia32_pshufb128(table, in[0]);
scream.c: ^~
scream.c: scream.c:216:7: error: conversion of scalar 'int' to vector 'v16qi {aka __vector(16) char}' involves truncation
scream.c: C ^= __builtin_ia32_pshufb128(table, in[2]);
scream.c: ^~
scream.c: scream.c:220:7: error: conversion of scalar 'int' to vector 'v16qi {aka __vector(16) char}' involves truncation
scream.c: B ^= __builtin_ia32_pshufb128(table, in[0]);
scream.c: ^~
scream.c: ...

Number of similar (compiler,implementation) pairs: 70, namely:
CompilerImplementations
gcc sse
gcc -O2 -fomit-frame-pointer sse
gcc -O3 -fomit-frame-pointer sse
gcc -O -fomit-frame-pointer sse
gcc -Os -fomit-frame-pointer sse
gcc -fno-schedule-insns -O2 -fomit-frame-pointer sse
gcc -fno-schedule-insns -O3 -fomit-frame-pointer sse
gcc -fno-schedule-insns -O -fomit-frame-pointer sse
gcc -fno-schedule-insns -Os -fomit-frame-pointer sse
gcc -funroll-loops sse
gcc -funroll-loops -O2 -fomit-frame-pointer sse
gcc -funroll-loops -O3 -fomit-frame-pointer sse
gcc -funroll-loops -O -fomit-frame-pointer sse
gcc -funroll-loops -Os -fomit-frame-pointer sse
gcc -funroll-loops -fno-schedule-insns -O2 -fomit-frame-pointer sse
gcc -funroll-loops -fno-schedule-insns -O3 -fomit-frame-pointer sse
gcc -funroll-loops -fno-schedule-insns -O -fomit-frame-pointer sse
gcc -funroll-loops -fno-schedule-insns -Os -fomit-frame-pointer sse
gcc -funroll-loops -m64 -O2 -fomit-frame-pointer sse
gcc -funroll-loops -m64 -O3 -fomit-frame-pointer sse
gcc -funroll-loops -m64 -O -fomit-frame-pointer sse
gcc -funroll-loops -m64 -Os -fomit-frame-pointer sse
gcc -funroll-loops -m64 -march=barcelona -O2 -fomit-frame-pointer sse
gcc -funroll-loops -m64 -march=barcelona -O3 -fomit-frame-pointer sse
gcc -funroll-loops -m64 -march=barcelona -O -fomit-frame-pointer sse
gcc -funroll-loops -m64 -march=barcelona -Os -fomit-frame-pointer sse
gcc -funroll-loops -m64 -march=k8 -O2 -fomit-frame-pointer sse
gcc -funroll-loops -m64 -march=k8 -O3 -fomit-frame-pointer sse
gcc -funroll-loops -m64 -march=k8 -O -fomit-frame-pointer sse
gcc -funroll-loops -m64 -march=k8 -Os -fomit-frame-pointer sse
gcc -funroll-loops -m64 -march=nocona -O2 -fomit-frame-pointer sse
gcc -funroll-loops -m64 -march=nocona -O3 -fomit-frame-pointer sse
gcc -funroll-loops -m64 -march=nocona -O -fomit-frame-pointer sse
gcc -funroll-loops -m64 -march=nocona -Os -fomit-frame-pointer sse
gcc -funroll-loops -march=barcelona -O2 -fomit-frame-pointer sse
gcc -funroll-loops -march=barcelona -O3 -fomit-frame-pointer sse
gcc -funroll-loops -march=barcelona -O -fomit-frame-pointer sse
gcc -funroll-loops -march=barcelona -Os -fomit-frame-pointer sse
gcc -funroll-loops -march=k8 -O2 -fomit-frame-pointer sse
gcc -funroll-loops -march=k8 -O3 -fomit-frame-pointer sse
gcc -funroll-loops -march=k8 -O -fomit-frame-pointer sse
gcc -funroll-loops -march=k8 -Os -fomit-frame-pointer sse
gcc -funroll-loops -march=nocona -O2 -fomit-frame-pointer sse
gcc -funroll-loops -march=nocona -O3 -fomit-frame-pointer sse
gcc -funroll-loops -march=nocona -O -fomit-frame-pointer sse
gcc -funroll-loops -march=nocona -Os -fomit-frame-pointer sse
gcc -m64 -O2 -fomit-frame-pointer sse
gcc -m64 -O3 -fomit-frame-pointer sse
gcc -m64 -O -fomit-frame-pointer sse
gcc -m64 -Os -fomit-frame-pointer sse
gcc -m64 -march=k8 -O2 -fomit-frame-pointer sse
gcc -m64 -march=k8 -O3 -fomit-frame-pointer sse
gcc -m64 -march=k8 -O -fomit-frame-pointer sse
gcc -m64 -march=k8 -Os -fomit-frame-pointer sse
gcc -m64 -march=nocona -O2 -fomit-frame-pointer sse
gcc -m64 -march=nocona -O3 -fomit-frame-pointer sse
gcc -m64 -march=nocona -O -fomit-frame-pointer sse
gcc -m64 -march=nocona -Os -fomit-frame-pointer sse
gcc -march=barcelona -O2 -fomit-frame-pointer sse
gcc -march=barcelona -O3 -fomit-frame-pointer sse
gcc -march=barcelona -O -fomit-frame-pointer sse
gcc -march=barcelona -Os -fomit-frame-pointer sse
gcc -march=k8 -O2 -fomit-frame-pointer sse
gcc -march=k8 -O3 -fomit-frame-pointer sse
gcc -march=k8 -O -fomit-frame-pointer sse
gcc -march=k8 -Os -fomit-frame-pointer sse
gcc -march=nocona -O2 -fomit-frame-pointer sse
gcc -march=nocona -O3 -fomit-frame-pointer sse
gcc -march=nocona -O -fomit-frame-pointer sse
gcc -march=nocona -Os -fomit-frame-pointer sse

Compiler output

Implementation: crypto_aead/scream12v3/sse
Compiler: gcc -m64 -march=barcelona -O2 -fomit-frame-pointer
scream.c: scream.c: In function 'LBox16P':
scream.c: scream.c:202:10: warning: implicit declaration of function '__builtin_ia32_pshufb128' [-Wimplicit-function-declaration]
scream.c: A = __builtin_ia32_pshufb128(table, t0);
scream.c: ^~~~~~~~~~~~~~~~~~~~~~~~
scream.c: scream.c:202:8: error: incompatible types when assigning to type 'v16qi {aka __vector(16) char}' from type 'int'
scream.c: A = __builtin_ia32_pshufb128(table, t0);
scream.c: ^
scream.c: scream.c:203:8: error: incompatible types when assigning to type 'v16qi {aka __vector(16) char}' from type 'int'
scream.c: C = __builtin_ia32_pshufb128(table, t1);
scream.c: ^
scream.c: scream.c:207:8: error: incompatible types when assigning to type 'v16qi {aka __vector(16) char}' from type 'int'
scream.c: B = __builtin_ia32_pshufb128(table, t0);
scream.c: ^
scream.c: scream.c:208:8: error: incompatible types when assigning to type 'v16qi {aka __vector(16) char}' from type 'int'
scream.c: D = __builtin_ia32_pshufb128(table, t1);
scream.c: ^
scream.c: scream.c:215:7: error: conversion of scalar 'int' to vector 'v16qi {aka __vector(16) char}' involves truncation
scream.c: A ^= __builtin_ia32_pshufb128(table, in[0]);
scream.c: ^~
scream.c: scream.c:216:7: error: conversion of scalar 'int' to vector 'v16qi {aka __vector(16) char}' involves truncation
scream.c: C ^= __builtin_ia32_pshufb128(table, in[2]);
scream.c: ^~
scream.c: scream.c:220:7: error: conversion of scalar 'int' to vector 'v16qi {aka __vector(16) char}' involves truncation
scream.c: B ^= __builtin_ia32_pshufb128(table, in[0]);
scream.c: ^~
scream.c: ...
scream.c: scream.c: In function 'LBox16P':
scream.c: scream.c:202:10: warning: implicit declaration of function '__builtin_ia32_pshufb128' [-Wimplicit-function-declaration]
scream.c: A = __builtin_ia32_pshufb128(table, t0);
scream.c: ^~~~~~~~~~~~~~~~~~~~~~~~
scream.c: scream.c:202:8: error: incompatible types when assigning to type 'v16qi {aka __vector(16) char}' from type 'int'
scream.c: A = __builtin_ia32_pshufb128(table, t0);
scream.c: ^
scream.c: scream.c:203:8: error: incompatible types when assigning to type 'v16qi {aka __vector(16) char}' from type 'int'
scream.c: C = __builtin_ia32_pshufb128(table, t1);
scream.c: ^
scream.c: scream.c:207:8: error: incompatible types when assigning to type 'v16qi {aka __vector(16) char}' from type 'int'
scream.c: B = __builtin_ia32_pshufb128(table, t0);
scream.c: ^
scream.c: scream.c:208:8: error: incompatible types when assigning to type 'v16qi {aka __vector(16) char}' from type 'int'
scream.c: D = __builtin_ia32_pshufb128(table, t1);
scream.c: ^
scream.c: scream.c:215:7: error: conversion of scalar 'int' to vector 'v16qi {aka __vector(16) char}' involves truncation
scream.c: A ^= __builtin_ia32_pshufb128(table, in[0]);
scream.c: ^~
scream.c: scream.c:216:7: error: conversion of scalar 'int' to vector 'v16qi {aka __vector(16) char}' involves truncation
scream.c: C ^= __builtin_ia32_pshufb128(table, in[2]);
scream.c: ^~
scream.c: scream.c:220:7: error: conversion of scalar 'int' to vector 'v16qi {aka __vector(16) char}' involves truncation
scream.c: B ^= __builtin_ia32_pshufb128(table, in[0]);
scream.c: ^~
scream.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
CompilerImplementations
gcc -m64 -march=barcelona -O2 -fomit-frame-pointer sse
gcc -m64 -march=barcelona -O3 -fomit-frame-pointer sse
gcc -m64 -march=barcelona -O -fomit-frame-pointer sse
gcc -m64 -march=barcelona -Os -fomit-frame-pointer sse