Implementation notes: amd64, hydra5, crypto_aead/scream12v1

Computer: hydra5
Microarchitecture: amd64; K10 32nm (300f10)
Architecture: amd64
CPU ID: AuthenticAMD-00300f10-178bfbff
SUPERCOP version: 20240425
Operation: crypto_aead
Primitive: scream12v1
   Time  Object size  Test size       Implementation  Compiler                                                                             Benchmark date  SUPERCOP version
 516200  11109 0 0    34532 812 1088  T:ref           gcc_-march=native_-mtune=native_-O3_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE         20240428        20240425
 534170  8068 0 0     29488 860 1024  T:ref           clang_-march=native_-O2_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE  20240428        20240425
 534415  8276 0 0     30600 860 1024  T:ref           clang_-mcpu=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE   20240428        20240425
 534426  8276 0 0     30600 860 1024  T:ref           clang_-march=native_-O3_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE  20240428        20240425
1117832  5326 0 0     24498 852 1024  T:ref           clang_-march=native_-Os_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE  20240428        20240425
1170398  5754 0 0     25632 860 1024  T:ref           clang_-march=native_-O_-fomit-frame-pointer_-fwrapv_-Qunused-arguments_-fPIC_-fPIE   20240428        20240425
1242203  6903 0 0     29028 812 1088  T:ref           gcc_-march=native_-mtune=native_-O2_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE         20240428        20240425
1300833  6441 0 0     27612 812 1088  T:ref           gcc_-march=native_-mtune=native_-O_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE          20240428        20240425
1329753  5980 0 0     25471 788 1056  T:ref           gcc_-march=native_-mtune=native_-Os_-fomit-frame-pointer_-fwrapv_-fPIC_-fPIE         20240428        20240425

Compiler output

Implementation: T:sse
Security model: timingleaks
Compiler: clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE
scream.c: scream.c:185:10: error: '__builtin_ia32_pshufb128' needs target feature ssse3
scream.c: A = __builtin_ia32_pshufb128(table, t0);
scream.c: ^
scream.c: scream.c:186:10: error: '__builtin_ia32_pshufb128' needs target feature ssse3
scream.c: C = __builtin_ia32_pshufb128(table, t1);
scream.c: ^
scream.c: scream.c:190:10: error: '__builtin_ia32_pshufb128' needs target feature ssse3
scream.c: B = __builtin_ia32_pshufb128(table, t0);
scream.c: ^
scream.c: scream.c:191:10: error: '__builtin_ia32_pshufb128' needs target feature ssse3
scream.c: D = __builtin_ia32_pshufb128(table, t1);
scream.c: ^
scream.c: scream.c:198:10: error: '__builtin_ia32_pshufb128' needs target feature ssse3
scream.c: A ^= __builtin_ia32_pshufb128(table, in[0]);
scream.c: ^
scream.c: scream.c:199:10: error: '__builtin_ia32_pshufb128' needs target feature ssse3
scream.c: C ^= __builtin_ia32_pshufb128(table, in[2]);
scream.c: ^
scream.c: scream.c:203:10: error: '__builtin_ia32_pshufb128' needs target feature ssse3
scream.c: B ^= __builtin_ia32_pshufb128(table, in[0]);
scream.c: ^
scream.c: scream.c:204:10: error: '__builtin_ia32_pshufb128' needs target feature ssse3
scream.c: D ^= __builtin_ia32_pshufb128(table, in[2]);
scream.c: ^
scream.c: scream.c:211:10: error: '__builtin_ia32_pshufb128' needs target feature ssse3
scream.c: ...

Number of similar (compiler,implementation) pairs: 5, namely:
Compiler                                                                             Implementations
clang -march=native -O2 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE  T:sse
clang -march=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE  T:sse
clang -march=native -O -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE   T:sse
clang -march=native -Os -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE  T:sse
clang -mcpu=native -O3 -fomit-frame-pointer -fwrapv -Qunused-arguments -fPIC -fPIE   T:sse
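
Note on the errors above: every one of them points at __builtin_ia32_pshufb128, the builtin behind the SSSE3 PSHUFB byte shuffle. With -march=native the compiler enables only the features the build machine reports, and this K10 part apparently does not report ssse3. The following is a minimal sketch, not code from scream.c, of how a 16-byte table lookup could be guarded so that it still compiles on such a target. The helper name shuffle16 is hypothetical; the fallback reproduces PSHUFB semantics (low four index bits select a table byte, bit 7 forces a zero).

```c
#include <stdint.h>
#include <string.h>

#if defined(__SSSE3__)
#include <tmmintrin.h>  /* SSSE3 intrinsics; only included when the target has ssse3 */
#endif

/* Hypothetical helper (not part of scream.c):
   out[i] = 0 if bit 7 of idx[i] is set, else table[idx[i] & 15]. */
static void shuffle16(uint8_t out[16], const uint8_t table[16], const uint8_t idx[16])
{
#if defined(__SSSE3__)
    /* Hardware path: a single PSHUFB on unaligned 128-bit loads. */
    __m128i t = _mm_loadu_si128((const __m128i *)table);
    __m128i x = _mm_loadu_si128((const __m128i *)idx);
    _mm_storeu_si128((__m128i *)out, _mm_shuffle_epi8(t, x));
#else
    /* Portable path: byte-by-byte emulation. The temporary buffer keeps the
       result correct even if out aliases table or idx. */
    uint8_t tmp[16];
    for (int i = 0; i < 16; i++)
        tmp[i] = (idx[i] & 0x80) ? 0 : table[idx[i] & 0x0f];
    memcpy(out, tmp, 16);
#endif
}
```

Whether such a fallback would be acceptable for T:sse is a separate question; the sketch only shows why the builtin is unavailable here and what a guarded call site could look like.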

Compiler output

Implementation: T:sse
Security model: timingleaks
Compiler: gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE
scream.c: scream.c: In function 'LBox16P':
scream.c: scream.c:185:10: warning: implicit declaration of function '__builtin_ia32_pshufb128'; did you mean '__builtin_ia32_psubb128'? [-Wimplicit-function-declaration]
scream.c: 185 | A = __builtin_ia32_pshufb128(table, t0);
scream.c: | ^~~~~~~~~~~~~~~~~~~~~~~~
scream.c: | __builtin_ia32_psubb128
scream.c: scream.c:185:10: error: incompatible types when assigning to type 'v16qi' {aka '__vector(16) char'} from type 'int'
scream.c: scream.c:186:10: error: incompatible types when assigning to type 'v16qi' {aka '__vector(16) char'} from type 'int'
scream.c: 186 | C = __builtin_ia32_pshufb128(table, t1);
scream.c: | ^~~~~~~~~~~~~~~~~~~~~~~~
scream.c: scream.c:190:10: error: incompatible types when assigning to type 'v16qi' {aka '__vector(16) char'} from type 'int'
scream.c: 190 | B = __builtin_ia32_pshufb128(table, t0);
scream.c: | ^~~~~~~~~~~~~~~~~~~~~~~~
scream.c: scream.c:191:10: error: incompatible types when assigning to type 'v16qi' {aka '__vector(16) char'} from type 'int'
scream.c: 191 | D = __builtin_ia32_pshufb128(table, t1);
scream.c: | ^~~~~~~~~~~~~~~~~~~~~~~~
scream.c: scream.c:198:7: error: conversion of scalar 'int' to vector 'v16qi' {aka '__vector(16) char'} involves truncation
scream.c: 198 | A ^= __builtin_ia32_pshufb128(table, in[0]);
scream.c: | ^~
scream.c: scream.c:199:7: error: conversion of scalar 'int' to vector 'v16qi' {aka '__vector(16) char'} involves truncation
scream.c: 199 | C ^= __builtin_ia32_pshufb128(table, in[2]);
scream.c: | ^~
scream.c: scream.c:203:7: error: conversion of scalar 'int' to vector 'v16qi' {aka '__vector(16) char'} involves truncation
scream.c: 203 | B ^= __builtin_ia32_pshufb128(table, in[0]);
scream.c: | ^~
scream.c: scream.c:204:7: error: conversion of scalar 'int' to vector 'v16qi' {aka '__vector(16) char'} involves truncation
scream.c: ...

Number of similar (compiler,implementation) pairs: 4, namely:
Compiler                                                                      Implementations
gcc -march=native -mtune=native -O2 -fomit-frame-pointer -fwrapv -fPIC -fPIE  T:sse
gcc -march=native -mtune=native -O3 -fomit-frame-pointer -fwrapv -fPIC -fPIE  T:sse
gcc -march=native -mtune=native -O -fomit-frame-pointer -fwrapv -fPIC -fPIE   T:sse
gcc -march=native -mtune=native -Os -fomit-frame-pointer -fwrapv -fPIC -fPIE  T:sse
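
Note on the gcc output above: gcc does not declare __builtin_ia32_pshufb128 unless SSSE3 is enabled for the target, so the call is treated as an implicitly declared function returning int, and the vector assignment errors follow from that. One way to confirm that the machine itself lacks the feature is a runtime query. This is a small sketch using the __builtin_cpu_init and __builtin_cpu_supports builtins that gcc and clang provide on x86; the wrapper name cpu_has_ssse3 is hypothetical.

```c
/* Hypothetical wrapper: returns 1 if the running CPU reports SSSE3,
   0 if it does not, and -1 when the query is unavailable
   (non-x86 target or a compiler without these builtins). */
static int cpu_has_ssse3(void)
{
#if (defined(__x86_64__) || defined(__i386__)) && (defined(__GNUC__) || defined(__clang__))
    __builtin_cpu_init();  /* populate the CPU feature data before querying it */
    return __builtin_cpu_supports("ssse3") ? 1 : 0;
#else
    return -1;
#endif
}
```

On a host where this returns 0, -march=native would be expected to leave ssse3 disabled, which matches both failing builds of T:sse.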