The current masking code is fairly naive and masks a byte at a time. But pretty much everyone has 64-bit operations (and some even have 128-bit SIMD operations).
It's possible that optimizers are already good enough to vectorize this, but from what I observe on godbolt, even under -O3 neither clang nor gcc optimizes it well, at least on x86.
The interesting concern here is dealing with misaligned data, which is not an issue on x86 and is usually not an issue on modern ARM (aarch64).
Fixing this would be a substantial win for high-bandwidth WebSocket messages. A rough sketch of the idea is below.
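As a minimal sketch (the function name and signature are hypothetical, not the library's actual API): replicate the 4-byte mask into a 64-bit word and XOR 8 bytes per iteration, using `memcpy` for the loads and stores so misaligned buffers stay well-defined on every architecture.

```c
#include <stdint.h>
#include <string.h>
#include <stddef.h>

/* Hypothetical helper, not the library's real API: XOR `len` bytes of `buf`
 * in place with the 4-byte WebSocket mask, 8 bytes at a time. `offset` is
 * the position within the frame payload, since the mask rotates every
 * 4 bytes. */
static void mask_payload(uint8_t *buf, size_t len,
                         const uint8_t mask[4], size_t offset)
{
    /* Build a 64-bit word holding the mask repeated twice, rotated so that
     * byte i of the word lines up with payload position offset + i. */
    uint8_t rot[8];
    for (size_t i = 0; i < 8; i++)
        rot[i] = mask[(offset + i) % 4];

    uint64_t wide;
    memcpy(&wide, rot, sizeof wide);

    size_t i = 0;

    /* Main loop: 8 bytes per iteration. Since 8 is a multiple of the 4-byte
     * mask length, the wide word never needs re-rotating inside the loop. */
    for (; i + 8 <= len; i += 8) {
        uint64_t chunk;
        memcpy(&chunk, buf + i, sizeof chunk);
        chunk ^= wide;
        memcpy(buf + i, &chunk, sizeof chunk);
    }

    /* Byte-at-a-time tail for the last 0-7 bytes. */
    for (; i < len; i++)
        buf[i] ^= mask[(offset + i) % 4];
}
```

The `memcpy`-based loads and stores are the portable way to sidestep the alignment question: compilers turn them into single unaligned loads/stores on x86 and aarch64, and they remain correct (if slower) anywhere unaligned access is expensive.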