In all of our GPU calculations, we never use "shared memory"; instead, we rely heavily on storing data in the (fast but limited) on-chip, thread-local GPU registers. When the registers fill up, data spills over to (very slow but more abundant) off-chip, thread-local GPU local memory. There is a middle ground, however: GPU "shared memory", which in my recent experience can make computations 3x faster! Of course, coding with shared memory can increase the complexity of the codebase and thereby decrease its long-term readability/maintainability.
This GPU tutorial provides an EXCELLENT overview (Tutorial 5 especially was super enlightening)!
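For a concrete sense of what using shared memory looks like, here is a minimal sketch of the classic tiling pattern: a 1D 3-point moving-average stencil where each block stages its slice of the input into `__shared__` memory once, so neighbouring threads reuse each loaded value instead of re-reading global memory (or spilling intermediates to slow thread-local memory). The kernel name, block size, and boundary handling are illustrative, not taken from our codebase.

```cuda
#include <cuda_runtime.h>

#define BLOCK 256  // illustrative block size

// 3-point moving average: out[i] = (in[i-1] + in[i] + in[i+1]) / 3
__global__ void stencil3(const float *in, float *out, int n)
{
    // Per-block tile plus one halo element on each side.
    __shared__ float tile[BLOCK + 2];

    int gid = blockIdx.x * blockDim.x + threadIdx.x;
    int lid = threadIdx.x + 1;  // +1 offset leaves room for the left halo

    // Each thread loads one element of the tile from global memory.
    if (gid < n)
        tile[lid] = in[gid];

    // Edge threads additionally load the halo cells.
    if (threadIdx.x == 0)
        tile[0] = (gid > 0) ? in[gid - 1] : 0.0f;
    if (threadIdx.x == blockDim.x - 1)
        tile[lid + 1] = (gid + 1 < n) ? in[gid + 1] : 0.0f;

    __syncthreads();  // all shared-memory writes must finish before any reads

    // Three reads from fast shared memory instead of three from global memory.
    if (gid < n)
        out[gid] = (tile[lid - 1] + tile[lid] + tile[lid + 1]) / 3.0f;
}
```

Note the extra moving parts that come with the speedup: a fixed tile size, halo loads, and a mandatory `__syncthreads()` barrier (forgetting it is a classic race condition), which is exactly the readability/maintainability trade-off mentioned above.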