Cubed currently always implements the shuffle operation as an all-to-all rechunk, using the algorithm from rechunker. This creates an intermediate persistent Zarr store, and requires every chunk to be written to that store before any target chunk can be read back. Can we do better?
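To make the cost concrete, here is a minimal sketch (not Cubed's or rechunker's actual code; the file names, shapes and chunk sizes are made up) of the two-stage pattern: stage 1 writes every source chunk into an intermediate store whose chunks divide both the source and target chunks, and stage 2 can only start once all of stage 1 has finished.

```python
# Hedged sketch of the write-all-then-read-all rechunk pattern.
import numpy as np
import zarr

shape = (8, 8)
src = zarr.open("source.zarr", mode="w", shape=shape, chunks=(8, 2), dtype="f8")
src[:] = np.arange(64).reshape(shape)

# Target chunking is orthogonal to the source chunking, so every target chunk
# overlaps every source chunk: the worst-case all-to-all shuffle.
tgt = zarr.open("target.zarr", mode="w", shape=shape, chunks=(2, 8), dtype="f8")

# Intermediate store whose chunks divide both source and target chunks, so
# each stage only ever writes or reads whole intermediate chunks.
inter = zarr.open("intermediate.zarr", mode="w", shape=shape, chunks=(2, 2), dtype="f8")

# Stage 1: one task per source chunk copies it into the intermediate store.
for j in range(0, shape[1], 2):          # source chunks are (8, 2) columns
    inter[:, j:j + 2] = src[:, j:j + 2]

# Stage 2: only after *all* stage-1 writes have finished can tasks assemble
# whole target chunks from the intermediate store; this barrier plus the extra
# round trip through persistent storage is the cost in question.
for i in range(0, shape[0], 2):          # target chunks are (2, 8) rows
    tgt[i:i + 2, :] = inter[i:i + 2, :]
```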
We could consider using a different storage service for the intermediate data, such as a different Zarr implementation, Google TensorStore (see #187), or maybe Redis. These are all still fundamentally the same shuffle operation, though.
Another idea is to narrow down the set of situations in which we actually need a full rechunk. There are some trivial cases (see #256, and the sketch after this paragraph), but there might be others. @dcherian had an idea for somehow representing rechunk operations as blockwise, which I would like to hear more about!
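As one hypothetical example of such a trivial case (again just a sketch, not Cubed's API): if the target chunks evenly divide the source chunks (pure chunk splitting), then every output chunk depends on exactly one input chunk, so the rechunk can run as an embarrassingly parallel blockwise-style map with no intermediate store and no barrier.

```python
# Hedged sketch: chunk *splitting* needs no all-to-all shuffle.
import numpy as np
import zarr

shape = (8, 8)
src = zarr.open("source.zarr", mode="w", shape=shape, chunks=(4, 4), dtype="f8")
src[:] = np.arange(64).reshape(shape)

# Target chunks (2, 2) evenly divide the source chunks (4, 4).
tgt = zarr.open("target.zarr", mode="w", shape=shape, chunks=(2, 2), dtype="f8")

def split_one_source_chunk(i, j, size=4):
    """One independent task per source chunk: read it once and write the
    target chunks it fully contains. No intermediate store, no barrier."""
    block = src[i:i + size, j:j + size]
    tgt[i:i + size, j:j + size] = block   # covers whole (2, 2) target chunks

# In Cubed these would be independent (serverless) tasks; a plain loop here.
for i in range(0, shape[0], 4):
    for j in range(0, shape[1], 4):
        split_one_source_chunk(i, j)
```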
Pedro Lopez pointed us towards the Primula paper, saying it implements an efficient serverless shuffle (for a big sorting operation). I'm not sure I understand it well enough yet, but my impression is that it's essentially the same save-everything-to-intermediate-blob-storage idea we're already using, plus some further minor optimizations.
EDIT: Correct link to Primula paper