Better handle parallelism in cache priming #19721
So the current setup, doing things in phases, has the problem that we only parallelize up to the width of the crate graph at the current depth, which tends to be less than the number of threads available.
So the idea of this PR is to interleave the import map and crate symbols computation, such that we already work on those while potentially waiting on more def map computations to "unlock".
Is that correct?
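The interleaving being described could be sketched like this (a toy sketch, not rust-analyzer's actual scheduler; `schedule`, the crate graph, and the log format are all invented for illustration): a crate's def map becomes computable once its dependencies' def maps are done, and each finished def map immediately unlocks both the dependents' def maps and that crate's symbol work, instead of waiting for a whole phase to end.

```rust
use std::collections::VecDeque;

// `deps[c]` lists the crates that crate `c` depends on. Returns the order
// in which work items become runnable.
fn schedule(deps: &[Vec<usize>]) -> Vec<String> {
    let n = deps.len();
    // How many dependencies each crate still waits on.
    let mut pending: Vec<usize> = deps.iter().map(|d| d.len()).collect();
    // Reverse edges: who gets unlocked when crate `d` finishes.
    let mut dependents: Vec<Vec<usize>> = vec![Vec::new(); n];
    for (c, ds) in deps.iter().enumerate() {
        for &d in ds {
            dependents[d].push(c);
        }
    }
    // Crates with no dependencies are ready immediately.
    let mut ready: VecDeque<usize> = (0..n).filter(|&c| pending[c] == 0).collect();
    let mut log = Vec::new();
    while let Some(c) = ready.pop_front() {
        // Def map first: it is a prerequisite for everything else on this crate.
        log.push(format!("def map {c}"));
        // Symbols / import map for `c` could now run in parallel with the
        // remaining def maps; here they are just logged sequentially.
        log.push(format!("symbols {c}"));
        for &d in &dependents[c] {
            pending[d] -= 1;
            if pending[d] == 0 {
                ready.push_back(d);
            }
        }
    }
    log
}

fn main() {
    // Toy crate graph: crates 0 and 1 are roots; crate 2 depends on both.
    let deps = vec![vec![], vec![], vec![0, 1]];
    for line in schedule(&deps) {
        println!("{line}");
    }
}
```

The point of the sketch is that the symbol work for crate 0 is available as soon as crate 0's def map is done, even though crate 2's def map is still blocked.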
// The idea is that if we have a def map available to compute, we should do that first.
// This is because def map is a dependency of both import map and symbols. So if we have
// e.g. a def map and a symbols, if we compute the def map we can, after it completes,
// compute the def maps of dependencies, the existing symbols and the symbols of the
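As a rough illustration of the priority this comment describes (a minimal sketch, not the actual implementation; `Task` and `WorkQueue` are made up for illustration): when both kinds of work are queued, a worker picks a def map first, because only def maps unlock further work.

```rust
use std::collections::VecDeque;

#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum Task {
    DefMap(u32),    // crate id; computing this unlocks dependents
    ImportMap(u32), // only needs the crate's own def map
    Symbols(u32),   // likewise
}

#[derive(Default)]
struct WorkQueue {
    def_maps: VecDeque<Task>,
    other: VecDeque<Task>,
}

impl WorkQueue {
    fn push(&mut self, task: Task) {
        match task {
            Task::DefMap(_) => self.def_maps.push_back(task),
            _ => self.other.push_back(task),
        }
    }

    // Always drain def maps before import maps / symbols.
    fn pop(&mut self) -> Option<Task> {
        self.def_maps.pop_front().or_else(|| self.other.pop_front())
    }
}

fn main() {
    let mut queue = WorkQueue::default();
    queue.push(Task::Symbols(0));
    queue.push(Task::DefMap(1));
    queue.push(Task::ImportMap(0));
    queue.push(Task::DefMap(2));

    // Def maps come out first, even though they were pushed later.
    while let Some(task) = queue.pop() {
        println!("{task:?}");
    }
}
```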
Suggested change:
- // compute the def maps of dependencies, the existing symbols and the symbols of the
+ // compute the def maps of dependents, the existing symbols and the symbols of the
What is meant by "existing symbols"? The ones we already have? Why list that, given we already computed it?
I meant the symbols we already could compute when we computed the def map.
LGTM
crates_done: crate_def_maps_done,
crates_total: crate_def_maps_total,
Feels like we should be using the total sum numbers here and not just the def maps? Unsure, it might be a bit weird to see this be n / n in the status message without finishing yet (due to us working on symbol / import maps still).
Alternatively, we could swap out the `work_type` message once we are done with the crate def maps with something else.
Yeah, the problem with this PR is that the status report gets more complicated.
Yes, it is.
To make the best use of available cores and avoid wasting time waiting for other tasks. See the comments in the code for an explanation.
And make it parallel by default (and remove the `--parallel` flag) to mirror the IDE cache priming.
@Veykril I addressed most of your comments, but I don't know what to do with the status report, given that we can't know the total amount of work: it requires knowing how many modules each crate has, and we can know that only after the def map completes. I agree that seeing n/n crates done while it is still working is weird; if you have some idea I'd love to hear it.
Let's just swap the `work_type` message then. It could be confusing if users see "Indexing n/n" but cache priming does not finish.
Done that.
Make best use of all available CPU cores.
Closes #19711.
Also change the flags of the `prime-caches` CLI command, and make it parallel by default. This saves 0.305ms in the parallel `prime-caches` on omicron, although the benchmarks were very noisy. I ran into a Salsa panic while running parallel prime caches on buck2... Need to debug that, but it doesn't reproduce now.