Better LRU and Fix for Deadlock #120

0xForerunner · 2025-02-22T01:20:15Z

I discovered a deadlock in the rollup boost server. I've posted a fix here using a concurrent, lock free LRU cache. In general this should make the code here more readable and less prone to errors.

As a general note, I've noticed that the way mutex locks are being handled in this code base seems to be pretty fast and loose. There were several cases where locks were being held far longer than they need to be.

0xForerunner · 2025-02-22T01:21:41Z

src/server.rs

-        let mut block_hash_to_payload_ids = self.block_hash_to_payload_ids.lock().await;
-        let mut payload_id_to_span = self.payload_id_to_span.lock().await;


Deadlock can occur here, as locks are acquired in opposite order as in fn store above.

dmarzzz · 2025-02-22T01:27:48Z

Thanks for the call out and contribution! Would you be willing to open up an issue for any other lock behavior usage you see as problematic? An audit locks issue would also work

0xForerunner · 2025-02-22T01:29:11Z

@dmarzzz Yeah no problem! I'll probably just push another commit to this branch with a general lock cleanup. Already fixed a few so may as well keep it going :)

0xForerunner · 2025-02-22T01:58:01Z

Okay so I think I've cleaned up all the locks excluding those used in tests. Those should probably be cleaned up as some point as well, but let's leave that for another PR.

@dmarzzz lemme know if this looks good to you :)

avalonche · 2025-02-25T05:32:08Z

src/server.rs

-    payload_id_to_span: Arc<Mutex<LruCache<PayloadId, Arc<BoxedSpan>>>>,
-    local_to_external_payload_ids: Arc<Mutex<LruCache<PayloadId, PayloadId>>>,
+    tracer: BoxedTracer,
+    block_hash_to_payload_ids: Cache<B256, Vec<PayloadId>>,


curious how this Cache differs from LruCache?

It just lock free, which means we don't need to worry about locking ourselves. Helps avoid errors! Should be significantly faster as well, not that it really matters in this case haha.

avalonche

LGTM! Thank you for the PR, makes the trace logic much cleaner

0xForerunner · 2025-03-06T10:11:04Z

@avalonche feel free to merge this in when you're ready!

0xForerunner commented Feb 22, 2025

View reviewed changes

avalonche reviewed Feb 25, 2025

View reviewed changes

avalonche approved these changes Feb 25, 2025

View reviewed changes

0xForerunner added 2 commits March 6, 2025 02:02

Better LRU and fix for deadlock

0dc4171

Lock Cleanup

1584704

0xForerunner force-pushed the forerunner/deadlock-fix branch from fc13a4d to 4e1b965 Compare March 6, 2025 10:02

fixup

716ff7f

0xForerunner force-pushed the forerunner/deadlock-fix branch from 4e1b965 to 716ff7f Compare March 6, 2025 10:05

0xForerunner added 2 commits March 6, 2025 03:14

Merge branch 'main' into forerunner/deadlock-fix

c1e30c8

remove await

6d8a7bc

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Better LRU and Fix for Deadlock #120

Better LRU and Fix for Deadlock #120

0xForerunner commented Feb 22, 2025

0xForerunner Feb 22, 2025

dmarzzz commented Feb 22, 2025

0xForerunner commented Feb 22, 2025

0xForerunner commented Feb 22, 2025

avalonche Feb 25, 2025

0xForerunner Mar 6, 2025

avalonche left a comment

0xForerunner commented Mar 6, 2025

		let mut block_hash_to_payload_ids = self.block_hash_to_payload_ids.lock().await;
		let mut payload_id_to_span = self.payload_id_to_span.lock().await;

Better LRU and Fix for Deadlock #120

Are you sure you want to change the base?

Better LRU and Fix for Deadlock #120

Conversation

0xForerunner commented Feb 22, 2025

0xForerunner Feb 22, 2025

Choose a reason for hiding this comment

dmarzzz commented Feb 22, 2025

0xForerunner commented Feb 22, 2025

0xForerunner commented Feb 22, 2025

avalonche Feb 25, 2025

Choose a reason for hiding this comment

0xForerunner Mar 6, 2025

Choose a reason for hiding this comment

avalonche left a comment

Choose a reason for hiding this comment

0xForerunner commented Mar 6, 2025