refactor: Provide helpful error msg when deserializing big timestamp #494

Xanewok · 2024-06-06T13:43:29Z

Fixes #435

In place of the existing U64OrUsize we provide a newtype Timestamp around u64 with a custom Deserialize impl that leniently first deserializes to a number/string and then attempts to parse the value to see if it's within the valid range.

We keep providing From and Into for the new type, so the existing code keeps using the impls whenever possible.
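To make the description above concrete, here is a minimal, standard-library-only sketch of the newtype idea. It is not the code from this PR: the real type sits behind a custom serde Deserialize impl, and the `from_wide` helper and its error text are assumptions invented for illustration.

```rust
/// Hypothetical stand-in for the `Timestamp` newtype described above.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub struct Timestamp(u64);

impl From<u64> for Timestamp {
    fn from(value: u64) -> Self {
        Timestamp(value)
    }
}

impl From<Timestamp> for u64 {
    fn from(value: Timestamp) -> Self {
        value.0
    }
}

impl Timestamp {
    /// Range check for a value that was first read into a wider integer.
    /// `from_wide` and the error text are assumptions made for this sketch.
    pub fn from_wide(value: u128) -> Result<Self, String> {
        if value > u128::from(u64::MAX) {
            Err(format!(
                "timestamp {} exceeds the maximum of 2^64 - 1 ({})",
                value,
                u64::MAX
            ))
        } else {
            // Lossless: the upper bound was checked just above.
            Ok(Timestamp(value as u64))
        }
    }
}
```

With the two From impls in place, call sites that previously worked with a plain u64 can keep using `.into()` in both directions, which is the point made about existing code continuing to use the impls.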

changeset-bot · 2024-06-06T13:43:32Z

🦋 Changeset detected

Latest commit: 913edb7

The changes in this PR will be included in the next version bump.

This PR includes changesets to release 1 package

Name	Type
@nomicfoundation/edr	Patch

Wodann · 2024-06-07T16:52:53Z

crates/edr_provider/src/requests/serde.rs

+            if let Some(hex_str) = value.strip_prefix("0x") {
+                u64::from_str_radix(hex_str, 16).map_err(|_err| error_msg())
+            } else {
+                value.parse::<u64>().map_err(|_err| error_msg())
+            }


This deviates from what used to happen. We don't want to allow any type of string here, but one that conforms to the Ethereum format: ^0x([1-9a-f]+[0-9a-f]*|0)$ (also see this).

I recommend using the U64 type to deserialize the string here, as that already implements this logic. All you'd have to do is to first check the number of bytes in the hex string after 0x (this part is required). If it exceeds 8 bytes, then return the custom error.
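The two checks suggested in this comment (the quantity format and the 8-byte limit) can be sketched with the standard library alone. This is an illustrative assumption, not the U64-based implementation being recommended; `is_canonical_quantity` and `parse_quantity_u64` are hypothetical names.

```rust
/// Returns `true` if `value` matches `^0x([1-9a-f]+[0-9a-f]*|0)$`.
pub fn is_canonical_quantity(value: &str) -> bool {
    let digits = match value.strip_prefix("0x") {
        Some(digits) => digits,
        None => return false,
    };
    if digits == "0" {
        return true;
    }
    let is_lower_hex = |c: char| matches!(c, '0'..='9' | 'a'..='f');
    let mut chars = digits.chars();
    match chars.next() {
        // The first digit must be non-zero; the rest may be any lowercase hex digit.
        Some(first) if first != '0' && is_lower_hex(first) => chars.all(is_lower_hex),
        _ => false,
    }
}

/// Hypothetical helper: validates the format, then rejects anything wider
/// than 8 bytes (16 hex digits) with a dedicated error message.
pub fn parse_quantity_u64(value: &str) -> Result<u64, String> {
    if !is_canonical_quantity(value) {
        return Err(format!("`{}` is not a valid hex quantity", value));
    }
    // Safe to slice: the validator guarantees an ASCII "0x" prefix.
    let digits = &value[2..];
    if digits.len() > 16 {
        return Err(format!("`{}` does not fit in 8 bytes (max 2^64 - 1)", value));
    }
    u64::from_str_radix(digits, 16).map_err(|err| err.to_string())
}
```

Checking the digit count before parsing is what lets the caller return the custom "too large" error instead of a generic parse failure.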

I checked ruint's types and found no special case that disallows leading zeroes; on top of that, they also accept octal (0o) and binary (0b) prefixes out of the box. See from_str et al. in https://docs.rs/crate/ruint/latest/source/src/string.rs.

For example, this passes on main:

#[test]
fn test_deserialize_quantity() {
    let json = r#""0x01""#;
    let mut deserializer = serde_json::Deserializer::from_str(json);
    let quantity = deserialize_quantity(&mut deserializer).unwrap();
    assert_eq!(quantity, U256::from(0x01));
}

Did this uncover a bug in our other deserializers as well? If so, do you want me to fix this as part of this PR or separately?

I'm also seeing some more inconsistencies, for example we only have Option<u64> using default Deserialize in AccountOverrideOptions for eth_call

edr/crates/edr_rpc_eth/src/override.rs

Line 18 in eb00b15

pub nonce: Option<u64>,

but the official API schema says it accepts only a hex string of the quantity format you mentioned:

Maybe it'd make sense to separate a work item of "go over all of our (de)serialize logic and make sure we're compliant"?

On the other hand, per the common networking adage of "be conservative in what you send, be liberal in what you accept" and given that we're supposed to be a development platform, I'm not sure that being less strict than the spec is necessarily a bad thing here.

I'll add it to the agenda for tomorrow's sync; maybe @fvictorio you have some more context or opinions on this.

See https://github.com/recmo/uint/blob/main/src/support/serde.rs#L136 for the serde implementations of ruint. They have additional logic on top of the FromStr implementation.

It also already handles visit_u64 and visit_u128, so it removes the need for any of the additional code you wrote, apart from the helpful error message.

For example, this passes on main:

#[test]
fn test_deserialize_quantity() {
    let json = r#""0x01""#;
    let mut deserializer = serde_json::Deserializer::from_str(json);
    let quantity = deserialize_quantity(&mut deserializer).unwrap();
    assert_eq!(quantity, U256::from(0x01));
}

Did this uncover a bug in our other deserializers as well? If so, do you want me to fix this as part of this PR or separately?

The example you wrote is valid. 0x01 is a single byte equal to 1. There are no leading zeroes.

@Wodann are you sure? This would also accept the 0X prefix as well as octal 0o/0O and binary 0b/0B, which I'm pretty sure we do not want to accept, given that the spec we aim to conform to only allows 0x: https://github.com/recmo/uint/blob/7b8b0d313a4960091c07eb3356c6a2512a46b585/src/string.rs#L200-L210

I don't want to prolong this but since we're already splitting hairs here, it's just worth noting that it'd not be consistent with what we do elsewhere, which is String::deserialize, then check for 0x and then call U64::from_str (on a value that's guaranteed to start with 0x, so okay).

Maybe just being consistent with the rest of the code would be okay, where we do the flow outlined above like everywhere else, and we could focus on compliance/correctness in a separate PR? WDYT?

Right, I forgot that we support direct numbers, which is where your suggestion of U64::deserialize came from. Still, we should not accept the other modes, so in 7635fc0 I tried to make explicit, in both the code and the error message, what we support and why.

Since we also accept JS numbers as an extension to the regular JSON-RPC API, it'd be weird not to support the same number written as a regular string (so also a base-10 number). I've added a comment and made the base more explicit; let me know what you think.
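A rough sketch of the string handling discussed in this comment, assuming the same shape as the diff quoted at the top of the thread: a lowercase 0x prefix selects base 16 and everything else is parsed as base 10. The function name and error wording are hypothetical, and the merged code ended up delegating to U64::deserialize instead.

```rust
/// Parses a timestamp string with an explicit base:
/// "0x..." is read as base 16, anything else as base 10.
/// `parse_timestamp_str` is a hypothetical name used only in this sketch.
pub fn parse_timestamp_str(input: &str) -> Result<u64, String> {
    let error_msg = || {
        format!(
            "invalid timestamp `{}`: expected a 0x-prefixed hex or base-10 integer up to 2^64 - 1",
            input
        )
    };

    if let Some(hex_digits) = input.strip_prefix("0x") {
        // Base 16, lowercase "0x" prefix only.
        u64::from_str_radix(hex_digits, 16).map_err(|_err| error_msg())
    } else {
        // Base 10. Inputs such as "0X2a", "0o17" or "0b101" fail here
        // because 'X', 'o' and 'b' are not decimal digits.
        input.parse::<u64>().map_err(|_err| error_msg())
    }
}
```

Because the prefix match is case-sensitive and the fallback is strictly decimal, the octal, binary and uppercase-hex forms are rejected without any extra branches.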

I only skimmed this discussion (let me know if there's something in particular I should pay attention to) but this caught my eye:

This would also accept 0X prefix as well as octal 0o/0O and binary 0b/0B which I'm pretty sure we do not want to accept

I checked and the released version of EDR accepts all of these (which is wrong by the spec, and geth doesn't accept):

send("eth_getBlockByNumber", [0, false])
send("eth_getBlockByNumber", ["0", false])
send("eth_getBlockByNumber", ["0b0", false])
send("eth_getBlockByNumber", ["0o0", false])

I will open an issue about this, but we don't need to do anything about that in this PR.

crates/edr_provider/tests/eth_request_serialization.rs

Xanewok · 2024-06-10T13:00:18Z

Reshuffled the tests to be first (to double-check what we do on main already) and force-pushed with expanded deserialize_timestamp test; hope you don't mind 🙏

Fixing the overly permissive deserialization of the timestamp, along with the acceptance of leading zeroes in other values, should be done in a separate PR.

fvictorio · 2024-06-13T09:48:48Z

hardhat-tests/test/internal/hardhat-network/provider/modules/evm.ts

+            "evm_increaseTime",
+            ["0x1111111111111111111111111111111111111"],
+            "Timestamp must be a positive integer or a string not exceeding 2^64 - 1"
+          );


Can we add four more test cases here?

Using exactly 2^64 produces the error too (using both hex-prefixed strings and decimal numbers)

Using exactly 2^64-1 doesn't produce the error (both hex-prefixed and decimal)
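For reference, the four boundary inputs requested above can be written out as data. This is a Rust sketch built on standard-library parsing, not the TypeScript tests added to hardhat-tests; `BOUNDARY_CASES` and `fits_in_u64` are names made up for the example.

```rust
/// The four boundary inputs as (digits, radix, expected to fit in u64).
/// Hex digits are listed without the "0x" prefix.
pub const BOUNDARY_CASES: [(&str, u32, bool); 4] = [
    ("ffffffffffffffff", 16, true),      // 2^64 - 1, hex
    ("18446744073709551615", 10, true),  // 2^64 - 1, decimal
    ("10000000000000000", 16, false),    // 2^64, hex
    ("18446744073709551616", 10, false), // 2^64, decimal
];

/// Returns `true` if the digits parse as a `u64` in the given radix.
pub fn fits_in_u64(digits: &str, radix: u32) -> bool {
    u64::from_str_radix(digits, radix).is_ok()
}
```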

I did that for the evm_setNextBlockTimestamp.

However, I now realized that evm_increaseTime actually internally converts the value to an i64... Does that mean we should support negative as well? This also means supporting up to 2**63-1. Is this something we want to pursue as part of this PR?
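The range mismatch raised here comes down to the u64 to i64 conversion: any value above 2^63 - 1 has no i64 representation. A small sketch, with `seconds_as_i64` as a hypothetical helper rather than anything in EDR:

```rust
/// Converts a `u64` number of seconds to `i64`, failing above 2^63 - 1.
/// `seconds_as_i64` is a hypothetical helper for illustration only.
pub fn seconds_as_i64(value: u64) -> Result<i64, String> {
    if value > i64::MAX as u64 {
        Err(format!(
            "{} exceeds the i64 maximum of 2^63 - 1 ({})",
            value,
            i64::MAX
        ))
    } else {
        // Lossless: the value is known to be at most i64::MAX.
        Ok(value as i64)
    }
}
```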

IIRC, evm_increaseTime was allowed to receive negative values in the past, but this was a bug. We fixed that in EDR, so no need for allowing negative values in the JSON-RPC.

The reason that we use an i64 in ProviderData is that the fn block_time_offset_seconds can return a negative value, if the first block is mined in the past.

Potentially we want to change that in the future and return an error. This might have been designed this way in Hardhat on purpose though, so I'll defer to @fvictorio for historic context.

Potentially we want to change that in the future and return an error.

No, this is not an error condition, it happens all the time. I don't remember exactly why; I think it's because in Hardhat the config is eagerly loaded (which then uses now() for the initial timestamp) but the node is created lazily after the first request, so normally the time offset will be -N, where N is the number of seconds that happened between the start of the task and the first request.
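As a worked illustration of why the offset is usually negative: if the configured initial timestamp is taken at config load and the node is created N seconds later, the difference is -N. The function below is a hedged sketch with made-up names and parameters; it is not the actual block_time_offset_seconds from ProviderData.

```rust
/// Signed difference, in seconds, between the configured initial timestamp
/// and the moment the node is actually created.
/// Names and parameters are assumptions made for this sketch.
pub fn time_offset_seconds(configured_start: u64, first_request_at: u64) -> i64 {
    // i128 holds any difference of two u64 values, so this cannot overflow.
    let diff = i128::from(configured_start) - i128::from(first_request_at);
    // Saturate rather than wrap for (unrealistic) extreme inputs.
    diff.max(i128::from(i64::MIN)).min(i128::from(i64::MAX)) as i64
}
```

For example, a config loaded at 1_718_000_000 and a first request at 1_718_000_007 gives an offset of -7 seconds.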

I'm not sure about the i64 part, but I'm happy with just having those tests for evm_setNextBlockTimestamp, so no need to block on that.

It looks like the situation may not be obvious on the i64 front, so I'll wait for @Wodann here. Once everyone agrees, I think we can finally merge this.

Xanewok commented Jun 6, 2024

Wodann requested changes Jun 7, 2024

Wodann assigned Xanewok Jun 7, 2024

Xanewok added 3 commits June 10, 2024 14:56

tests: Add TS integration tests

1c7e0f4

refactor: Provide helpful error msg when deserializing big timestamp

ad1cfc6

refactor: Use the new Timestamp in place of U64OrUsize

1d5d2dc

Xanewok force-pushed the refactor/improve-error-message-for-big-timestamps branch from d927e4d to 1d5d2dc Compare June 10, 2024 12:56

Try to clarify what we support as timestamps in JSON-RPC

7635fc0

Xanewok commented Jun 12, 2024

Xanewok requested a review from Wodann June 12, 2024 13:37

Xanewok assigned Wodann Jun 12, 2024

Use U64::deserialize in deserialize_timestamp for now

771698c

Xanewok commented Jun 13, 2024

fvictorio reviewed Jun 13, 2024

Change the error wording and add a test for 2^64-1

913edb7

fvictorio approved these changes Jun 14, 2024

Wodann approved these changes Jun 14, 2024

Xanewok merged commit 245fd07 into main Jun 14, 2024
41 checks passed

Xanewok deleted the refactor/improve-error-message-for-big-timestamps branch June 14, 2024 13:47

Wodann added this to the EDR v0.4.1 milestone Jul 11, 2024

Wodann removed their assignment Jul 11, 2024

github-actions bot locked as resolved and limited conversation to collaborators Oct 11, 2024
