Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[syntax-errors] Detect duplicate keys in match mapping patterns #17129

Open
wants to merge 5 commits into
base: main
Choose a base branch
from

Conversation

ntBre
Copy link
Contributor

@ntBre ntBre commented Apr 1, 2025

Summary

Detects duplicate literals in match mapping keys.

This PR also adds a source method to SemanticSyntaxContext to display the duplicated key in the error message by slicing out its range.

Test Plan

New inline tests.

Summary
--

Detects duplicate literals in `match` mapping keys.

This PR also adds a `source` method to `SemanticSyntaxContext` to display the
duplicated key in the error message by slicing out its range.

Test Plan
--

New inline tests.
@ntBre ntBre added rule Implementing or modifying a lint rule preview Related to preview mode features labels Apr 1, 2025
// complex numbers (`1 + 2j`) are allowed as keys but are not literals
// because they are represented as a `BinOp::Add` between a real number and
// an imaginary number
.filter(|key| key.is_literal_expr() || key.is_bin_op_expr())
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We could be a bit stricter here if we want (checking that the BinOp is an Add and the arguments are numbers), but we already report syntax errors for non-complex literals like 1 + 2, so I thought this might be sufficient.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah, I think that's fine, the parser should catch invalid complex literals.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We also have to allow f-string expressions. They're allowed for as long as they contain no placeholders and they should compare equal to their string equivalent (I think this is already handled by ComparableExpr).

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think f-strings are not allowed here by CPython, even without placeholders:

>>> match x:
...     case {f"x": 1}: ...
...
  File "<python-input-2>", line 2
    case {f"x": 1}: ...
         ^^^^^^^^^
SyntaxError: mapping pattern keys may only match literals and attribute lookups

In that case, I think our is_literal_expr is doing the right thing.

Copy link
Contributor

github-actions bot commented Apr 1, 2025

ruff-ecosystem results

Linter (stable)

✅ ecosystem check detected no linter changes.

Linter (preview)

✅ ecosystem check detected no linter changes.

Formatter (stable)

✅ ecosystem check detected no format changes.

Formatter (preview)

✅ ecosystem check detected no format changes.

Copy link
Member

@dhruvmanila dhruvmanila left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good, I'd suggest that we improve the diagnostic range and expand the check for all keys unless it creates an issue.

Comment on lines 284 to 289
Self::add_error(
ctx,
SemanticSyntaxErrorKind::DuplicateMatchKey(duplicate_key),
mapping.range,
);
break;
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think we should avoid breaking here and report all duplicate keys within a single mapping pattern. Or, do you see any limitation or challenges in that approach? What do you think?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I was just copying CPython, but that sounds reasonable too!

>>> match x:
...     case {"x": 1, "x": 2}: ...
...
  File "<python-input-223>", line 2
    case {"x": 1, "x": 2}: ...
         ^^^^^^^^^^^^^^^^
SyntaxError: mapping pattern checks duplicate key ('x')

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah, in general, we should prefer to surface all errors whenever possible which is one of the main motivation to have an error resilient parser :)

Comment on lines +645 to +657
/// ```pycon
/// >>> match x:
/// ... case {"x": 1, "x": 2}: ...
/// ...
/// File "<python-input-160>", line 2
/// case {"x": 1, "x": 2}: ...
/// ^^^^^^^^^^^^^^^^
/// SyntaxError: mapping pattern checks duplicate key ('x')
/// >>> match x:
/// ... case {x.a: 1, x.a: 2}: ...
/// ...
/// >>>
/// ```
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Interesting!

Also, I think you meant "python" or "py" and not "pycon" xD

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Oh I thought pycon was for the Python console [1]. That's what I use here on GitHub, although it doesn't usually do much syntax highlighting anyway.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Oh lol, I had no idea about that. I don't think that matters as those are going to be rendered in an editor or crates.io.

Comment on lines 661 to 665
1 | match x:
2 | case {"x": 1, "x": 2}: ...
| ^^^^^^^^^^^^^^^^ Syntax Error: mapping pattern checks duplicate key `"x"`
3 | case {b"x": 1, b"x": 2}: ...
4 | case {0: 1, 0: 2}: ...
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What happens when there are multiple different duplicate keys? Like case {0: 1, "x": 1, 0: 2, "x": 2}: ....

I think we should highlight all the subsequent duplicate keys i.e., in the above case we should highlight the second 0 and the second "x" instance.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah CPython just notes the first one (also inline with the break above), but I think this is a good idea.

>>> match x:
...     case {"x": 1, "x": 2, 0: 3, 0: 4}: ...
...
  File "<python-input-224>", line 2
    case {"x": 1, "x": 2, 0: 3, 0: 4}: ...
         ^^^^^^^^^^^^^^^^^^^^^^^^^^^^
SyntaxError: mapping pattern checks duplicate key ('x')

// complex numbers (`1 + 2j`) are allowed as keys but are not literals
// because they are represented as a `BinOp::Add` between a real number and
// an imaginary number
.filter(|key| key.is_literal_expr() || key.is_bin_op_expr())
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah, I think that's fine, the parser should catch invalid complex literals.

// complex numbers (`1 + 2j`) are allowed as keys but are not literals
// because they are represented as a `BinOp::Add` between a real number and
// an imaginary number
.filter(|key| key.is_literal_expr() || key.is_bin_op_expr())
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We also have to allow f-string expressions. They're allowed for as long as they contain no placeholders and they should compare equal to their string equivalent (I think this is already handled by ComparableExpr).

|
1 | match x:
2 | case {"x": 1, "x": 2}: ...
| ^^^ Syntax Error: mapping pattern checks duplicate key `"x"`
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Another use case for multi-span diagostics :)

Comment on lines 985 to 988
| |_______^ Syntax Error: mapping pattern checks duplicate key `"""x
y
z
"""`
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This will break our concise rendering where each message should only be a single line long. I don't have a good recommendation but it's a general concern with including source text as is in diagnostic messages. You can either replace new lines, truncate before the new line (and replace with ...), or not include the name if it is multiline.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I went with escaping the newlines to match CPython (as we discussed on Discord):

>>> match x:
...     case {
...     """x
...     y
...     z
...     """: 1,
...     """x
...     y
...     z
...     """: 2}: ...
...
  File "<python-input-0>", line 2
    case {
         ^
SyntaxError: mapping pattern checks duplicate key ('x\n    y\n    z\n    ')

I added a modified version of std::str::EscapeDefault from the Rust standard library. It's a little more heavily modified than I wanted because many of the functions in the real implementation are private, but it gets the job done and without too much code.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
preview Related to preview mode features rule Implementing or modifying a lint rule
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants