[red-knot] Statically known branches #14759

sharkdp · 2024-12-03T15:07:47Z

Summary

Rendered version of the test suite including a proper introduction to the topic / motivation: click.

This changeset adds support for precise type-inference and boundness-handling of definitions inside control-flow branches with statically-known conditions, i.e. test-expressions whose truthiness we can unambiguously infer as always false or always true. In code:

x = 1

if "z" in "haystack":  # Literal[False]
    x = 2

reveal_type(x)  # revealed: Literal[1]

and:

x = 1

if "y" in "haystack":  # Literal[True]
    x = 2

reveal_type(x)  # revealed: Literal[2]

One option to implement this would have been to add special handling for a limited set of test-expressions in semantic index-building. We would then analyze expressions like sys.version_info >= (x, y), typing.TYPE_CHECKING, True and False without any type inference and consequently close down (or unconditionally open) branches whose truthiness we can analyze in this way. This would simplify the implementation, but is much less general than the approach taken here.

Instead, we collect all necessary information during semantic index building, and then re-analyze the control flow during type-checking. The way this works is by recording the list of 'active' branching conditions for each introduced binding. Consider this example:

if test1:
    x = 1  # active conditions: [test1]

    if test2:
        x = 2  # active conditions: [test1, test2]

if test3:
    x = 3  # active conditions: [test3]

use(x)

In semantic-index building, when we reach the use(x) statement, we have three live bindings. But when we later analyze the test-conditions in type inference, we may potentially conclude that some of them are not present/visible.

One possibility to do so is to detect test-conditions to be always false. For example, if test3 is statically known to be false, we can infer that the x = 3 branch is never taken. This binding is then not considered in type inference and in boundness-handling. For the x = 2 binding, we need to consider the possibility that either test1 or test2 can be statically false. In summary, if any of the active branching conditions is statically false, the binding is cancelled.

Another possibility is that we can infer a test-condition to be always true. This can also hide bindings. For example, if we can infer test2 to always be true, we know that the x = 1 binding is never visible. This is slightly more complex to handle, as we haven't even seen the test2 condition when we analyze the x = 1 binding in semantic-index building.

The way we solve this in type-inference is by going through all live bindings in reverse order (starting at x = 3), stopping whenever we detect a binding that is unconditionally visible. A binding is unconditionally visible if all of its active conditions are statically known to be true. This is slightly complicated by the fact that we have unconditional branching as well. For example, consider:

if test1:
    for _ in iterable():
        x = 1

This binding is not unconditionally visible, even if test1 is always true, as the loop body may never be executed. This is currently handled by adding a BranchingCondition::Unconditional marker to the active conditions of the binding, which always evaluates to an ambiguous truthiness.

Some of this might sound similar to what we achieve with the may_be_unbound and may_be_undeclared flags. However, the new mechanism does not allow us to get rid of these flags, as there are also cases like this:

if test:  # type: bool
    x = 1
else:
    x = 2

The symbol x is always bound, but seeing the x = 1 or x = 2 definitions with a branching condition of test: bool is not enough to see that. We also need the global information that these two branches will be merged (where we logically OR the two may_be_unbound = false flags to a global may_be_unbound = false).

This branch also includes:

statically-known branches handling for Boolean expressions and while loops
support for typing.TYPE_CHECKING
support for narrowing in while loops
new target-version requirements in some Markdown tests which were now required due to the understanding of sys.version_info branches.

closes #12700
closes #14170
closes #14861

Test plan

New Markdown tests for the main feature in statically-known-branches.md
New Markdown tests for typing.TYPE_CHECKING in known_constants.md
New Markdown tests for while-loop narrowing in narrow/while.md
Regression test for boundness-handling in while loops
Adapted tests for EllipsisType, Never, etc

Performance investigation

(thanks to @MichaReiser for some input!)

Measurements

I can reproduce the regression locally. It's not as pronounced as on codspeed, which could be attributed to the fact that the codspeed results do not involve any I/O overhead (they are based on CPU counters, not wall time). So the local results might be "diluted" a bit:

tomllib, -13%

Command	Mean [ms]	Min [ms]	Max [ms]	Relative
`./red_knot_main`	21.8 ± 1.5	18.8	27.0	1.00
`./red_knot_feature`	24.6 ± 1.6	21.2	29.1	1.13 ± 0.11

Black codebase, -7%

Command	Mean [ms]	Min [ms]	Max [ms]	Relative
`./red_knot_main`	123.4 ± 5.4	115.6	135.4	1.00
`./red_knot_feature`	131.6 ± 3.5	123.7	137.3	1.07 ± 0.05

Analysis (ongoing)

With this feature enabled, we need to perform more work. For every binding/declaration, we need to infer the type of all active branching conditions and call Type::bool on it (which may involve looking up other symbols). This can entail loading entire new modules. For example: in the tomllib benchmark, we do not need to resolve sys on main, but we do on this branch. The reason for this are various sys.version_info >= (M, m) conditions in builtins.pyi which now need to be resolved. I tried to account for this by adding a import sys; sys.version_info line in a local copy of tomllib, so the version on main would also have to import sys. This reduces the difference from 13% to 11%, but the comparison is still not fair. There are a lot more symbol-lookups on this feature branch (mostly __bool__).
In semantic-index building, we need to collect, store (and clone, when snapshotting) more information with the addition of the branching-condition data.

To do

Investigate performance regression

codspeed-hq · 2024-12-03T15:13:47Z

CodSpeed Performance Report

Merging #14759 will degrade performances by 21.62%

_{Comparing david/statically-known-branches (0a0e5e6) with main (c3a64b4)}

Summary

❌ 2 regressions
✅ 30 untouched benchmarks

⚠️ Please fix the performance issues or acknowledge them on CodSpeed.

Benchmarks breakdown

	Benchmark	`main`	`david/statically-known-branches`	Change
❌	`red_knot_check_file[cold]`	68 ms	86.8 ms	-21.62%
❌	`red_knot_check_file[incremental]`	4.2 ms	4.4 ms	-4.31%

## Summary `typing_extensions` has a `>=3.13` re-export for the `typing.NoDefault` singleton, but not for `typing._NoDefaultType`. This causes problems as soon as we understand `sys.version_info` branches, so we explicity switch to `typing._NoDefaultType` for Python 3.13 and later. This is a part of #14759 that I thought might make sense to break out and merge in isolation. ## Test Plan New test that will become more meaningful with #12700 --------- Co-authored-by: Micha Reiser <[email protected]>

crates/red_knot_python_semantic/src/semantic_index/use_def.rs

carljm

Impressively comprehensive tests, thank you!!

Per request, I only looked at the mdtests.

Just a few comments, nothing significant.

crates/red_knot_python_semantic/resources/mdtest/annotations/never.md

crates/red_knot_python_semantic/resources/mdtest/known_constants.md

crates/red_knot_python_semantic/resources/mdtest/type_alias.md

crates/red_knot_python_semantic/resources/mdtest/statically-known-branches.md

sharkdp · 2024-12-11T18:34:06Z

crates/red_knot_python_semantic/resources/mdtest/conditional/match.md

-match 0:
-    case 1:
-        y = 2
-    case _:
-        y = 3
-
-reveal_type(y)  # revealed: Literal[2, 3]


We are "too smart" now and infer Literal[3] here, as we can see that the first pattern can never match. To keep the original intent of the test, I changed the match target to an unknown int.

crates/red_knot_python_semantic/resources/mdtest/loops/while_loop.md

sharkdp · 2024-12-11T19:20:35Z

crates/red_knot_python_semantic/resources/mdtest/statically-known-branches.md

+
+# TODO: ideally, this should be Literal[2]
+reveal_type(x)  # revealed: Literal[2, 3]
+```


I just realized that this is problematic in cases with multiple "always True" patterns. At runtime, we'll take the first of these branches. But in our current implementation here, since we always walk definitions backwards, we'll take the last instead:

x = 1 match "a": case "a" if 1 == 1: x = 2 case "a": x = 3 reveal_type(x) # we infer Literal[3], but it should be Literal[2]

Before I attempt to fix this, it would be good to get some feedback on whether or not we want static-truthiness checking in match statements at all. I thought it was a nice stretch-task, but maybe it causes more trouble than it's worth. The examples presented here are not particularly useful, but there might be some use cases involving enums? Not sure.

I don't think we should spend a lot of extra time on implementation work that is specific to statically-known branches with match patterns; it's nice to have, but not clear how often it will be relevant.

I'm curious why this seems to affect match statements with multiple always-matching cases, but not elif chains with multiple always-true branches? The two would seem to have a lot in common. In both cases the key fact is that an always-true condition in an elif or case does not mean "we definitely take this path" (we might have already taken a previous one instead). It only means "we definitely do not take any branch after this one."

I'm curious why this seems to affect match statements with multiple always-matching cases, but not elif chains with multiple always-true branches?

It currently works for elif chains but not for match patterns because of negative constraints. The way it currently works is that we record all constraints (including negative ones) when a binding is created:

if flag: x = 1 # conditional on flag elif True: x = 2 # conditional on ~flag and True elif True: x = 3 # conditional on ~flag, ~True and True

The x = 3 binding is therefore not unconditionally visible, which currently leads us to correctly infer Literal[1, 2] for this case.

But now that I write it out like this, I also realize that this reveals a problem with my approach. Consider this example:

x = 1 if flag: x = 2 # conditional on flag elif True: x = 3 # conditional on ~flag, True

Here, we currently infer Literal[1, 2, 3], while it would be more correct to infer Literal[2, 3]. The problem is that we have the negative ~flag condition in the x = 3 binding, which prevents us from detecting it as "unconditionally visible". And if we would detect it as unconditionally visible, we would run into the problem you mentioned. We would block off any previous bindings and incorrectly infer Literal[3].

We could clearly do better.

I'm currently thinking that there are two problems:

It seems to me like the set of constraints that we apply for static truthiness analysis needs to be different from the set of constraints that we apply for narrowing. For narrowing, we need that ~flag constraint. But for static truthiness analysis, we don't care if flag has ambiguous truthiness or not, we always take an elif True branch.

When iterating the list of bindings in reverse, we should not stop at the first "unconditionally visible" binding, we need to go back to the earliest possible "unconditionally visible" binding.

sharkdp added red-knot Multi-file analysis & type inference no-test Disable CI tests for a pull request labels Dec 3, 2024

This comment was marked as resolved.

Sign in to view

sharkdp force-pushed the david/statically-known-branches branch 2 times, most recently from caa4c88 to 66524e7 Compare December 5, 2024 07:45

sharkdp mentioned this pull request Dec 5, 2024

[red-knot] Fallback for typing._NoDefaultType #14783

Merged

sharkdp force-pushed the david/statically-known-branches branch from 66524e7 to fa5d2b5 Compare December 5, 2024 08:20

MichaReiser reviewed Dec 5, 2024

View reviewed changes

crates/red_knot_python_semantic/src/semantic_index/use_def.rs Outdated Show resolved Hide resolved

sharkdp force-pushed the david/statically-known-branches branch 5 times, most recently from 4ae7a5b to 4bd513d Compare December 6, 2024 13:05

sharkdp removed the no-test Disable CI tests for a pull request label Dec 8, 2024

sharkdp force-pushed the david/statically-known-branches branch 2 times, most recently from 581f787 to 668fb66 Compare December 9, 2024 19:40

carljm reviewed Dec 9, 2024

View reviewed changes

sharkdp force-pushed the david/statically-known-branches branch 4 times, most recently from 5ae6a58 to 4ac3737 Compare December 11, 2024 13:38

sharkdp marked this pull request as ready for review December 11, 2024 16:51

sharkdp commented Dec 11, 2024

View reviewed changes

crates/red_knot_python_semantic/resources/mdtest/loops/while_loop.md Outdated Show resolved Hide resolved

sharkdp commented Dec 11, 2024

View reviewed changes

sharkdp mentioned this pull request Dec 11, 2024

[red-knot] add narrowing support in match class patterns #14740

Open

sharkdp force-pushed the david/statically-known-branches branch from 14b357c to 58a07ae Compare December 11, 2024 19:37

sharkdp mentioned this pull request Dec 11, 2024

[red-knot] Make attributes.md test future-proof #14923

Merged

sharkdp added 29 commits December 13, 2024 10:42

Add doc comment

d2cd3de

Clean up code in infer.rs

fcae092

Update bitset

e29f37b

Reinstate tests

0ec165f

Minor fixes

86747ed

Minor

42da9ee

Bring back TODO comment

33b9136

Remove Clone

c03be89

Visibility struct

72d09eb

Add introduction section

9c78c81

Fix clippy suggestions

acf2ce6

Fix doc test

04e80a8

Minor doc update

bee7cf1

Add tests for elif branches

0746cec

Minor doc updates

6ffa9e8

Update TODO

57c9102

Add 'import sys'

37359bb

Fix x=1,2,3 values in test

0372684

Add tests for multiple 'elif True' / 'elif False' branches

eae4796

Rename to record_ambiguous_branching

e59e8c9

Hard assert

88fef0b

Resolve TODO in tuple.md

b43e9f2

Document current limitations

ae470f0

Snake case

513795e

Remove Truthiness::from_bool

9b4bb86

Make is_ambiguous pub(crate)

741beb7

Derive Debug

d3f460c

Fix rogue conflict resolution

006a537

Minor

0a0e5e6

sharkdp force-pushed the david/statically-known-branches branch from f82d2b6 to 0a0e5e6 Compare December 13, 2024 09:43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[red-knot] Statically known branches #14759

[red-knot] Statically known branches #14759

sharkdp commented Dec 3, 2024 •

edited

Loading

codspeed-hq bot commented Dec 3, 2024 •

edited

Loading

This comment was marked as resolved.

carljm left a comment

sharkdp Dec 11, 2024

sharkdp Dec 11, 2024 •

edited

Loading

carljm Dec 12, 2024

sharkdp Dec 12, 2024

[red-knot] Statically known branches #14759

Are you sure you want to change the base?

[red-knot] Statically known branches #14759

Conversation

sharkdp commented Dec 3, 2024 • edited Loading

Summary

Test plan

Performance investigation

Measurements

Analysis (ongoing)

To do

codspeed-hq bot commented Dec 3, 2024 • edited Loading

CodSpeed Performance Report

Merging #14759 will degrade performances by 21.62%

Summary

Benchmarks breakdown

This comment was marked as resolved.

carljm left a comment

Choose a reason for hiding this comment

sharkdp Dec 11, 2024

Choose a reason for hiding this comment

sharkdp Dec 11, 2024 • edited Loading

Choose a reason for hiding this comment

carljm Dec 12, 2024

Choose a reason for hiding this comment

sharkdp Dec 12, 2024

Choose a reason for hiding this comment

sharkdp commented Dec 3, 2024 •

edited

Loading

codspeed-hq bot commented Dec 3, 2024 •

edited

Loading

sharkdp Dec 11, 2024 •

edited

Loading