-
Notifications
You must be signed in to change notification settings - Fork 1.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[red-knot] Add GitHub PR annotations when mdtests fail in CI #17150
Conversation
|
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is cool.
I don't think this should be a cargo feature, I'd use an environment variable instead. It would also be great if we can avoid that it has to run as its own job
89564e9
to
ff28416
Compare
5a3b914
to
e81f60f
Compare
e81f60f
to
494ef81
Compare
This comment was marked as resolved.
This comment was marked as resolved.
This comment was marked as resolved.
This comment was marked as resolved.
The PR is again ready for re-review :-) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks, and sorry for the back and forth.
No worries, and thank you for the review. It's much better for your suggestions! |
* origin/main: (35 commits) [red-knot] Callable types are disjoint from literals (#17160) [red-knot] Fix inference for `pow` between two literal integers (#17161) [red-knot] Add GitHub PR annotations when mdtests fail in CI (#17150) [red-knot] Fix equivalence of differently ordered unions that contain `Callable` types (#17145) [red-knot] Add initial set of tests for unreachable code (#17159) [`airflow`] Move `AIR302` to `AIR301` and `AIR303` to `AIR302` (#17151) ruff_db: simplify lifetimes on `DiagnosticDisplay` [red-knot] Detect division-by-zero in unions and intersections (#17157) [`airflow`] Add autofix infrastructure to `AIR302` name checks (#16965) [`flake8-bandit`] Mark `str` and `list[str]` literals as trusted input (`S603`) (#17136) [`airflow`] Add autofix for `AIR302` attribute checks (#16977) [`airflow`] Extend `AIR302` with additional symbols (#17085) [`airflow`] Move `AIR301` to `AIR002` (#16978) [`airflow`] Add autofix for `AIR302` method checks (#16976) ruff_db: switch diagnostic rendering over to `std::fmt::Display` [red-knot] Add 'Goto type definition' to the playground (#17055) red_knot_ide: update snapshots red_knot_python_semantic: remove comment about `TypeCheckDiagnostic` ruff_db: delete most of the old diagnostic code red_knot: use `Diagnostic` inside of red knot ...
Summary
This PR adds a CI job that causes GitHub to add annotations to a PR diff when mdtest assertions fail. For example:
Screenshot
Motivation
Debugging mdtest failures locally is currently a really nice experience:
mdtest.py
, you don't even need to recompile anything after changing an assertion in an mdtest, amd the test results instantly live-update with each change to the MarkDown fileDebugging mdtest failures in CI is much more unpleasant, however. Sometimes an error message is just
...which doesn't tell you very much unless you navigate to the line in question that has the failing mdtest assertion. The line in question might not even be touched by the PR, and even if it is, it can be hard to find the line if the PR touches many files. Unlike locally, you can't click on the error and jump straight to the line that contains the failing assertion. You also don't get colourised output in CI (#13939).
GitHub PR annotations should make it really easy to debug why mdtests are failing on PRs, making PR review much easier.
How it works
A new
mdtest_github_output_format
feature is added to thered_knot_python_semantic
crate. When the feature is enabled, mdtest failures are printed to the terminal using a format that causes GitHub to attach annotations to the PR diff. If the feature is not enabled, mdtest failures are printed using the same format that they were before, since that's what is best for local development.A new CI job is added to
ci.yaml
that runs only red-knot's mdtests with this feature flag enabled. The job takes around 1m40s to run to completion on GitHub'subuntu-latest
runner. The job only runs on PRs (not pushes tomain
or onworkflow_dispatch
events), and only runs if red-knot-related code changes as part of the PR.Test Plan
I opened a PR to my fork here with some bogus changes to an mdtest to show what it looks like when there are failures in CI and this job has been added. Scroll down to
crates/red_knot_python_semantic/resources/mdtest/type_properties/is_equivalent_to.md
on the "files changed" tab for that PR to see the annotations.