Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[red-knot] Add GitHub PR annotations when mdtests fail in CI #17150

Merged
merged 5 commits into from
Apr 2, 2025

Conversation

AlexWaygood
Copy link
Member

@AlexWaygood AlexWaygood commented Apr 2, 2025

Summary

This PR adds a CI job that causes GitHub to add annotations to a PR diff when mdtest assertions fail. For example:

Screenshot

image

Motivation

Debugging mdtest failures locally is currently a really nice experience:

  • Errors are displayed with pretty colours, which makes them much more readable
  • If you run the test from inside an IDE, you can CTRL-click on a path and jump directly to the line that had the failing assertion
  • If you use mdtest.py, you don't even need to recompile anything after changing an assertion in an mdtest, amd the test results instantly live-update with each change to the MarkDown file

Debugging mdtest failures in CI is much more unpleasant, however. Sometimes an error message is just

[static-assert-error] Argument evaluates to False

...which doesn't tell you very much unless you navigate to the line in question that has the failing mdtest assertion. The line in question might not even be touched by the PR, and even if it is, it can be hard to find the line if the PR touches many files. Unlike locally, you can't click on the error and jump straight to the line that contains the failing assertion. You also don't get colourised output in CI (#13939).

GitHub PR annotations should make it really easy to debug why mdtests are failing on PRs, making PR review much easier.

How it works

A new mdtest_github_output_format feature is added to the red_knot_python_semantic crate. When the feature is enabled, mdtest failures are printed to the terminal using a format that causes GitHub to attach annotations to the PR diff. If the feature is not enabled, mdtest failures are printed using the same format that they were before, since that's what is best for local development.

A new CI job is added to ci.yaml that runs only red-knot's mdtests with this feature flag enabled. The job takes around 1m40s to run to completion on GitHub's ubuntu-latest runner. The job only runs on PRs (not pushes to main or on workflow_dispatch events), and only runs if red-knot-related code changes as part of the PR.

Test Plan

I opened a PR to my fork here with some bogus changes to an mdtest to show what it looks like when there are failures in CI and this job has been added. Scroll down to crates/red_knot_python_semantic/resources/mdtest/type_properties/is_equivalent_to.md on the "files changed" tab for that PR to see the annotations.

@AlexWaygood AlexWaygood added ci Related to internal CI tooling red-knot Multi-file analysis & type inference labels Apr 2, 2025
Copy link
Contributor

github-actions bot commented Apr 2, 2025

mypy_primer results

No ecosystem changes detected ✅

Copy link
Contributor

github-actions bot commented Apr 2, 2025

ruff-ecosystem results

Linter (stable)

✅ ecosystem check detected no linter changes.

Linter (preview)

✅ ecosystem check detected no linter changes.

Formatter (stable)

✅ ecosystem check detected no format changes.

Formatter (preview)

✅ ecosystem check detected no format changes.

Copy link
Member

@MichaReiser MichaReiser left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is cool.

I don't think this should be a cargo feature, I'd use an environment variable instead. It would also be great if we can avoid that it has to run as its own job

@AlexWaygood AlexWaygood force-pushed the alex/mdtest-github-output-2 branch from 89564e9 to ff28416 Compare April 2, 2025 13:32
@AlexWaygood AlexWaygood marked this pull request as draft April 2, 2025 15:25
@AlexWaygood AlexWaygood force-pushed the alex/mdtest-github-output-2 branch 6 times, most recently from 5a3b914 to e81f60f Compare April 2, 2025 16:43
@AlexWaygood AlexWaygood force-pushed the alex/mdtest-github-output-2 branch from e81f60f to 494ef81 Compare April 2, 2025 16:52
@AlexWaygood AlexWaygood marked this pull request as ready for review April 2, 2025 17:01
@AlexWaygood

This comment was marked as resolved.

@AlexWaygood AlexWaygood requested a review from MichaReiser April 2, 2025 17:05
@AlexWaygood

This comment was marked as resolved.

@AlexWaygood
Copy link
Member Author

Okay great, annotations are still working on the latest version of the PR!

Screenshot

image

@AlexWaygood
Copy link
Member Author

The PR is again ready for re-review :-)

Copy link
Member

@MichaReiser MichaReiser left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks, and sorry for the back and forth.

@AlexWaygood
Copy link
Member Author

No worries, and thank you for the review. It's much better for your suggestions!

@AlexWaygood AlexWaygood merged commit 195bb43 into main Apr 2, 2025
23 checks passed
@AlexWaygood AlexWaygood deleted the alex/mdtest-github-output-2 branch April 2, 2025 20:51
dcreager added a commit that referenced this pull request Apr 3, 2025
* origin/main: (35 commits)
  [red-knot] Callable types are disjoint from literals (#17160)
  [red-knot] Fix inference for `pow` between two literal integers (#17161)
  [red-knot] Add GitHub PR annotations when mdtests fail in CI (#17150)
  [red-knot] Fix equivalence of differently ordered unions that contain `Callable` types (#17145)
  [red-knot] Add initial set of tests for unreachable code (#17159)
  [`airflow`] Move `AIR302` to `AIR301` and `AIR303` to `AIR302` (#17151)
  ruff_db: simplify lifetimes on `DiagnosticDisplay`
  [red-knot] Detect division-by-zero in unions and intersections (#17157)
  [`airflow`] Add autofix infrastructure to `AIR302` name checks (#16965)
  [`flake8-bandit`] Mark `str` and `list[str]` literals as trusted input (`S603`) (#17136)
  [`airflow`] Add autofix for `AIR302` attribute checks (#16977)
  [`airflow`] Extend `AIR302` with additional symbols (#17085)
  [`airflow`] Move `AIR301` to `AIR002` (#16978)
  [`airflow`] Add autofix for `AIR302` method checks (#16976)
  ruff_db: switch diagnostic rendering over to `std::fmt::Display`
  [red-knot] Add 'Goto type definition' to the playground (#17055)
  red_knot_ide: update snapshots
  red_knot_python_semantic: remove comment about `TypeCheckDiagnostic`
  ruff_db: delete most of the old diagnostic code
  red_knot: use `Diagnostic` inside of red knot
  ...
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
ci Related to internal CI tooling red-knot Multi-file analysis & type inference
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants