
fix: Require medium and large E2E jobs to use constraints-dev.txt to unblock jobs #256


Draft: wants to merge 1 commit into base branch main

Conversation

courtneypacheco
Contributor

courtneypacheco commented May 15, 2025

Overview

In the core repo, we are now using a constraints-dev.txt file to constrain our Python dependencies in the CI: https://github.com/instructlab/instructlab/blob/main/.github/workflows/e2e-nvidia-l40s-x4.yml#L175

This means our dependencies in requirements*.txt are actually uncapped, and the constraints file restricts the versions we run in the CI.

In this repo, we are pulling the E2E workflow from the core repo without actually passing in -c constraints-dev.txt. Therefore, we're running our E2E tests against newer package versions than we want in some situations.

As an example, we see PyTorch-induced errors like the one below because the core repo requires torch==2.6.0, but this repo does not constrain the version:
[Screenshot 2025-05-15 at 9 57 32 AM]
(Source: https://github.com/instructlab/eval/actions/runs/15044193998)
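
To illustrate the drift described above, here is a minimal Python sketch; it is not code from either repo. It assumes a constraints file made only of simple `name==version` pins. The helper names `parse_pins`, `normalize`, and `find_drift`, and the installed version `2.7.0`, are hypothetical and chosen only for the example.

```python
import re


def normalize(name):
    """Lowercase a package name and collapse runs of '-', '_' and '.' into '-'."""
    return re.sub(r"[-_.]+", "-", name.strip()).lower()


def parse_pins(constraints_text):
    """Return {package: version} for simple 'name==version' lines.

    Assumption: comments, blank lines, and anything that is not an exact
    '==' pin are skipped. Real constraints files can be richer than this.
    """
    pins = {}
    for raw in constraints_text.splitlines():
        line = raw.split("#", 1)[0]          # drop trailing comments
        line = line.split(";", 1)[0].strip() # drop environment markers
        if "==" not in line:
            continue
        name, version = line.split("==", 1)
        pins[normalize(name)] = version.strip()
    return pins


def find_drift(pins, installed):
    """Return {package: (pinned, installed)} where the two versions differ."""
    drift = {}
    for name, version in installed.items():
        key = normalize(name)
        if key in pins and pins[key] != version:
            drift[key] = (pins[key], version)
    return drift


if __name__ == "__main__":
    # torch==2.6.0 is the pin mentioned in this PR; 2.7.0 is a made-up
    # "newer" version standing in for what an unconstrained install pulls.
    pins = parse_pins("torch==2.6.0\n")
    print(find_drift(pins, {"torch": "2.7.0"}))
    # prints {'torch': ('2.6.0', '2.7.0')}
```

In the CI itself, the guard is pip's own constraint flag, for example `pip install -r requirements.txt -c constraints-dev.txt` (the requirements file name here is an assumption; the workflow may install several).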

…unblock jobs

In the core repo, we are using constraints-dev.txt to constrain our dependencies in the CI: https://github.com/instructlab/instructlab/blob/main/.github/workflows/e2e-nvidia-l40s-x4.yml#L175

In this repo, we are pulling the E2E workflow from the core repo without actually passing in `-c constraints-dev.txt`

Signed-off-by: Courtney Pacheco <[email protected]>
mergify bot added the CI/CD label (Affects CI/CD configuration) May 15, 2025

mergify bot added the ci-failure label May 15, 2025
@booxter
Contributor

booxter commented May 16, 2025

Please try the approach taken in instructlab/instructlab#3399 and see if it's any better.

@booxter
Contributor

booxter commented May 20, 2025

Applying a probable fix for flash-attn issues here: #260

Contributor

mergify bot commented May 20, 2025

This pull request has merge conflicts that must be resolved before it can be merged. @courtneypacheco please rebase it. https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

mergify bot added the needs-rebase label May 20, 2025
Labels: CI/CD (Affects CI/CD configuration), ci-failure, needs-rebase

2 participants