Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Project roadmap #71

Open
5 tasks
cmelone opened this issue Jul 31, 2024 · 0 comments
Open
5 tasks

Project roadmap #71

cmelone opened this issue Jul 31, 2024 · 0 comments
Assignees

Comments

@cmelone
Copy link
Collaborator

cmelone commented Jul 31, 2024

This is a tracking issue used to document the current set of features we would like to integrate into gantry.

This thread should also be used to discuss new directions for the project.


Plan

  1. In the pilot phase, we will only be implementing predictions for requests, and ensuring that they will only increase compared to current allocations.
  2. If we see success in the pilot, we'll implement functionality which retries jobs with higher memory allocations if they've been shown to fail due to OOM kills.
  3. Then, we will "drop the floor" and allow the predictor to allocate less memory than the package is used to. At this step, requests will be fully implemented.
  4. Limits for CPU and memory will be implemented.
  5. Next, we want to introduce some experimentation in the system and perform a scaling study.
  6. Design a scheduler that decides which instance type a job should be placed on based on cost and expected usage and runtime.

Evaluation

The success of this framework can be evaluated against a number of factors:

  • Has the cost per job changed?
  • Are jobs being killed due to resource contention?
  • What is the error distribution of our predictions?
  • How much waste is there per build type?
@cmelone cmelone self-assigned this Jul 31, 2024
@cmelone cmelone changed the title Prioritized feature list Project roadmap Jul 31, 2024
@cmelone cmelone pinned this issue Aug 22, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant