Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Emergency shut off #121

Open
cmelone opened this issue Oct 8, 2024 · 2 comments
Open

Emergency shut off #121

cmelone opened this issue Oct 8, 2024 · 2 comments
Assignees
Labels
feature New feature or request low-priority

Comments

@cmelone
Copy link
Collaborator

cmelone commented Oct 8, 2024

Problem/Opportunity Statement

if we ever get into a scenario where gantry is going completely haywire or is not resolving an OOM, it would be nice to have a mechanism to disable dynamic allocations for specific packages or the entire system. This shouldn’t require going into the container or having any knowledge of how the program works.

What would success / a fix look like?

A simple web page behind gh auth accessible to a small group that has a form to disable dynamic allocations on a package level. For subsequent predictions, it should return no cpu limits and 64GB memory limit.

@cmelone cmelone added the feature New feature or request label Oct 8, 2024
@cmelone cmelone self-assigned this Oct 8, 2024
@cmelone
Copy link
Collaborator Author

cmelone commented Oct 23, 2024

@kwryankrattiger wondering if you think this is necessary given the feature you wrote into spack/spack#41622 to disable emergency allocation using env variables. The motivation here is to pausing gantry for specific packages, rather than disabling the entire system.

@kwryankrattiger
Copy link

I think this might make sense eventually, but to start I am guessing we will only want to completely disable it using the variable.

Managing and fine tuning something like this may even make sense as part of the dynamic mapping config. I wouldn't want to add additional services unless we needed it for some reason.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature New feature or request low-priority
Projects
None yet
Development

No branches or pull requests

2 participants