/usr/bin/amazon-efs-mount-watchdog - OSError: [Errno 28] No space left on device #154
Hi @nk74, can you run a I found this online:
I'm wondering if perhaps there are too many files written to this directory.
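One rough way to check the "too many files" theory is to count the entries in the directory in question. The sketch below is only an illustration: `count_entries` is a hypothetical helper, not part of efs-utils, and the directory to inspect is left as a parameter because the thread does not name it here.

```python
import os


def count_entries(directory):
    """Count the immediate children of `directory`.

    Returns a (files, subdirectories) tuple. Symlinks are counted as
    files rather than followed. Hypothetical helper for illustration.
    """
    files = 0
    dirs = 0
    with os.scandir(directory) as entries:
        for entry in entries:
            if entry.is_dir(follow_symlinks=False):
                dirs += 1
            else:
                files += 1
    return files, dirs
```

Calling `count_entries(path)` on the suspect directory gives a quick sense of whether the number of files there is unusually large.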
Same result on each instance: fs-xxxx.xxxxx.20749 fs-xxxx.xxxx.20749+ stunnel-config.fs-xxxx.xxxxx.20749
I'm wondering if this is related to "Old EFS Certificates not removed". I'm worried that, with a long-running mount, these certs could be using up too many inodes because they never get cleaned up. Two things:
Inodes (instance A & B)
Certs
Looks like there are plenty of free inodes. I'll need to dig deeper into why that json.dump call may be throwing that error. We may need to add some logging output to see the size of the file that json.dump is attempting to write, and whether a bug is causing it to become oversized.
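For anyone wanting to check the same two things by hand, here is a minimal sketch, not the efs-utils implementation. It assumes a POSIX system (`os.statvfs` is not available on Windows), and the helper names `filesystem_headroom` and `serialized_size` are hypothetical. Errno 28 can be raised when either blocks or inodes run out, so the sketch reports both, plus the size a given state object would occupy once serialized.

```python
import json
import os


def filesystem_headroom(path):
    """Report free space and free inodes for the filesystem holding `path`.

    Uses os.statvfs, which is POSIX-only. Hypothetical helper.
    """
    st = os.statvfs(path)
    return {
        # Bytes available to unprivileged processes.
        "free_bytes": st.f_bavail * st.f_frsize,
        # Inodes available to unprivileged processes.
        "free_inodes": st.f_favail,
        # Total inodes on the filesystem.
        "total_inodes": st.f_files,
    }


def serialized_size(state):
    """Return the number of bytes `state` occupies as UTF-8 encoded JSON."""
    return len(json.dumps(state).encode("utf-8"))
```

Comparing `serialized_size(state)` against `filesystem_headroom(directory)["free_bytes"]` before writing would show whether the write could plausibly fit, under the assumption that nothing else fills the disk in between.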
Have you run into this lately? As part of the 1.35.0 release, we put in a debug line to track the size of the state file that we write to disk, which is the line that we saw crashing earlier in your log (the json.dump). You can enable debug logging with
Our servers regularly crash and become inaccessible via SSM / SSH. The only solution is to stop the server (sometimes it restarts normally, sometimes we have to destroy it).
After investigating, I found the following, which coincides with the servers becoming unavailable:
Storage: