Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GPU out of memory error #5

Open
ajati opened this issue Jan 4, 2024 · 2 comments
Open

GPU out of memory error #5

ajati opened this issue Jan 4, 2024 · 2 comments

Comments

@ajati
Copy link

ajati commented Jan 4, 2024

https://github.com/thuml/SimMTM/blob/main/SimMTM_Forecasting/scripts/pretrain/ECL_script/ECL.sh fails with GPU out of memory error in a A100 80GB.

RuntimeError: CUDA out of memory. Tried to allocate 13.26 GiB (GPU 0; 79.15 GiB total capacity; 60.37 GiB already allocated; 7.43 GiB free; 70.33 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF

@linxi20
Copy link

linxi20 commented Jun 16, 2024

Hello, I have also encountered this issue. Have you resolved it now?

@iDestro
Copy link

iDestro commented Jul 2, 2024

I also have this problem, why?

Hardware: 3090

Script: bash ./scripts/pretrain/ECL_script/ECL.sh

torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 207.37 GiB (GPU 0; 23.70 GiB total capacity; 8.14 GiB already allocated; 13.37 GiB free; 9.09 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation.  See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants