
Memory issue? #11
Open
HOMGH opened this issue Nov 7, 2022 · 7 comments

HOMGH commented Nov 7, 2022

Hi,
Thanks for sharing your code.
I got "scripts/ddpm/train_interpreter.sh: line 6: 3842 Killed python train_interpreter.py --exp experiments/${DATASET}/ddpm.json $MODEL_FLAGS" error.

I have ~65G RAM available on my Ubuntu. Considering your note that "it requires ~210Gb for 50 training images of 256x256."
Does it mean that it's not feasible to train the model on my system? How about evaluation?
Thanks in advance.

dbaranchuk (Contributor) commented:

You can try to: (i) reduce the number of training images, (ii) reduce the feature dimensionality, or (iii) store the features on disk and process them in batches.
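For example, a minimal sketch of option (iii) could look like the snippet below, assuming the per-pixel features are written into one memory-mapped .npy file and the pixel classifier is then trained on batches read back from disk. The file name, shapes, and the extract_features_for_image helper are only illustrative placeholders, not the repo's actual code:

```python
import numpy as np

# Illustrative sizes: 30 images, 8448-dim features, 256x256 resolution.
n_imgs, dim, h, w = 30, 8448, 256, 256

# Allocate the feature matrix on disk instead of in RAM.
X = np.lib.format.open_memmap(
    'features.npy', mode='w+', dtype=np.float16,
    shape=(n_imgs * h * w, dim),
)

for i in range(n_imgs):
    feats = extract_features_for_image(i)   # hypothetical helper, returns [dim, h, w]
    X[i * h * w:(i + 1) * h * w] = feats.reshape(dim, -1).T
X.flush()

# Later, read the file back memory-mapped and train on one batch at a time,
# so only a single batch ever lives in RAM.
X = np.load('features.npy', mmap_mode='r')
batch_size = 65536
for start in range(0, X.shape[0], batch_size):
    batch = np.asarray(X[start:start + batch_size], dtype=np.float32)
    # ... feed `batch` (and the matching slice of labels) to the pixel classifier ...
```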

Yi-Lynn commented Feb 4, 2023

> You can try to: (i) reduce the number of training images, (ii) reduce the feature dimensionality, or (iii) store the features on disk and process them in batches.

Hi, thanks for the wonderful work and code you provided. Can you explain how I would implement the last one (iii)? How do I process the features in batches if they are so huge that I cannot even store them on disk? I have encountered the same problem as @HOMGH: the extracted pixel representations are so huge that the process running the script is killed before the prepare_data() function returns the data. For example, when I run ddpm on the cat_15 dataset with the original experiment setting of 30 training images, the process crashes when the program reaches the following two lines.

```python
X = X.transpose(1, 0, 2, 3).reshape(d, -1).transpose(1, 0)  # X of shape [30, 8448, 256, 256] becomes [30*256*256, 8448]
y = y.flatten()
```

My solution is to write another prepare_data() function that processes one image at a time instead of all labelled training images at once. Then, during training of the pixel classifier, at each epoch I create a dataloader for one image and iterate through all training images, as sketched below. However, there is a gap between the final evaluation results I get and yours. Do you have any suggestions for that?
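A rough sketch of this per-image approach, assuming one .npy feature array of shape [dim, H, W] and one .npy label array of shape [H, W] per training image (the names below are simplified placeholders, not my exact code):

```python
import numpy as np
import torch
from torch.utils.data import Dataset, DataLoader


class SingleImagePixelDataset(Dataset):
    """Pixel features and labels of ONE training image, loaded from disk on demand."""

    def __init__(self, feature_path, label_path):
        feats = np.load(feature_path)    # [dim, H, W]
        labels = np.load(label_path)     # [H, W]
        dim = feats.shape[0]
        self.X = torch.from_numpy(feats.reshape(dim, -1).T).float()  # [H*W, dim]
        self.y = torch.from_numpy(labels.reshape(-1)).long()         # [H*W]

    def __len__(self):
        return self.X.shape[0]

    def __getitem__(self, idx):
        return self.X[idx], self.y[idx]


def train_one_epoch(classifier, optimizer, criterion, image_paths):
    # One dataloader per image, so only one image's pixel features are in memory.
    for feature_path, label_path in image_paths:
        loader = DataLoader(SingleImagePixelDataset(feature_path, label_path),
                            batch_size=4096, shuffle=True)
        for X_batch, y_batch in loader:
            optimizer.zero_grad()
            loss = criterion(classifier(X_batch), y_batch)
            loss.backward()
            optimizer.step()
```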

Thanks a lot for your help.

Qyunhao commented Feb 11, 2023

> My solution is to write another prepare_data() function that processes one image at a time instead of all labelled training images at once. …

Hi, can you send me a copy of your code? I really need it. Thank you very much!

MariemOualha commented:


Hello @Yi-Lynn, can you send me your code? Thank you.

SahadevPoudel commented:

@Yi-Lynn Could you please send me your code?
@MariemOualha Did you implement it? If so, could you send me the code?

choidaedae commented Aug 17, 2023

@SahadevPoudel Hi, Poudel! I had the same problem, and I succeeded in implementing it in the way described above. My version declares a class called 'DividedImageLabelDataset' in src/datasets.py, which is imported and used by the training code. Please comment if you still want the code!

yunzhuC commented Feb 4, 2024

@choidaedae Hello! I'm experiencing the same problem, but I don't know how to fix it. Could you share the code you modified? My email address is [email protected], or you can use any other method that is convenient for you. Thanks a lot.
