Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Train.sh data loading stops at 70% #18

Open
Ariyaris opened this issue Nov 6, 2023 · 5 comments
Open

Train.sh data loading stops at 70% #18

Ariyaris opened this issue Nov 6, 2023 · 5 comments

Comments

@Ariyaris
Copy link

Ariyaris commented Nov 6, 2023

Dear all,

After installing DiffDock-PP as per installation guide I attempted to run train.sh script.
Unfortunately loading of data stops somewhere at 70%. To be more precise, it becomes very slow and then dies.

Does anybody have advice or a workaround to this challenge?

Thank you!

Luka

@TernencezzZ
Copy link

TernencezzZ commented Nov 16, 2023

Hi, @Ariyaris

I guess that there might be a memory overflow issue during ESM feature extraction. DIPSLoader computes embeds for all receptors and ligands from DIPS dataset in function compute_embeddings.

you can imagine how big it is:

image

Perhaps you can consider using some chunking methods to alleviate the memory pressure.

@onlyonewater
Copy link

hi, @TernencezzZ , could you reproduce the results with this code? if yes, can you tell me what parameters you used? like num_gpus, lr, ns, nv parameters.

@TernencezzZ
Copy link

hi, @onlyonewater

Actually no, due to the limit of my computer memory, I only used the first 8,000 data from the DIPS Dataset for training. I took the configuration in dips_esm.yaml and have only trained for around ten epochs. The loss is still decreasing.

image

image

@onlyonewater
Copy link

so do you use only one GPU? or use multi-GPU to train?

@Ariyaris
Copy link
Author

Perhaps you can consider using some chunking methods to alleviate the memory pressure.

Hi @TernencezzZ

Do you have advice on how I could use chunking methods?
I only have 1 GPU and I am quite the novice in all this so any advice is helpfull.

Yours sincerely,
Ariyaris

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants