Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Extremely slow validation during training + cpu overload #51

Open
davidceka opened this issue Jul 17, 2022 · 2 comments
Open

Extremely slow validation during training + cpu overload #51

davidceka opened this issue Jul 17, 2022 · 2 comments

Comments

@davidceka
Copy link

Hi! We're testing your code on our dataset, the training iterations go on smoothly, but the validations in between the iterations are like extremely slow (4000 images in like 6 hours). In addition, every time the model starts validating, the cpu cores (we're testing on a 48 core cpu) all go to 100% even if i set the number of workers to 8 in the json file.
Do you have any idea why, or can I provide you with any information that may help address the issue?

@tfzhou
Copy link
Owner

tfzhou commented Jul 18, 2022

Hi @davidceka, it does not make sense to me that it will cost so much time for validation. I believe that this issue is agnostic with our algorithm, thus I will suggest running a simple model like DeepLab/HRNet to see whether they will work well. Or please try to see whether there are some similar issues in original OpenSeg repo.

@davidceka
Copy link
Author

@tfzhou Thanks for answering! We're also using another model which implements DeepLab and ResNet and we have good results with that, so I don't really know whats going on with that. I also reached out to your email to add some information (if you could kindly check I'd be the most happy). We thought the issue might have been in the size of the images (which were quite large) but we did a resize to 1024x512 to all of them but with the same outcome.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants