Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fixed iterator to only store data for that rank. #216

Merged
merged 4 commits into from
Aug 27, 2024
Merged

Conversation

hariharan-devarajan
Copy link
Collaborator

  • we fix reconfiguration logic to build rank map for train and valid.
  • Now we need reconfigure per epoch to shuffle the list.

- we fix reconfiguration logic to build rank map for train and valid.
- Now we need reconfigure per epoch to shuffle the list.
Copy link
Member

@zhenghh04 zhenghh04 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This PR addresses reconfiguration.

In reconfigure, global shuffling is performed. The data loader only have the indices of the local samples. It still has the shuffling on the pytorch / dali data loader which need to be fixed.

Please send another PR to fixed the redundant shuffling issue @hariharan-devarajan

@zhenghh04 zhenghh04 merged commit 8cabad2 into main Aug 27, 2024
12 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants