Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dataset split. #511

Open
Abdielfer opened this issue Apr 28, 2023 · 1 comment
Open

Dataset split. #511

Abdielfer opened this issue Apr 28, 2023 · 1 comment

Comments

@Abdielfer
Copy link
Collaborator

I notice percent in the split dataset do not match the expected proportions. It seems split is made before filtering patches by min_annot_perc.

@Abdielfer
Copy link
Collaborator Author

More details:
When I do split by percent, I expect the %val + %trn to match the total of the patch. Instead, I have fewer tails than I expected in the validation set, or training set, or both. It seems that the split is made from the total of patches, and then each split is filtered to the %of_annotation. This logic leads to the deletion of some tiles and the mismatch of expected number of tiles per split.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant