This repository has been archived by the owner on Mar 12, 2024. It is now read-only.

Does DETR need a big batch_size and a dataset of at least 10k samples? #294

Closed
cyy21 opened this issue Nov 26, 2020 · 1 comment

Comments

cyy21 commented Nov 26, 2020

I followed the minimal implementation of DETR, using the pretrained ResNet-50 with frozen_bn, and implemented the loss and the matcher myself; both are tested. I then took 1 image and tried to overfit it, with the number of classes set to 1. After 3 epochs, the model's outputs are all the same. I printed the match indices during training and found that they differ from step to step (I expected the indices for 1 image to stay the same). I then forced the match indices to be the same, but the result did not change. I also noticed that #169 may be the same problem. So, what do you think about the problem? Does DETR need a big batch_size and a dataset of at least 10k samples?

@thibo73800

On my side, I followed the same procedure (with a TensorFlow implementation), and the model can overfit with just a few samples.
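On the match indices changing between steps: the matcher assigns ground-truth boxes to whichever queries are currently cheapest, so the indices depend on the predictions at that step and can move around early in training, even for a single image. Below is a minimal sketch of that behaviour, not the repository's matcher. It uses a brute-force search in place of `scipy.optimize.linear_sum_assignment`, leaves out the GIoU term, and the function name `match`, the weights, and all the numbers are made up for illustration.

```python
from itertools import permutations


def l1(a, b):
    """L1 distance between two boxes given as (cx, cy, w, h) tuples."""
    return sum(abs(x - y) for x, y in zip(a, b))


def match(pred_probs, pred_boxes, gt_boxes, w_class=1.0, w_box=5.0):
    """Return (query_index, gt_index) pairs with the lowest total cost.

    pred_probs[q] is the predicted probability of the (single) object
    class for query q. Weights are illustrative assumptions.
    """
    n_q, n_gt = len(pred_boxes), len(gt_boxes)
    best, best_cost = None, float("inf")
    # Brute force: try every way of assigning distinct queries to the targets.
    for queries in permutations(range(n_q), n_gt):
        cost = sum(
            -w_class * pred_probs[q] + w_box * l1(pred_boxes[q], gt_boxes[g])
            for g, q in enumerate(queries)
        )
        if cost < best_cost:
            best, best_cost = queries, cost
    return [(q, g) for g, q in enumerate(best)]


# One image, one ground-truth box, three object queries.
gt = [(0.5, 0.5, 0.2, 0.2)]

# Hypothetical predictions at two different training steps.
step_a_probs = [0.1, 0.9, 0.2]
step_a_boxes = [(0.1, 0.1, 0.1, 0.1), (0.5, 0.5, 0.2, 0.2), (0.9, 0.9, 0.3, 0.3)]
step_b_probs = [0.8, 0.1, 0.2]
step_b_boxes = [(0.5, 0.5, 0.25, 0.2), (0.1, 0.2, 0.1, 0.1), (0.9, 0.9, 0.3, 0.3)]

print(match(step_a_probs, step_a_boxes, gt))  # query 1 is the cheapest match
print(match(step_b_probs, step_b_boxes, gt))  # query 0 is the cheapest match
```

Same image and same target, but the matched query differs because the predictions differ. So changing indices are not by themselves a sign of a bug in the matcher; the outputs collapsing to identical values is the symptom to look into.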

cyy21 closed this as completed Dec 4, 2020