Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Training details on data augmentation #207

Open
zaiweizhang opened this issue Nov 16, 2024 · 1 comment
Open

Training details on data augmentation #207

zaiweizhang opened this issue Nov 16, 2024 · 1 comment

Comments

@zaiweizhang
Copy link

Hi!

This work is very impressive. I have a question regarding the training pipeline for the student model with pseudo labels:

Are you following the exact same loss for the student training compared with V1?
That means: applying the data augmentation (color distortion, gaussian blurring, CutMix) during training.

@LiheYoung
Copy link
Contributor

Yes. But in V2, we find that when training smaller models (e.g., ViT-S and ViT-B-based models) with the pseudo label from the largest ViT-G-based model, the augmentations are not necessary.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants