Training details on data augmentation #207

zaiweizhang · 2024-11-16T09:13:51Z

Hi!

This work is very impressive. I have a question regarding the training pipeline for the student model with pseudo labels:

Are you using exactly the same loss for student training as in V1?
That is, are you applying the data augmentations (color distortion, Gaussian blurring, CutMix) during training?

LiheYoung · 2024-12-18T07:38:16Z

Yes. In V2, however, we find that when training smaller models (e.g., ViT-S- and ViT-B-based models) with pseudo labels from the largest ViT-G-based model, the augmentations are not necessary.
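
For readers unfamiliar with the setup, here is a minimal sketch of how such an augmentation switch could look for the CutMix part. It is not the repository's training code: `cutmix_box`, `cutmix`, `student_inputs`, and the `use_strong_aug` flag are hypothetical names, the box-size range is an arbitrary assumption, and color distortion and Gaussian blurring are left out. Images and pseudo labels are plain 2D Python lists to keep the example dependency-free; in practice these would be tensor operations.

```python
import random


def cutmix_box(height, width, rng, min_frac=0.25, max_frac=0.75):
    """Sample a rectangle (top, left, bottom, right) inside an H x W image.

    The size range is an illustrative assumption, not a value from the paper.
    """
    box_h = int(height * rng.uniform(min_frac, max_frac))
    box_w = int(width * rng.uniform(min_frac, max_frac))
    top = rng.randint(0, height - box_h)   # randint is inclusive on both ends
    left = rng.randint(0, width - box_w)
    return top, left, top + box_h, left + box_w


def cutmix(img_a, img_b, box):
    """Return a copy of img_a with the box region replaced by img_b's pixels."""
    top, left, bottom, right = box
    mixed = [row[:] for row in img_a]      # copy so img_a is not modified
    for y in range(top, bottom):
        mixed[y][left:right] = img_b[y][left:right]
    return mixed


def student_inputs(img_a, img_b, label_a, label_b, use_strong_aug, rng):
    """Build (student_input, target) from two unlabeled images.

    label_a and label_b stand for the teacher's pseudo labels on the clean
    images. With use_strong_aug=False, the student sees the clean image.
    With use_strong_aug=True, image and pseudo label are mixed with the same
    box so that every pixel keeps the label of the image it came from.
    """
    if not use_strong_aug:
        return img_a, label_a
    box = cutmix_box(len(img_a), len(img_a[0]), rng)
    return cutmix(img_a, img_b, box), cutmix(label_a, label_b, box)
```

Under these assumptions, calling `student_inputs(..., use_strong_aug=False, ...)` corresponds to the case described above, where a smaller student is trained directly on the clean images with the larger model's pseudo labels.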
