-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Figure 1 #2
Comments
Thanks for your interest in our work. The code for the meta learning experiments can be found in cifar10_meta_train.py. |
Thank you for your reply. However, this code is not consistent with Figure 1. net_train = copy.deepcopy(net) meta_opt = torch.optim.Adam(learnable_loss.parameters(), lr=1e-3, weight_decay=1e-5) for severity in cfg.CORRUPTION.SEVERITY:
import matplotlib.pyplot as plt outputs = learnable_loss(inputs) inputs_cpu = inputs.cpu().numpy() plt.figure(figsize=(10, 6)) ’‘’ |
Can you share what the plot looks like in your case? |
Is my process correct? I first use a certain loss to train a model on CIFAR10. Then, based on ‘CIFAR10_meta_train. py’, train the ‘meta_loss_transformer’ on four noises of CIFAR10-C. Then draw a picture. The input is a matrix of n * 10, with each value fixed, and then the first column is changed to [-10,40] (for squared loss, it is [-1,2]). Obtain the output of meta_loss_transformer. But the curve drawn so far is incorrect. In addition, there is a question as to the range of squared loss should be [0,1], but why is it [-1,2]. In addition, the output range of the CE loss model I trained is [-3, 18]. Is it my misunderstanding? |
First, the meta loss for the classifier trained with CE loss looks correct i think? The trend seems to match that in the paper. Second, for the squared loss, have you checked the performance of this meta learnt loss? It seems to be random. I recalled that the meta learning process could be unstable but make sure that the trained meta loss is a valid TTA loss. |
Thank you for your work and code!
I was wondering if you could provide the experimental setup and code for Figure 1?
Thank you.
The text was updated successfully, but these errors were encountered: