You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi ~
Thank you for your excellent work. I've been working on this recently, but I can't exactly reproduce the results in the paper through your open-source code. The dataset partitioning in my experiment refers to split files, and then all parameters are set by default. I would like to know if the results in your paper are averaged over the last few epochs or if the best results are obtained. And whether the experimental parameter settings are exactly the same for PACS and VLCS datasets?
Looking forward to your reply!
The text was updated successfully, but these errors were encountered:
Hi ~
Thank you for your interest in our work! The reported results are the average of the results of multiple runs, and each result is selected via validation accuracy.
As mentioned in readme, we find that many hyper-parameters can affect the performance (such as lrbl, epochb, lambdap), which can vary among different tasks/environments/software/hardware/random seeds, and thus carefully tunning is required. We find the best hyper-parameters are similar yet not exactly the same for PACS and VLCS datasets and the default ones show good performance in our environment.
We are sorry for the problem and trying to address it in future work.
@xxgege Thanks for your reply ~ There is one more thing. I am wondering if you can share the MNIST-M dataset split solution used in this experiment? Maybe the python file that made the MNIST-M dataset or a link to the made MNIST-M dataset? Thanks for your help!
Hi ~
Thank you for your excellent work. I've been working on this recently, but I can't exactly reproduce the results in the paper through your open-source code. The dataset partitioning in my experiment refers to split files, and then all parameters are set by default. I would like to know if the results in your paper are averaged over the last few epochs or if the best results are obtained. And whether the experimental parameter settings are exactly the same for PACS and VLCS datasets?
Looking forward to your reply!
The text was updated successfully, but these errors were encountered: