-
Great question! We actually do something like this for the PovertyMap dataset, so perhaps that would be a helpful reference? https://github.com/p-lambda/wilds/blob/main/wilds/datasets/poverty_dataset.py
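For concreteness, here is a minimal sketch (untested) of how those folds could be looped over, assuming the `fold` keyword that poverty_dataset.py exposes and its `id_test`/`test` split names; please double-check the file for the exact names:

```python
# Minimal sketch: loop over the PovertyMap cross-validation folds.
# Assumes the `fold` keyword from poverty_dataset.py (folds 'A'-'E') and the
# 'id_test' / 'test' split names; see the linked file for the exact values.
from wilds import get_dataset

for fold in ["A", "B", "C", "D", "E"]:
    dataset = get_dataset(dataset="poverty", download=True, fold=fold)
    train_data = dataset.get_subset("train")       # countries seen during training
    id_test_data = dataset.get_subset("id_test")   # in-distribution test split
    ood_test_data = dataset.get_subset("test")     # held-out (OOD) countries
    # ... train on train_data, evaluate on both test subsets,
    # collect the metrics, then average them across folds ...
```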
-
Hello and thank you for this amazing package.
Instead of using replicates, I would be interested in adding a cross-validation training and evaluation scheme based on the domain metadata.
Say a dataset has domains A, B, C. I would like to:

- hold out one domain (say C) as the out-of-distribution test set;
- split the remaining domains (A and B) 70-30 into a training set and an in-distribution test set;
- train on the 70% split and evaluate on both the 30% in-distribution split and the held-out domain C;
- repeat so that each domain is held out once;
- finally, average the in-distribution and out-of-distribution metrics to get the final performance.

Here the 70-30 split is arbitrary and should be modifiable. The rough sketch below shows what I have in mind.
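This sketch is untested and assumes that splits can be built directly from index arrays with WILDSSubset, and that the domain field is `hospital` as in Camelyon17:

```python
# Rough sketch of the leave-one-domain-out scheme described above.
# Assumes 'hospital' is the domain field (as in Camelyon17) and that WILDSSubset
# can be constructed directly from an array of example indices.
import numpy as np
from wilds import get_dataset
from wilds.common.grouper import CombinatorialGrouper
from wilds.datasets.wilds_dataset import WILDSSubset

dataset = get_dataset(dataset="camelyon17", download=True)
grouper = CombinatorialGrouper(dataset, ["hospital"])
# One domain id per example, derived from the metadata.
domains = grouper.metadata_to_group(dataset.metadata_array).numpy()

rng = np.random.default_rng(0)
for held_out in np.unique(domains):
    ood_idx = np.where(domains == held_out)[0]   # held-out domain = OOD test set
    in_idx = np.where(domains != held_out)[0]    # remaining domains
    rng.shuffle(in_idx)
    n_train = int(0.7 * len(in_idx))             # 70-30 split, should be configurable
    train_idx, id_test_idx = in_idx[:n_train], in_idx[n_train:]

    train_data = WILDSSubset(dataset, train_idx, transform=None)
    id_test_data = WILDSSubset(dataset, id_test_idx, transform=None)
    ood_test_data = WILDSSubset(dataset, ood_idx, transform=None)
    # ... train on train_data, evaluate on id_test_data and ood_test_data,
    # then average the ID and OOD metrics across the folds ...
```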
I am just starting to explore the package, having only replicated the ERM result on the camelyon17 dataset.
It seems that the grouper object might be a good starting point for implementing the procedure above, but I am still lacking a high-level overview of the code. How would you do this?