-
Notifications
You must be signed in to change notification settings - Fork 1
/
DESCRIPTION
73 lines (73 loc) · 3.88 KB
/
DESCRIPTION
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
Package: mlr3resampling
Type: Package
Title: Resampling Algorithms for 'mlr3' Framework
Version: 2024.10.28
Authors@R: c(
person("Toby", "Hocking",
email="[email protected]",
role=c("aut", "cre"),
comment = c(ORCID="0000-0002-3146-0865")),
person("Michel", "Lang", , "[email protected]", role = "ctb",
comment = c(ORCID = "0000-0001-9754-0393", "Author of mlr3 when Resampling/ResamplingCV was copied/modified")),
person("Bernd", "Bischl", , "[email protected]", role = "ctb",
comment = c(ORCID = "0000-0001-6002-6980", "Author of mlr3 when Resampling/ResamplingCV was copied/modified")),
person("Jakob", "Richter", , "[email protected]", role = "ctb",
comment = c(ORCID = "0000-0003-4481-5554", "Author of mlr3 when Resampling/ResamplingCV was copied/modified")),
person("Patrick", "Schratz", , "[email protected]", role = "ctb",
comment = c(ORCID = "0000-0003-0748-6624", "Author of mlr3 when Resampling/ResamplingCV was copied/modified")),
person("Giuseppe", "Casalicchio", , "[email protected]", role = "ctb",
comment = c(ORCID = "0000-0001-5324-5966", "Author of mlr3 when Resampling/ResamplingCV was copied/modified")),
person("Stefan", "Coors", , "[email protected]", role = "ctb",
comment = c(ORCID = "0000-0002-7465-2146", "Author of mlr3 when Resampling/ResamplingCV was copied/modified")),
person("Quay", "Au", , "[email protected]", role = "ctb",
comment = c(ORCID = "0000-0002-5252-8902", "Author of mlr3 when Resampling/ResamplingCV was copied/modified")),
person("Martin", "Binder", , "[email protected]", role = "ctb"),
person("Florian", "Pfisterer", , "[email protected]", role = "ctb",
comment = c(ORCID = "0000-0001-8867-762X", "Author of mlr3 when Resampling/ResamplingCV was copied/modified")),
person("Raphael", "Sonabend", , "[email protected]", role = "ctb",
comment = c(ORCID = "0000-0001-9225-4654", "Author of mlr3 when Resampling/ResamplingCV was copied/modified")),
person("Lennart", "Schneider", , "[email protected]", role = "ctb",
comment = c(ORCID = "0000-0003-4152-5308", "Author of mlr3 when Resampling/ResamplingCV was copied/modified")),
person("Marc", "Becker", , "[email protected]", role = "ctb",
comment = c(ORCID = "0000-0002-8115-0400", "Author of mlr3 when Resampling/ResamplingCV was copied/modified")),
person("Sebastian", "Fischer", , "[email protected]", role = "ctb",
comment = c(ORCID = "0000-0002-9609-3197", "Author of mlr3 when Resampling/ResamplingCV was copied/modified"))
)
Description: A supervised learning algorithm inputs a train set,
and outputs a prediction function, which can be used on a test set.
If each data point belongs to a subset
(such as geographic region, year, etc), then
how do we know if subsets are similar enough so that
we can get accurate predictions on one subset,
after training on Other subsets?
And how do we know if training on All subsets would improve
prediction accuracy, relative to training on the Same subset?
SOAK, Same/Other/All K-fold cross-validation, <doi:10.48550/arXiv.2410.08643>
can be used to answer these question, by fixing a test subset,
training models on Same/Other/All subsets, and then
comparing test error rates (Same versus Other and Same versus All).
Also provides code for estimating how many train samples
are required to get accurate predictions on a test set.
License: GPL-3
URL: https://github.com/tdhock/mlr3resampling
BugReports: https://github.com/tdhock/mlr3resampling/issues
Imports:
data.table,
R6,
checkmate,
paradox,
mlr3 (>= 0.21.1),
mlr3misc
Suggests:
ggplot2,
animint2,
mlr3tuning,
lgr,
future,
testthat,
knitr,
markdown,
nc,
rpart,
directlabels
VignetteBuilder: knitr