
rpart fails to predict on titanic task #97

Closed
QuayAu opened this issue Dec 20, 2018 · 8 comments

QuayAu commented Dec 20, 2018

task = mlr_tasks$get("titanic")
learner = mlr_learners$get("classif.rpart")
resampling = mlr_resamplings$get("cv")

resample(task, learner, resampling)

throws:

 Error in model.frame.default(Terms, newdata, na.action = na.action, xlev = attr(object,  : 
  factor Cabin has new levels A14, A21, B101, B39, B69, B78, C104, C106, C116, C28, C49, C7, C82, C86, C95, D19, D9, F, T 
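
The same error is reproducible with rpart alone, independent of mlr3; here is a minimal sketch (the toy data is made up for illustration):

library(rpart)

train = data.frame(y = factor(c("a", "a", "b", "b")),
                   x = factor(c("p", "p", "q", "q")))
test  = data.frame(x = factor("r"))  # level "r" never seen during training

fit = rpart(y ~ x, data = train,
            control = rpart.control(minsplit = 2, cp = 0))
predict(fit, newdata = test)
# fails in model.frame.default() with the same "factor x has new levels" error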

mllg commented Dec 20, 2018

Titanic needs preprocessing; better not to use it for automatic tests.

mllg closed this as completed Dec 20, 2018

mllg commented Dec 20, 2018

You could create a preprocessed version and add it to the package, though.
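
A minimal sketch of what such a preprocessed version could look like, assuming the raw data from the CRAN titanic package; the dropped columns match the error above, but none of this is the eventual package code:

library(titanic)  # assumed data source (titanic_train)

titanic_clean = titanic_train
titanic_clean$Cabin  = NULL  # hundreds of near-unique levels, the cause of the error above
titanic_clean$Name   = NULL  # free text, not usable as a factor
titanic_clean$Ticket = NULL  # near-unique identifier
titanic_clean$Survived = factor(titanic_clean$Survived, labels = c("no", "yes"))
str(titanic_clean)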

berndbischl reopened this Dec 20, 2018

berndbischl commented Dec 20, 2018

@QuayAu really don't use titanic for what we discussed yesterday. It seems like a bad choice.

@mllg isn't the posted issue at least relevant?
What I mean is: mlr3 shouldn't fail on something like this?


mllg commented Dec 20, 2018

rpart breaks; there is nothing I can do about it. You need to merge levels or use stratification.
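
For illustration, merging rare levels could look like this (a base-R sketch, not an mlr3 feature; merge_rare_levels and min_count are made-up names):

# Collapse every level observed fewer than `min_count` times
# into a shared ".other" level.
merge_rare_levels = function(x, min_count = 10) {
  counts = table(x)
  rare = names(counts)[counts < min_count]
  levels(x)[levels(x) %in% rare] = ".other"
  x
}

cabin = factor(titanic::titanic_train$Cabin)  # assumes the CRAN titanic package
table(merge_rare_levels(cabin, min_count = 5))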


berndbischl commented Dec 20, 2018

> rpart breaks; there is nothing I can do about it. You need to merge levels or use stratification.

Bah, I hate this issue. Does anyone know how other toolboxes handle this?

Is it possible to have a "fallback"? That would only work if the fallback applies only to the observations where we break, which would imply testing them all one by one. That is totally infeasible, as it is far too expensive.


berndbischl commented Dec 20, 2018

Well, we DO know which levels a learner has seen during training. So we DO know on which observations it WILL break, even without calling it?
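
That check can indeed be done without any predict() call; a sketch of the vectorized version (rows_with_unseen_levels is a made-up helper, not mlr3 API, and it assumes test has the same columns as train):

# Flag every test row that carries a factor level absent from the
# training split, in one vectorized pass over the factor columns.
rows_with_unseen_levels = function(train, test) {
  bad = rep(FALSE, nrow(test))
  for (col in names(Filter(is.factor, train))) {
    seen = unique(as.character(train[[col]]))
    bad = bad | !(as.character(test[[col]]) %in% seen)
  }
  which(bad)
}

The rows returned here are exactly the candidates for a fallback prediction, so the expensive one-by-one testing from the previous comment would not be needed.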

berndbischl commented Dec 20, 2018

In any case, this does not seem like an mlr3 issue, and unless somebody posts otherwise, I don't see a simple solution here. I will try to think about this further in pipelines.


berndbischl commented Dec 20, 2018

I tried to outline a solution in this issue:

mlr-org/mlr3pipelines#71
