Skip to content

beedotkiran/randomforestpruning-ismm-2017

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Cost-Complexity pruning of Random Forests ISMM 2017

  • Paper, Poster,
  • Only Classification tasks have been evaluated. Overview

Datasets

  • Download winequality dataset, and other datasets and change utils.py to add new datasets to test out the pruning algorithm.

Todo

  • Calculate the test leaves id at the same time as train leaves id
    • Then predict with optimal leaf labeling

New references

  • Impact of subsampling and pruning on random forests Paper
  • Understanding variable importances in forests of randomized trees Paper