Skip to content

zl376/class_imb

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Comparison of Decision Tree, Random Foreset and Random Forest with undersampled bootstrap for unbalanced data

Note:

This example shows class imbalance of ~200 ( dominant y=0 ).

  • Precision is always low
  • Using 'class_weight' in Random Forest seems to perform worse with more trees
  • Instead, explicit under-sampling dominant class (y=0) before bootstrap works for Random Forest with more trees