Assignment from Arya.ai
Train Dataset Shape -> (3909,58)
Train Dataset Shape -> (690,58)
Dataset: Sparse and High Dimensional
- Used RandomForest Classifier for feature selection.
- Selected top 30 features with respect to their feature importance.
- For metric I have considered Binary CrossEntropy and AUC score.
- The best model I get is Xgboost.