- Random Forest
- Multinomial Logistic Regression
- Classified 42 diseases using Random Forest
- Acquired Feature Importance table and performed Feature Selection by selecting features above 75 percentile importance
- Used "New_Training" and "New_Testing" datasets to perform Multinomial Logistic Regression
- EDA using a series of Visualizations showing :
- Missing Values
- Correlation Matrix
- Symptom frequencies ... and more!
- Random Forest : 97.619%
- Multinomial Logistic Regression : 92.8571%