Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Related Issues or bug
-There was an issue related to class imbalance in the employee churn dataset, which could lead to biased model predictions, especially towards the majority class. This was addressed through stratified sampling during train-test splits and additional analysis of model performance using metrics like recall and F1-score.
Fixes: #246
Proposed Changes
Added data preprocessing steps to handle categorical variables by encoding them into numerical features.
Implemented class balance visualization using Yellowbrick to detect class imbalance and influence our sampling strategy.
Built and trained a Decision Tree Classifier and Random Forest Classifier with interactive controls for hyperparameter tuning using the interact function.
Evaluated models using training and validation accuracy, alongside feature importance plots to identify key factors influencing employee churn.
Additional Info