-
Notifications
You must be signed in to change notification settings - Fork 544
MNT make a dataset containing no missing values #425
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
LGTM but we should not merge this PR prior to the v1.0 debrief meeting (early September). |
Note to our future selves: probably the appendix section about ames_housing dataset would need to change as well. |
Is it worth keeping Q9 in the linear models wrap-up quiz
after having removed any discussion on missing values? In any case sounds like a question more suitable for M1 |
Nop we could remove it. I did not remove it yet because it would change the order of the question numbering but it is good to add it in the TODO if we merge this |
As we are keeping both house_prices.csv and ames_housing_no_missing.csv, do you think we should create separate notebooks to analyze both versions? Having a notebook for each dataset would come handy if one day we create a lesson about Imputers. We could also just add a message at the end of datasets_ames_housing.py saying/showing how we created ames_housing_no_missing.csv from house_prices.csv. What do you think? |
+1 for a single notebook with the message you suggest. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, just a few minor suggestions:
Co-authored-by: Olivier Grisel <[email protected]>
Co-authored-by: Olivier Grisel <[email protected]>
Merged! |
Co-authored-by: ArturoAmorQ <[email protected]> Co-authored-by: ArturoAmor <[email protected]> Co-authored-by: Olivier Grisel <[email protected]> ba81cd3
Addresses point 1. of #361 (comment)
I will make a PR in
gitlab
regarding the simplification of the wrap-up quiz.Here is the code to remove the missing values:
TODO:
python_scripts/datasets_ames_housing.py