Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Log loss checking #43

Open
gngdb opened this issue Mar 2, 2015 · 2 comments
Open

Log loss checking #43

gngdb opened this issue Mar 2, 2015 · 2 comments
Assignees
Labels
Milestone

Comments

@gngdb
Copy link
Member

gngdb commented Mar 2, 2015

We need to find which classes we're worst at on the validation set (specifically not the test set). To do this we need to be able to visualise well (in an IPython notebook probably) for a given set of predictions on the test set (could save these in pickle or csv and load in for code that is agnostic to model). In the same notebook probably worth having Hinton diagrams for confusion matrices.

The idea with this is that we should be able to look at these difficult classes and work on some feature engineering (in the training set) to patch our model and slightly improve our score.

@gngdb gngdb added this to the Visualisation milestone Mar 2, 2015
@gngdb gngdb added the ready label Mar 2, 2015
@gngdb
Copy link
Member Author

gngdb commented Mar 2, 2015

In this notebook would also want to see a distribution of log loss over classes for the validation set.

@gngdb gngdb self-assigned this Mar 2, 2015
@gngdb gngdb added in progress and removed ready labels Mar 2, 2015
@gngdb
Copy link
Member Author

gngdb commented Mar 8, 2015

Most of this can be found in the notebook called Validation scoring.py. But, it isn't really an exhaustive analysis yet, just looking at some examples where the network scores badly. Needs more work to come up with some kind of boosted addition to our model.

@gngdb gngdb added wontfix and removed in progress labels Mar 14, 2015
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

1 participant