-
Notifications
You must be signed in to change notification settings - Fork 118
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Missing Training Script? #3
Comments
Hey, thanks for the comment. I do have all of that code but unfortunately it's a bit scattered and not really in good shape to be uploaded to the repo. If anyone else is interested in seeing this, please comment on this issue! I'll clean up my code and upload it if a few people want to see it. |
Welp, it's something I'd be interested in playing with, potentially contributing towards, if you do ever get around to sharing it. :) |
@vzhou842, thank you for your awesome job, it's really admirable! |
@alexandrduduka this library is based on scikit-learn's LinearSVC class, so I'd recommend playing with that if you want to reproduce something similar. As far as improving precision, there are lots of ways to do that (all of which would come at the cost of speed). That's too big of a question for me to answer concisely, but basically you'd have to use more complex / powerful models and possible use better / more data preprocessing. |
@vzhou842 Thank you for the model. Would like to see the script for training and benchmarking you have presented. |
Interested. |
@vzhou842 I would like to see the script for train model. |
I am interested too in seeing the script. |
Can we translate this dataset for other language? E.x. Japanese |
I would love to see the training code please :) |
I definitely want to see it. |
Yes. This would be helpful. Would like to train the model against my own abusive words. |
Definitely will be helpful. |
Hey, I read your blogpost about
profanity-check
, so I've seen the code there.. but I'm wondering whether you have a file separately to that for training? And/or one for validation or "benchmarking"?If so, I'd love to see those in the repo. :)
The text was updated successfully, but these errors were encountered: