-
Notifications
You must be signed in to change notification settings - Fork 64
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Working with all lowercase dataset #32
Comments
Hi @Protossnam and thanks! That's a very good point, I will prepare some uncased models for NER (with the uncased embedding data). |
@kermitt2 Hi there, friend! I've tried to configure and use another glove uncased word embedding - the Edit: I added 3 more train/dev/test files |
@kermitt2 Oh, I didn't config my |
@Protossnam results are excellent, great ! I was not expecting such good results with lowercase. I've only downloaded the uncased embeddings, but I have not yet started to do some training or to adapt the code. I was thinking simply adding a parameter There is no ELMo uncased model unfortunately, so it will not be possible to exploit it for caseless text I think. However, there are BERT uncased models - I am adding the support of BERT currently and the base model is already quite good. Thanks a lot for the news and the 3 train/dev/test files. |
Thanks for the wonderful work here!
I have some text files and want to extract NE from them by running
nerTagger.py
. However, my files contain all lowercase characters and of course, I can't get any NE result.For instance:
Output:
Output:
Expected:
Therefore, should we develop a caseless NER model?
The text was updated successfully, but these errors were encountered: