-
Notifications
You must be signed in to change notification settings - Fork 15
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Kinda failures #16
Comments
@everythingability i am surprised that 2000 BCE does not work - could you confirm that. On other items I don't flexidate supports a lot of those yet. Not hard to add them so if you are interested in a PR please go for it ... |
@rufuspollock Yeah, not sure how that happened but it did/may have :-) I can't reproduce it in the Terminal, all seems OK. Not sure how to, or I can contribute, I'm no great coder and even worse with git :-) Attached is a whole heap of "challenging dates", many of which like "2 days after" can't be parsed without a contextual date, or "two thousand years" which definitely shouldn't be <2000 AD>. But anyway, I've attached it hoping it may help you or someone. Just a thought, but years ago I used a chatbot tool called Alicebot that read AIML files, which were essentially just long lists of search_and_replace items, that aimed to get the input down to something more managable... or parseable. I think that that approach might work well (But don't know). Keep up the good work. |
Oh, it doesn't seem to have worked. https://docs.google.com/document/d/1jdbn_m4viKbKgpckkUTwChkC_RuPwPjiMWAZ5HDDmtY/edit?usp=sharing |
Hello, I'm parsing an academic book (with Flair NER) that contains "dates" such as:
and ideally want a "close enough for jazz" year (or not) from a parse... but many of the dates like these fail... even ones that seem pretty easy to get like "2000 BCE".
I even tried...
theDate = theDate.replace("BCE", "BC")
theDate = theDate.replace("th century", "00")
theDate = theDate.replace("rd century", "00")
theDate = theDate.replace("st century", "00")
theDate = theDate.replace("nd century", "00")
..to help it out a bit... but it also fails on items like "the fourteenth century" .
Thanks for the work though. I found a python 2.7 module called dataparse that might be worth a peek? Couldn't install it through.
Tom
The text was updated successfully, but these errors were encountered: