Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Benefit from OpenNLP's new UD models #14188

Open
msfroh opened this issue Feb 1, 2025 · 0 comments · May be fixed by #14194
Open

Benefit from OpenNLP's new UD models #14188

msfroh opened this issue Feb 1, 2025 · 0 comments · May be fixed by #14194
Assignees

Comments

@msfroh
Copy link
Contributor

msfroh commented Feb 1, 2025

Description

We recently upgraded Lucene's dependency on OpenNLP to 2.5. This upgrade offers a new part-of-speech tagging model that works across more languages. The update maintained backward compatibility with the old Penn model by hardcoding it in the token filter.

We should expose the UD model as an option.

I'd like to work on this, so please assign it to me.

@mawiesne, you're the OpenNLP expert, please let me know about potential pitfalls. I don't know OpenNLP, but this seems fun.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant