We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
根据nougat官方库的faq: https://github.com/facebookresearch/nougat#faq Chinese, Russian, Japanese etc. will not work. nougat完全不支持中文的,为何其抽取率会有100%?数据集的饼图显示中文数据的占比是超过50%。
Chinese, Russian, Japanese etc. will not work.
同时nogout几乎只在处理英文论文时会起作用,超出这个范围几乎只会返回[MISSING_PAGE],合理质疑有关nogout的数据真实性。
[MISSING_PAGE]
以及根据marker的测试数据 https://github.com/VikParuchuri/marker#benchmarks 在各个场景marker应当是明显优于nogout的,但为何在这个评测中没体现出来?
The text was updated successfully, but these errors were encountered:
非常感谢您的关注~
Sorry, something went wrong.
感谢您的关注和反馈。
当前抽取率指标的定义是:成功生成 Markdown 文件的数量与总PDF文件数的占比,但该指标不会检查生成 Markdown 的内容是否异常。详细的指标的定义请参考README中“指标”章节介绍。
欢迎推荐更多样化的评估指标,我们会考虑在后续的评测版本中添加。
No branches or pull requests
根据nougat官方库的faq:
https://github.com/facebookresearch/nougat#faq
Chinese, Russian, Japanese etc. will not work.
nougat完全不支持中文的,为何其抽取率会有100%?数据集的饼图显示中文数据的占比是超过50%。
同时nogout几乎只在处理英文论文时会起作用,超出这个范围几乎只会返回
[MISSING_PAGE]
,合理质疑有关nogout的数据真实性。以及根据marker的测试数据
https://github.com/VikParuchuri/marker#benchmarks
在各个场景marker应当是明显优于nogout的,但为何在这个评测中没体现出来?
The text was updated successfully, but these errors were encountered: