-
Notifications
You must be signed in to change notification settings - Fork 45
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
数据集相关问题 #42
Comments
请问您的测试集在哪呢,我在huggingface上面看见了openbmb/VisRAG-Ret-Test-ArxivQA,但是为什么是parquet的格式呢,您提供的 def load_beir_qrels(qrels_file): corpus_ds = load_dataset("openbmb/VisRAG-Ret-Test-ArxivQA", name="corpus", split="train") qrels_path = "xxxx" # path to qrels file which can be found under qrels folder in the repo. |
每个测试集分三个部分:corpus(文档),queries(查询),以及qrels,即查询和文档之间的相关关系。通过示例代码可以访问这三部分 |
实在抱歉,我们在生成训练和测试数据的过程中并没有保存PDF原件~ |
请问您提供的训练集和测试集是否有原始的PDF原件,能否提供一下?
The text was updated successfully, but these errors were encountered: