Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Missing files in dataset/nq320k.zip? #8

Open
jiaqizhai opened this issue Feb 26, 2024 · 1 comment
Open

Missing files in dataset/nq320k.zip? #8

jiaqizhai opened this issue Feb 26, 2024 · 1 comment

Comments

@jiaqizhai
Copy link

Hi,

Thanks for sharing the code! I noticed that dpr.py and baseline.py refer to a few files that are missing from the zip file, specifically

data/new_nq320k/id.newtitle.json
out/code-002/nq320k.title
dataset/nq320k/qg.json (is this just train.json.qg.json)

Any chance you can provide those files as well?

@sunnweiwei
Copy link
Owner

Hi,

dataset/nq320k/qg.json is train.json.qg.json, sorry for the inconsistent naming.

data/new_nq320k/id.newtitle.json is the title of each doc, and I have uploaded here:
id.newtitle.json

out/code-002/nq320k.title is also the title data.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants