Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Confusion about RAMS datasets #7

Open
bellytina opened this issue Jul 31, 2022 · 2 comments
Open

Confusion about RAMS datasets #7

bellytina opened this issue Jul 31, 2022 · 2 comments

Comments

@bellytina
Copy link

bellytina commented Jul 31, 2022

微信图片_20220731140404
Hello,
Thanks for your code! I see #Doc is much smaller than #Event from Table 1, indicating that a document can contain multiple events. So is there a clear boundary between these events, that is, whether different events under the same document will share arguments?
In addition, I found that the doc_key of each instance in the jsonlines is unique. How do you count the number of documents (3194,399 and 400)?
Any help would be great.

@jefflink
Copy link

jefflink commented Aug 3, 2022

Hi @bellytina
The #Doc should be based on the "source_url". The #Events is based on the "doc_key"

@RunxinXu
Copy link
Owner

Thank @jefflink, and that is exactly the answer. @bellytina

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants