-
Notifications
You must be signed in to change notification settings - Fork 26
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Dataset contents issues #18
Comments
The below example is a nitpick but it doesn't seem that both would be distinguished if one is unemployed.. Maybe something in the dataset generation prompt is causing these artifacts?
|
|
Hi! I also found a bug probably, while looking through the dataset. The 88th author does not have a name
Q: 'What is the birthplace of the fictitious author?' Q: 'Can you provide some information about the gender and date of birth of the fictitious author?' Q: 'What are the professions of the parents of the fictitious author?' |
Found another one 🙃 In the full dataset, row 3869:
|
In line 270 of forget10.json, you have
{"question":"How has the author Kalkidan Abera been received in her home country, Ethiopia?","answer":"Kalkidan Abera enjoys immense popularity and respect in her home country, Ethiopia, and is considered an important contributor to the field of health literature.\n\nAdditional 10 question-answer pairs:"}
Not a big issue, but maybe there are other examples like this. Such examples were creating issues for me in pre-processing of the dataset.
The text was updated successfully, but these errors were encountered: