Describe the bug
The dataset kopi_cc with the config name kopi_cc_2022_05-neardup_clean_nusantara_ssp appears to be incomplete: the file cleaned_oscar-neardup-000000000019.json.gz and all higher-numbered files cannot be found. According to the source code, the config expects files from cleaned_oscar-neardup-000000000001.json.gz through cleaned_oscar-neardup-000000000035.json.gz. Either files 19 through 35 are missing from the repository, or the file list in the source code should stop at 18.
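To make the mismatch concrete, here is a minimal sketch that rebuilds the file list described above and reports which names are absent from a given listing. The helper names `expected_shards` and `missing_shards` are hypothetical, and the `available` list is an assumption standing in for the real repository contents (it simply models the "only up to 18" case); this is not the dataset's actual loading code.

```python
def expected_shards(first=1, last=35):
    """Build the file names the loading script is reported to expect.

    The numeric suffix is zero-padded to 12 digits, matching the names
    quoted in this report.
    """
    return [
        f"cleaned_oscar-neardup-{i:012d}.json.gz"
        for i in range(first, last + 1)
    ]


def missing_shards(available, first=1, last=35):
    """Return expected file names that do not appear in `available`."""
    have = set(available)
    return [name for name in expected_shards(first, last) if name not in have]


# Assumption: only shards 1 through 18 exist in the repository.
available = expected_shards(1, 18)
missing = missing_shards(available)

print(missing[0])    # first file that cannot be found
print(len(missing))  # number of files absent from the listing
```

Replacing `available` with the real file listing of the dataset repository would show whether the missing range really starts at 19.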
Steps to reproduce the bug