Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

'None' values in the dataset preprocessing when reproducing the result on GSM8k #3

Open
kulinshah98 opened this issue Dec 15, 2024 · 0 comments

Comments

@kulinshah98
Copy link

Hi,

Thanks for open sourcing the code! I am trying to reproduce the results on Math (GSM8k) benchmark given in Table 1. I am using the yaml file given at the location "LLaMA-Factory/examples/train_full/gpt2_full_ddm-sft.yaml" and given instruction to run the code. However, I am facing the issue of having "None" values in the input data. To be specific, the batch['input_ids'] in the "LLaMA-Factory/src/llamafactory/train/ddm/workflow.py" seems to have None values between prompt and response (after question and before answer of the GSM8k). Because of this None value, I am getting the error "RuntimeError: Could not infer dtype of NoneType" during the start of the training. Can you help me fix this issue? Thank you in advance!

Best,
Kulin

@kulinshah98 kulinshah98 changed the title 'None' values in the dataset when reproducing the result on GSM8k 'None' values in the dataset preprocessing when reproducing the result on GSM8k Dec 15, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant