'None' values in the dataset preprocessing when reproducing the result on GSM8k #3

kulinshah98 · 2024-12-15T21:11:03Z

Hi,

Thanks for open-sourcing the code! I am trying to reproduce the results on the math (GSM8k) benchmark reported in Table 1. I am using the YAML file at "LLaMA-Factory/examples/train_full/gpt2_full_ddm-sft.yaml" and following the given instructions to run the code. However, the input data contains "None" values. Specifically, batch['input_ids'] in "LLaMA-Factory/src/llamafactory/train/ddm/workflow.py" seems to contain None values between the prompt and the response (after the question and before the answer of the GSM8k example). Because of these None values, training fails at startup with the error "RuntimeError: Could not infer dtype of NoneType". Can you help me fix this issue? Thank you in advance!
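A small check like the one below could help pin down where the None entries sit before the batch is converted to tensors. It is only a sketch under stated assumptions: `find_none_positions` and `check_batch` are hypothetical helpers, not part of LLaMA-Factory, the toy batch uses made-up token ids, and it assumes batch['input_ids'] is still a list of Python lists at the point where it is inspected.

```python
from typing import Dict, List, Optional, Sequence


def find_none_positions(input_ids: Sequence[Optional[int]]) -> List[int]:
    """Return the indices at which a token-id sequence holds None."""
    return [i for i, tok in enumerate(input_ids) if tok is None]


def check_batch(batch_input_ids: Sequence[Sequence[Optional[int]]]) -> Dict[int, List[int]]:
    """Map each example index to its None positions.

    Examples that contain no None values are left out of the report.
    """
    report: Dict[int, List[int]] = {}
    for row, ids in enumerate(batch_input_ids):
        bad = find_none_positions(ids)
        if bad:
            report[row] = bad
    return report


# Toy batch with made-up ids: the first example has a None between
# the "prompt" part (10, 11, 12) and the "response" part (20, 21).
toy_batch = [[10, 11, 12, None, 20, 21], [30, 31, 32]]
print(check_batch(toy_batch))
```

If the reported positions always fall exactly at the prompt/response boundary, one thing that may be worth checking (an assumption, not something confirmed here) is whether a special token id used as a separator in preprocessing is None for the GPT-2 tokenizer.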

Best,
Kulin

kulinshah98 changed the title ~~'None' values in the dataset when reproducing the result on GSM8k~~ 'None' values in the dataset preprocessing when reproducing the result on GSM8k Dec 15, 2024
