Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Hellaswag result not reproducing #3

Open
shash42 opened this issue Nov 12, 2024 · 0 comments
Open

Hellaswag result not reproducing #3

shash42 opened this issue Nov 12, 2024 · 0 comments

Comments

@shash42
Copy link

shash42 commented Nov 12, 2024

Hi,

Thanks a lot for the codebase! It works quite smoothly and I was able to reproduce results across many datasets using the default sft_config provided, but not on hellaswag. I've attached my plot (weak_ft, strong_ft are floor and ceil respectively).

Specifically, I'm simply running python run.py --dataset="$arg_1" using the default Qwen-1.5-0.5b and Llama-3-8b models as the weak, strong pair. Are the configurations used to produce the main weak to strong plot different (specifically across datasets) than the ones provided by default in sft_config.py. If so, could you please provide detailed config files for reproduction, especially if there are any changes needed for Hellaswag?

Thanks!
image

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant