Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
With @yulinggu-cs, added a version of the MMLU datasets with multiple prompt variants, to make the datasets effectively 7x larger. These are named
mmlu_stem_var
etc (add_var
suffix).The prompts are called:
[None, "inst", "inst+1", "inst+2", "inst+3", "inst+4", "inst+5"]
where "inst+2" means add instruction line followed by 2-shot example. We also added logging of first few examples for each downstream task (can remove this if it's annoying, but it's a useful sanity check), here's an example for the "inst+2" prompt:
Things to note:
update
usesbatch["ctx_len"]
which might be larger than 2048. I tried to fix this in f64b9ce, but it's a bit iffy (the context length is separately processed inprep_examples
and incollate_fn
which could be confusingself.log_instances -= 1
might not be very robust? If this is too noisy in the logging, we can comment these lines out.