Models like llama-3 use a chat template where the expected stop sequence is `<|eot_id|>` instead of the common EOS token used in other models. This means that generative benchmarks like `ifeval` continue generating past the end of the model's turn and give incorrect results.
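For context, recent versions of transformers accept a list of token ids for `eos_token_id`, so both terminators can be passed to `generate`. A minimal sketch of that workaround (the model id and prompt are illustrative, and this is not necessarily the harness's actual generation path):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "meta-llama/Meta-Llama-3-8B-Instruct"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

messages = [{"role": "user", "content": "Write one sentence about llamas."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
)

# Stop on either the base EOS token or the chat-turn terminator.
terminators = [
    tokenizer.eos_token_id,
    tokenizer.convert_tokens_to_ids("<|eot_id|>"),
]
outputs = model.generate(inputs, max_new_tokens=64, eos_token_id=terminators)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```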
This could be handled either by including this special token as a default stop sequence in the generative benchmarks, or by exposing `--stop_sequence` as an argument in the main script that users can control.
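If the flag route is taken, a rough sketch of what it could look like (the flag name comes from this proposal; the plumbing around it is hypothetical):

```python
import argparse

parser = argparse.ArgumentParser()
# Proposed flag: one or more extra stop sequences supplied by the user,
# e.g. --stop_sequence "<|eot_id|>" "</s>"
parser.add_argument(
    "--stop_sequence",
    nargs="+",
    default=[],
    help="Extra stop sequences appended to each task's defaults.",
)
args = parser.parse_args()

# Hypothetical merge step: combine user-supplied stops with a task's
# built-in defaults, de-duplicating while preserving order.
task_defaults = ["\n\n"]  # placeholder, not the real benchmark defaults
stop_sequences = list(dict.fromkeys(task_defaults + args.stop_sequence))
```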
Hi @lewtun, just to clarify: is `<|eot_id|>` a stop token specific to the chat template?
And is the model using `<|end_of_text|>` for everything else?
Another question: do you think fixing #16 would be enough to solve this too?
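For what it's worth, one quick way to check is to inspect the tokenizer directly (a sketch, not an authoritative answer; the instruct checkpoint id is an assumption):

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("meta-llama/Meta-Llama-3-8B-Instruct")

# What the tokenizer reports as its EOS token.
print(tokenizer.eos_token)

# What the chat template actually appends at the end of a turn.
rendered = tokenizer.apply_chat_template(
    [{"role": "user", "content": "hi"},
     {"role": "assistant", "content": "hello"}],
    tokenize=False,
)
print(rendered)
```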