
Questions on Mistral 7B with CoT #102

Open
twni2016 opened this issue Oct 29, 2024 · 1 comment

Comments

@twni2016

Hi authors,

Thank you for your nice library! I am trying to use it to run Mistral 7B with CoT on gsm8k. I have several questions about the code when using HFModel:

  • Which Mistral 7B model was used in your paper?
  • I tried mistralai/Mistral-7B-v0.3 and found that the eos_token_id is no longer 13 (see the tokenizer check after this list). Based on this, I think using eos_token_id = ["\n\n", ".\n", "\n", ".\n\n"] as in Llama3Model would be better?
  • Is it a typo that max_batch_size=batch_size is not passed when calling HFModel?
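
For reference, here is the standalone check I used to compare the token ids between the two Mistral releases. This uses only Hugging Face transformers and is not part of this library; it just prints the model-level EOS token and how a plain newline is encoded.

```python
# Standalone check of the EOS and newline token ids for both Mistral releases.
# Requires the `transformers` package (and, depending on the checkpoint,
# Hugging Face Hub access to the mistralai models).
from transformers import AutoTokenizer

for name in ["mistralai/Mistral-7B-v0.1", "mistralai/Mistral-7B-v0.3"]:
    tok = AutoTokenizer.from_pretrained(name)
    print(name)
    print("  eos_token:", tok.eos_token, "-> id", tok.eos_token_id)
    print("  '\\n' encodes to:", tok.encode("\n", add_special_tokens=False))
```

If the newline no longer maps to the single fixed id 13, matching on stop strings (as Llama3Model does) seems more robust than hard-coding eos_token_id=13.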

Thank you for your time!
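
P.S. To make the second and third points concrete, here is a rough sketch of the call I have in mind. This is only a sketch: the constructor and generate arguments (model path, tokenizer path, max_batch_size, eos_token_id as a list of stop strings) follow the names used above and in Llama3Model, not a verified HFModel signature, and the prompt is a placeholder.

```python
# Hypothetical sketch only; argument names follow this issue and Llama3Model
# and may not match the actual HFModel API.
from reasoners.lm import HFModel

batch_size = 4
model = HFModel(
    "mistralai/Mistral-7B-v0.3",   # model path
    "mistralai/Mistral-7B-v0.3",   # tokenizer path
    max_batch_size=batch_size,     # the argument that currently seems not to be forwarded
)
outputs = model.generate(
    ["Q: <a GSM8K question> A: Let's think step by step."],
    eos_token_id=["\n\n", ".\n", "\n", ".\n\n"],  # stop on newline patterns instead of a fixed token id
)
```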

@Ber666
Collaborator

Ber666 commented Nov 12, 2024

Hi! Thanks for your questions.

It was v0.1; see the command line here. Sorry, the library was refactored and that information was lost along the way.

We will fix the other issues you mentioned.
