You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Traceback (most recent call last):
File "<stdin>", line 1, in<module>
File "/Users/katossky/Projets/leximpact-rapport/.venv/lib/python3.12/site-packages/outlines/generate/api.py", line 512, in __call__
return self._format(completions)
^^^^^^^^^^^^^^^^^^^^^^^^^
File "/Users/katossky/Projets/leximpact-rapport/.venv/lib/python3.12/site-packages/outlines/generate/api.py", line 488, in _format
return self.format_sequence(sequences)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/Users/katossky/Projets/leximpact-rapport/.venv/lib/python3.12/site-packages/outlines/fsm/types.py", line 45, in float_format_fn
return float(sequence)
^^^^^^^^^^^^^^^
ValueError: could not convert string to float: '99.3926785111 3 002 16000000000000'
It defeats the purpose of getting expected coercible outputs out of outlines. I have nothing to gain from google/mt5-large specifically (I am comparing multilingual LLMs for a scholar thing) but it looks like the same happens for all the T5 and mT5 family, indicating that there might be a larger problem (?).
The text was updated successfully, but these errors were encountered:
katossky
changed the title
space allowed in generation with float constraint resulting in convertion failure
space in generated text despite of float-format constraint results in convert-to-float failure
Sep 27, 2024
Don't know if it is related but with meta-llama/Meta-Llama-3.1-8B-Instruct (lmstudio-community/Meta-Llama-3.1-8B-Instruct-GGUF / Meta-Llama-3.1-8B-Instruct-Q8_0.gguf) I even have word tokens generated.
ValueError: could not convert string to float: '581 Volunteer Volunteer Volunteer Volunteer Volunteer Volunteer Volunteer Volunteer'
Describe the issue as clearly as possible:
With
google/mt5-large
and thefloat
constraint, spaces are generated, and the conversion to float fails.Steps/code to reproduce the bug:
Expected result:
Error message:
Outlines/Python version information:
Version information
Context for the issue:
It defeats the purpose of getting expected coercible outputs out of
outlines
. I have nothing to gain fromgoogle/mt5-large
specifically (I am comparing multilingual LLMs for a scholar thing) but it looks like the same happens for all the T5 and mT5 family, indicating that there might be a larger problem (?).The text was updated successfully, but these errors were encountered: