Benchmarking model performance with and without Outlines #336
MayankAgarwal
started this conversation in
General
Replies: 1 comment 7 replies
-
Not such dataset as far as I know. Do you have examples of such failures? |
Beta Was this translation helpful? Give feedback.
7 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Are there benchmark datasets we can try to assess model performance with and without Outlines, for example: a dataset that either has JSON as output or for whom the output can be modeled as a relatively complex JSON?
The reason I am asking is that while it conceptually makes sense to do constrained generation and how it would help the model generate valid output, sometimes there is likelihood misalignment or likelihood collapse resulting in degenerate output, such as empty strings or spaces in JSON until the maximum tokens are generated. In my opinion, having benchmark performances with and without constraints will provide better insights into the gains Outlines provides.
Looking for everyone's thoughts on this topic and if there are datasets we can start with. Thanks!
Beta Was this translation helpful? Give feedback.
All reactions