Fine-Tuning Chat Model with Domain-Specific Data for Structured Outputs #6374
anantgupta129
started this conversation in
General
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I have domain-specific data that I've preprocessed from PDFs into a
c4_demo.json
format, where each entry looks like[{"text": "<data from file1.pdf"}]
. Now, I want to fine-tune my model for chat-based interactions to generate structured chat outputs. My dataset is currently in the following format:[{"role": "system", "content": ""}, {"role": "user", "content": ""}, {"role": "assistant", "content": ""}, ...]
. Which fine-tuning method should I use for this?Beta Was this translation helpful? Give feedback.
All reactions