-
-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Use LLM to revise back translations #616
Comments
What do you mean by "rearrange back translations"? |
I think John was referring to this slack conversation: https://sil-language-software.slack.com/archives/C8022MJ7M/p1735864021033209. |
It probably makes sense to move this to silnlp, since it is a research topic. |
In case it's helpful here's the notebook I used to clean up a back translation using a very simple prompt with GPT-4o-mini: https://github.com/sil-ai/madlad-finetuning/blob/main/llm_BT_cleanup_async.ipynb Took about 20 minutes, and I think ~$1 cost. Example before and after texts: |
Use an LLM to revise a back translation to mimic the style of the source translation. The revised back translation might provide better training data for mixed source configurations.
The text was updated successfully, but these errors were encountered: