Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

German translation dataset very formal / in parts incorrect #120

Open
HAuser1234 opened this issue Apr 22, 2024 · 2 comments
Open

German translation dataset very formal / in parts incorrect #120

HAuser1234 opened this issue Apr 22, 2024 · 2 comments
Labels
bug Something isn't working

Comments

@HAuser1234
Copy link

HAuser1234 commented Apr 22, 2024

Describe the bug
The german translation is mostly good, but especially in data/piles/german/pile_of_specific_actions.csv you can find sentences like: „Schließen Sie die Jalousien im Wohnzimmer“. This would very formally beg the assistant to close the blinds, which probably most germans never do. Therefore the sentence should be „Schließe die Jalousien im Wohnzimmer“.
The problem will probably be that most sentences use this extremely formal language.

Also the responses have the same issue, but there ist is partly even more problematic as the responses sound like a command to the user to do a action instead of the assistant doing it for the user :)

example:
data/piles/german/pile_of_responses.csv
Schließen Sie die Jalousien wie gewünscht. -> should be „Ich schließe die Jalousien wie gewünscht“
or some grammatically completely incorrect sentences: Öffnen Sie die Jalousien für Sie. -> should be the same sentence as before.

This I will probably create issues for some users. If you want I probably can correct some of the errors, but it would be great if others could help too.

Edit: using deepL as the translator instead of google works way better, but needs a api key that is limited to 500000 free characters per month!

@HAuser1234 HAuser1234 added the bug Something isn't working label Apr 22, 2024
@acon96
Copy link
Owner

acon96 commented Apr 22, 2024

I don't know German so I can't really comment on any of this. Will explore other translation options (the OpenAI translator pointed at a locally hosted model seemed like a potentially un-throttled way) and will definitely accept manual translations as they are always better.

@HAuser1234
Copy link
Author

HAuser1234 commented Apr 22, 2024

Yes I tried ChatGPT as a translator and it works (nearly) perfect! When you release the next version that can on my older CPU then I can try the current state and report back. And when you decide to accept manual translations or review other ones let me know!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants