Negotiate testing requirements #37
Comments
https://github.com/browsermt/bergamot-translator-tests/blob/main/models/download-models.sh uses only the de-en model, which is downloaded as a tar.gz and unpacked. Aren't these archives (rather than the individual files in the current download scripts) expected to be the mode of delivery now for both translateLocally and bergamot-translator? The download scripts in this repo show individual files (https://github.com/browsermt/students/blob/master/csen/download-models.sh). If we're maintaining a model zoo (separate from Mozilla), the uploaded models/shortlists/vocabs/prefix-files should also be tested to work with the source in bergamot-translator. I'm guessing I will be reusing the preferred setup here, which is now binary files? bergamot-translator will have only one language, but it would be nice to have the Jenkins@lofn CI pull from here and test that all models are compatible with bergamot-translator using tiny inputs. The extra requirement for testing purposes is that we continue testing the non-binary code paths in bergamot-translator, so perhaps one archive (de-en) of the non-converted models/shortlists/vocabs/prefix-files. Linking a few relevant issues (#35, browsermt/bergamot-translator#104).
I think one language is sufficient for testing purposes. We should also keep compatibility with both binary and non-binary shortlists. In terms of what the end user needs, it's either the binary or the non-binary shortlist. Considering that the binary shortlist is much smaller, we should remove it from the archive:
This is what the archive was distributing as of Friday. Let's get rid of all the .bin shortlists except for one archive which would be used for testing? I can then update download-models.sh appropriately?
You mean get rid of all the .gz shortlists right?
No, I mean the bergamot-translator-tests one.
Sounds good.
Should be fixed; waiting for confirmation before I redeploy and close.
@jerinphilip let's see what you need for the test setup before we redeploy the latest-and-greatest models and break your tests.
Could you give me a list of the files you use, and tell me if you want anything changed?
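For reference, one way to produce such a file list from a model archive is sketched below. It assumes only that the archive is a tar.gz, as with the de-en model mentioned above; the function names `list_archive_files` and `group_by_suffix` are hypothetical helpers, not part of bergamot-translator or the students repo. Grouping by suffix makes it easy to see whether an archive ships `.bin` files, `.gz` files, or both.

```python
import tarfile
from collections import defaultdict


def list_archive_files(archive):
    """Return the sorted names of regular files in a .tar.gz archive.

    `archive` may be a filesystem path or an open binary file object.
    """
    if hasattr(archive, "read"):
        tar = tarfile.open(fileobj=archive, mode="r:gz")
    else:
        tar = tarfile.open(name=archive, mode="r:gz")
    with tar:
        return sorted(m.name for m in tar.getmembers() if m.isfile())


def group_by_suffix(names):
    """Group file names by their last extension (e.g. '.bin', '.gz')."""
    groups = defaultdict(list)
    for name in names:
        base = name.rsplit("/", 1)[-1]
        suffix = "." + base.rsplit(".", 1)[-1] if "." in base else ""
        groups[suffix].append(name)
    return dict(groups)
```

Calling `group_by_suffix(list_archive_files("some-archive.tar.gz"))` on a downloaded archive (the path here is a placeholder) would give a dictionary keyed by extension, which could be pasted into the thread as the requested list.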