Please help us define materials to be delivered #6
Replies: 3 comments
-
Thank you for opening this interesting discussion! I think when we talk about making a machine learning model open-source, the analogy to the source code in software is the full pipeline of generating the model, along with the model weights. Here's what I believe should be included: 1. Training Code and Environment:
2. Model Weights and Usage Instructions:
3. Repository for Information: |
Beta Was this translation helpful? Give feedback.
-
I think these people could contribute to this discussion: @julien-c, @williamFalcon, @jph00, @rasbt, @clmnt, @dongreenberg. |
Beta Was this translation helpful? Give feedback.
-
In particular, I'd say the definition is that source code should be made freely available and may be redistributed and modified by anyone. I.e., not only be shared, but shared freely. And modifiable to one's liking.
if we are talking about making "the model" open source, then the weights should also be made freely available etc. EDIT: Is it always feasible? Probably not. But in this case it then shouldn't be called open source. I honestly don't blame people if they can't make everything open-source. It's more false advertising if some people/companies put an open-source label on something that isn't. It's like saying "free unlimited internet" but then having a "500 Gb / month" data volume cap. |
Beta Was this translation helpful? Give feedback.
-
Thanks to everyone for the PRs so far!
I would particularly like input on the item in the definition that described what needs to be delivered, in order for a model to be open. In other words, what is the analog of source code, for open source software?
Also, is there a format in which it should be delivered?
Beta Was this translation helpful? Give feedback.
All reactions