diff --git a/docs/knowledge-submissions-past-wikipedia.md b/docs/knowledge-submissions-past-wikipedia.md
index 41da068c..fda6dfb4 100644
--- a/docs/knowledge-submissions-past-wikipedia.md
+++ b/docs/knowledge-submissions-past-wikipedia.md
@@ -24,27 +24,41 @@ Status:
 - `denied`: Denied by the legal team, and posted on the [avoided list][avoided].
 - `submitted`: Sent to the legal team for review
 - `proposed`: The community would like to propose this as a possible place to take knowledge submissions from.
+- `reviewed - manually verify`: Legal team has reviewed this domain and while much of its source material meets our open licensing criteria, not all of it does. Each submission from this source must be manually verified to actually be under an appropriate content license or e.g. definitively in the public domain.
+
+For the purposes of Knowledge submissions to the InstructLab project, data sourced from items in the `approved` category require no further vetting from the Triage and/or other Maintainer teams. Items in the `reviewed - manually verify` category will require vetting before the submission can be accepted.
+
+To ensure that the data you would like to include in your knowledge submission meets the project licensing criteria, please make sure to talk to the Taxonomy maintainer team *before* you begin work on your submission. We would hate for you to do a great deal of work only to be told that the data source you selected would not work for the project. Please make sure you review the [Getting Started with Knowledge Submissions](https://github.com/instructlab/taxonomy?tab=readme-ov-file#getting-started-with-knowledge-contributions) documentation prior to submitting your request.
 
 | Domain name | Status | Notes |
 | :-- | :-- | :-- |
-| | approved | |
+| Wikipedia: | approved | |
 | Project Gutenberg: | approved | Pre-1927 works; public domain under US copyright law |
-| | proposed | |
-| | proposed | |
-| | proposed | |
-| | proposed | |
-| NASA: | proposed | See guidelines: |
-| Smithsonian Libraries: | proposed | For any material marked \"No Copyright - United States" or "CC0" as described here: |
-| European Union (EU): | proposed | Specifically documents submitted under "public registrars": |
-| Internet Archive: | proposed | Pre-1927 works; public domain under US copyright law |
-| Wikisource (library): | proposed | "free library that anyone can improve" |
-
-### Next steps
-
-1. We have to find the correct legal person to find a way to be the correct point person for this project.
-1. Collect suggested places from the community and add them to the above table
-1. Work with our legal team to get approvals and denials.
-1. Inform the triage team and triagers of the new locations we can or can not accept.
+| Wikisource (library): | approved | "free library that anyone can improve" |
+| OpenStax textbooks family of publications | approved | |
+| The Open Organization publications | approved | |
+| The Scrum Guide | approved | |
+| | reviewed - manually verify | |
+| | reviewed - manually verify | |
+| | reviewed - manually verify | |
+| | reviewed - manually verify| |
+| NASA: | reviewed - manually verify | See guidelines: |
+| Smithsonian Libraries: | reviewed - manually verify | For any material marked \"No Copyright - United States" or "CC0" as described here: |
+| European Union (EU): | reviewed - manually verify | Specifically documents submitted under "public registrars": |
+| Internet Archive: | reviewed - manually verify | Pre-1927 works; public domain under US copyright law |
+| PLOS family of open access journals: | reviewed - manually verify | |
+| Open Practice Library: | reviewed - manually verify | |
+| Cynefin.io wiki: | reviewed - manually verify | |
+| The Open Education Project: | reviewed - manually verify | |
+
+### Process steps
+
+1. Collect suggested places from the community by requesting they submit a pull request against this dev doc.
+1. Work with our legal team to adjudicate. [@lhawthorn](https://github.com/lhawthorn) is currently the owner of this step, but is happy to educate & empower other folks to do this work.
+1. Inform the triage team and triagers of the new locations we can or can not accept. This is currently done via an announcement in the [daily Triager Standup](https://github.com/instructlab/community/blob/main/Collaboration.md#triager-standup) and via a pull request to update the Knowledge Guide in one of the two locations listed below.
+
+- Approved sources:
+- Rejected sources:
 
 [approved]: https://github.com/instructlab/taxonomy/blob/main/docs/KNOWLEDGE_GUIDE.md#accepted-knowledge
 [avoided]: https://github.com/instructlab/taxonomy/blob/main/docs/KNOWLEDGE_GUIDE.md#avoid-these-topics