Materials that accompany the study "Augmented Datasheets for Speech Datasets and Ethical Decision-Making", accepted at FAccT'23.
The repository contains the following files:
- The augmented datasheet templates in .docx and .tex formats (with options to include the original Datasheets for Datasets questions, or to answer our speech-specific questions only).
- Example datasheets for five datasets: LibriSpeech, Common Voice, WHAM!, VoxPopuli, and CORAAL.
- The lists of datasets and studies used in our literature review.
Please cite: Orestis Papakyriakopoulos, Anna Seo Gyeong Choi, Jerone Andrews, Rebecca Bourke, William Thong, Dora Zhao, Alice Xiang, and Allison Koenecke. 2023. Augmented Datasheets for Speech Datasets and Ethical Decision-Making. In 2023 ACM Conference on Fairness, Accountability, and Transparency (FAccT ’23), June 12–15, 2023, Chicago, IL, USA. ACM, New York, NY, USA, 23 pages. https://doi.org/10.1145/3593013.3594049
The augmented datasheets are released under a Creative Commons Attribution-ShareAlike 4.0 International License.
For any inquiries, please contact [email protected]