Skip to content

SonyResearch/project_ethics_augmented_datasheets_for_speech_datasets

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

38 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Augmented Datasheets for Speech Datasets and Ethical Decision-Making

Materials that accompany the study "Augmented Datasheets for Speech Datasets and Ethical Decision-Making", accepted at FAccT'23.

The repository contains following files:

  • The augmented datasheets templates in .docx and .tex (with options to include the original Datasheets for Datasets questions, or to answer our speech-specific questions only).
  • Exemplary datasheets for 5 datasets: LibriSpeech, Common Voice, WHAM!, VoxPopuli, CORAAL.
  • The lists of datasets & studies used in the literature review performed.

Please cite: Orestis Papakyriakopoulos, Anna Seo Gyeong Choi, Jerone Andrews, Rebecca Bourke, William Thong, Dora Zhao, Alice Xiang, and Allison Koenecke. 2023. Augmented Datasheets for Speech Datasets and Ethical Decision-Making. In 2023 ACM Conference on Fairness, Accountability, and Transparency (FAccT ’23), June 12–15, 2023, Chicago, IL, USA. ACM, New York,NY, USA, 23 pages https://doi.org/10.1145/3593013.3594049

The augmented datasheets are released under a Creative Commons Attribution-Share Alike 4.0 International License.

For any inquiries, please contact [email protected]

About

Public code repo for research paper

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages