Skip to content

dlion168/spoken_stereoset

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Spoken Stereoset: On Evaluating Social Bias Toward Speaker in Speech Large Language Models

This is the official repository for Spoken Stereoset, a dataset measures stereotypical bias on speech large language models (SLLMs). The construction detail can be found in our paper soon.

Metadata

id: The unique id of instance.
speaker: The speaker of the speech segment in azure TTS.
age/gender: The demogrpahic attribute of the speaker that might link to stereotypical associations.
context: The transcription of spoken context sentence.
irrelevant: An irrelevant continuation to the context.
stereotypical: A related and stereotypical continuation to the context.
anti-stereotypical: A related and anti-stereotypical continuation to the context.
labels: The labels annotated by the annotators for 3 possinle continuations.
annotators: The annotator id of the annotations.\

Contact

If you have any concerns, please contact: [email protected]

About

The official repo for speech based stereoset

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published