Add the complete database #10

Open
felixbur opened this issue Jun 29, 2023 · 6 comments

Comments

@felixbur
Member

The complete data described in the paper additionally contains:

  • the files that were not recognized by 80% of the listeners
  • the laryngogram tracks

@felixbur
Member Author

OK, I would start working on this now.
I have 817 stereo files here (audio and laryngo channels).

I would:

  • split the channels
  • add the files that are not already in the distribution to the files table
  • add the additional files to a table named emotion.categories.extra.gold_standard
  • make an extra folder for the laryngo tracks
  • duplicate all tables with the prefix "laryngo", e.g. "laryngo.files", "laryngo.emotion.categories.extra.gold_standard", etc.
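
The table naming in the last step could be sketched as follows. This is only an illustration of the proposed prefix scheme in plain Python: the helper name `prefixed_table_ids` is hypothetical, and no database library is called.

```python
def prefixed_table_ids(table_ids, prefix="laryngo"):
    """Map each existing table ID to its prefixed counterpart.

    Hypothetical helper: it only builds names such as
    "laryngo.files"; it does not create or copy any table.
    """
    return {table_id: f"{prefix}.{table_id}" for table_id in table_ids}


# Table IDs taken from the proposal above.
mapping = prefixed_table_ids(
    ["files", "emotion.categories.extra.gold_standard"]
)
for original, duplicate in mapping.items():
    print(f"{original} -> {duplicate}")
```

Keeping the original ID as the suffix means each laryngo table can be traced back to its audio counterpart by stripping the prefix.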

What do you think, @hagenw?

@hagenw
Member

hagenw commented Nov 21, 2024

split the channels

Why do they have more than one channel? Are they stereo recordings, or are different speakers in different channels?

make an extra folder for the laryngo tracks

What is the file format of the laryngo tracks? I guess you would like to treat them as media files and create tables that have them in the index, correct?

@felixbur
Member Author

felixbur commented Nov 21, 2024

  1. left channel = audio, right channel = laryngo
  2. all in WAV format at 16 kHz
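
Given that layout, splitting one stereo file into an audio file and a laryngo file could look roughly like this. It is a minimal sketch using only Python's standard `wave` module and assumes uncompressed PCM WAV input; the function names and the file paths in the usage note are hypothetical.

```python
import wave


def split_stereo(frames: bytes, sample_width: int) -> tuple[bytes, bytes]:
    """De-interleave raw stereo PCM frames into (left, right) mono bytes."""
    frame_size = 2 * sample_width
    if len(frames) % frame_size:
        raise ValueError("data is not a whole number of stereo frames")
    left = bytearray()
    right = bytearray()
    for i in range(0, len(frames), frame_size):
        left += frames[i:i + sample_width]                # audio sample
        right += frames[i + sample_width:i + frame_size]  # laryngo sample
    return bytes(left), bytes(right)


def split_wav(src: str, audio_dst: str, laryngo_dst: str) -> None:
    """Write the left channel to audio_dst and the right to laryngo_dst."""
    with wave.open(src, "rb") as wav:
        if wav.getnchannels() != 2:
            raise ValueError(f"{src} is not a stereo file")
        width = wav.getsampwidth()
        rate = wav.getframerate()
        frames = wav.readframes(wav.getnframes())
    left, right = split_stereo(frames, width)
    for path, data in ((audio_dst, left), (laryngo_dst, right)):
        with wave.open(path, "wb") as out:
            out.setnchannels(1)
            out.setsampwidth(width)
            out.setframerate(rate)  # keep the source rate (16 kHz here)
            out.writeframes(data)
```

A call such as `split_wav("in/example.wav", "wav/example.wav", "laryngo/example.wav")` would then fill the extra laryngo folder; these paths are placeholders, not the actual file layout.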

@felixbur
Member Author

By the way, I see that there is also a naturalness value ("How natural does the sample sound?").

I would add this as well.

@schruefer
Member

Cool! I'm looking forward to using those additional tracks!

add the additional files to a table named emotion.categories.extra.gold_standard

I would not call this table gold_standard: the name suggests listener agreement, which these files do not have.
I would rather call it emotion.categories.extra, emotion.categories.disagreement, or emotion.categories.ambiguous.

@felixbur
Member Author

emotion.categories.ambiguous seems best.
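
Deciding which files go into emotion.categories.ambiguous could be sketched like this. It assumes a per-file recognition rate is available as a fraction of listeners, and that a file counts as recognized when the rate is at least 0.8; whether the threshold is inclusive is an assumption, and the function name, file names and rates below are made up for illustration.

```python
def split_by_recognition(rates, threshold=0.8):
    """Partition files into (recognized, ambiguous) by recognition rate.

    rates: mapping of file name -> fraction of listeners who
    recognized the intended emotion (assumed input format).
    """
    recognized = sorted(f for f, r in rates.items() if r >= threshold)
    ambiguous = sorted(f for f, r in rates.items() if r < threshold)
    return recognized, ambiguous


# Hypothetical file names and rates, for illustration only.
example_rates = {"a.wav": 0.95, "b.wav": 0.60, "c.wav": 0.80}
recognized, ambiguous = split_by_recognition(example_rates)
print(recognized)  # ['a.wav', 'c.wav']
print(ambiguous)   # ['b.wav']
```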
