From ddc47ea4c9811e3957f2652bc4514075840356fa Mon Sep 17 00:00:00 2001 From: "github-actions[bot]" Date: Sun, 23 Jun 2024 17:45:32 +0000 Subject: [PATCH] Automatically generated CSV and README from JSON update --- README.md | 2 +- src/ser-datasets.csv | 2 +- 2 files changed, 2 insertions(+), 2 deletions(-) diff --git a/README.md b/README.md index bb92635..3d51fca 100644 --- a/README.md +++ b/README.md @@ -18,7 +18,7 @@ The table can be browsed, sorted and searched under https://superkogito.github.i | [MESD](https://data.mendeley.com/datasets/cy34mh68j9/5) | 2022 | 864 audio files of single-word emotional utterances with Mexican cultural shaping. | 6 emotions provides single-word utterances for anger, disgust, fear, happiness, neutral, and sadness. | Audio | 0.097 GB | Spanish (Mexican) | [The Mexican Emotional Speech Database (MESD): elaboration and assessment based on machine learning](https://pubmed.ncbi.nlm.nih.gov/34891601/) | Open | [CC BY 4.0](https://creativecommons.org/licenses/by/4.0/) | | [SyntAct](https://zenodo.org/record/6573016#.ZAjy_9LMJpj) | 2022 | Synthesized database with 997 utterances of three basic emotions and neutral expression based on rule-based manipulation for a diphone synthesizer which we release to the public | 6 emotions: angry, bored, happy, neutral, sad and scared | Audio | 0.941 GB | German | [SyntAct: A Synthesized Database of Basic Emotions](http://felix.syntheticspeech.de/publications/synthetic_database.pdf) | Open | [CC BY-SA 4.0](https://creativecommons.org/licenses/by/4.0) | | [BEAT](https://drive.google.com/drive/folders/1EKuWH8q178QOtFUYaNohdkZbBHQYAmhL) | 2022 | 76-Hour and 30-Speaker of 4 different languages: English (60h), Chinese (12h), Spanish (2h) and Japanese (2h). | 8 emotions: happiness, anger, disgust, sadness, contempt, surprise, fear, and neutral | Audio, Video | 42 GB | English, Chinese, Spanish, Japanese | [A Large-Scale Semantic and Emotional Multi-Modal Dataset for Conversational Gestures Synthesis](https://www.ecva.net/papers/eccv_2022/papers_ECCV/papers/136670605.pdf) | Open | Non-commercial license | -| [Dusha](https://github.com/salute-developers/golos/tree/master/dusha) | 2022 | 300 000 audio recordings (~350 hours) of Russian speech, their transcripts and emotiomal labels. The dataset has two subsets: acted and real-life | 4 emotions: angry, happy, sad and neutral. Arousal and valence metrics are also available. | Audio | 58 GB | Russian | [Large Raw Emotional Dataset with Aggregation Mechanism](https://arxiv.org/abs/2212.12266) | Open | [Public license with attribution and conditions reserved](https://github.com/salute-developers/golos/blob/master/license/en_us.pdf) | +| [Dusha](https://github.com/salute-developers/golos/tree/master/dusha) | 2022 | 300000 audio recordings (~350 hours) of Russian speech, their transcripts and emotiomal labels. The dataset has two subsets: acted and real-life | 4 emotions: angry, happy, sad and neutral. Arousal and valence metrics are also available. | Audio | 58 GB | Russian | [Large Raw Emotional Dataset with Aggregation Mechanism](https://arxiv.org/abs/2212.12266) | Open | [Public license with attribution and conditions reserved](https://github.com/salute-developers/golos/blob/master/license/en_us.pdf) | | [MAFW](https://mafw-database.github.io/MAFW/) | 2022 | 10045 video-audio clips in the wild. | 11 single-label emotion categories (anger, disgust, fear, happiness, neutral, sadness, surprise, contempt, anxiety, helplessness, and disappointment) and 32 multi-label emotion categories. | Audio, Video | -- | -- | [MAFW: A Large-scale, Multi-modal, Compound Affective Database for Dynamic Facial Expression Recognition in the Wild](https://arxiv.org/abs/2208.00847) | Restricted | Non-commercial research purposes | | [EMOVIE](https://viem-ccy.github.io/EMOVIE/dataset_release.html) | 2021 | 9724 samples with audio files and its emotion human-labeled annotation. | Polarity metrics (positive:+1, negative:-1) | Audio | 0.572 GB | Chinese (Mandarin) | [EMOVIE: A Mandarin Emotion Speech Dataset with a Simple Emotional Text-to-Speech Model](https://arxiv.org/abs/2106.09317) | Open | [CC BY-NC-SA 2.0](https://creativecommons.org/licenses/by-nc-sa/2.0/legalcode) | | [emoUERJ](https://zenodo.org/records/5427549) | 2021 | Ten sentences from eight actors, equally divided between genders, and they were free to choose the phrases for record audios in four emotions (377 audios). | happiness, anger, sadness or neutral | Audio | 0.1051 GB | Portuguese (Brazilian) | -- | Open | [CC BY 4.0](https://creativecommons.org/licenses/by/4.0/) | diff --git a/src/ser-datasets.csv b/src/ser-datasets.csv index 0819e9d..28e76b4 100644 --- a/src/ser-datasets.csv +++ b/src/ser-datasets.csv @@ -14,7 +14,7 @@ Dataset,Year,Content,Emotions,Format,Size,Language,Paper,Access,License `MESD `_,2022,864 audio files of single-word emotional utterances with Mexican cultural shaping.,"6 emotions provides single-word utterances for anger, disgust, fear, happiness, neutral, and sadness.",Audio,0.097 GB,Spanish (Mexican),`The Mexican Emotional Speech Database (MESD): elaboration and assessment based on machine learning `_,Open,`CC BY 4.0 `_ `SyntAct `_,2022,Synthesized database with 997 utterances of three basic emotions and neutral expression based on rule-based manipulation for a diphone synthesizer which we release to the public ,"6 emotions: angry, bored, happy, neutral, sad and scared",Audio,0.941 GB,German,`SyntAct: A Synthesized Database of Basic Emotions `_,Open,`CC BY-SA 4.0 `_ `BEAT `_,2022,"76-Hour and 30-Speaker of 4 different languages: English (60h), Chinese (12h), Spanish (2h) and Japanese (2h).","8 emotions: happiness, anger, disgust, sadness, contempt, surprise, fear, and neutral","Audio, Video",42 GB,"English, Chinese, Spanish, Japanese",`A Large-Scale Semantic and Emotional Multi-Modal Dataset for Conversational Gestures Synthesis `_,Open,Non-commercial license -`Dusha `_,2022," 300 000 audio recordings (~350 hours) of Russian speech, their transcripts and emotiomal labels. The dataset has two subsets: acted and real-life","4 emotions: angry, happy, sad and neutral. Arousal and valence metrics are also available.",Audio,58 GB,Russian,`Large Raw Emotional Dataset with Aggregation Mechanism `_,Open,`Public license with attribution and conditions reserved `_ +`Dusha `_,2022,"300000 audio recordings (~350 hours) of Russian speech, their transcripts and emotiomal labels. The dataset has two subsets: acted and real-life","4 emotions: angry, happy, sad and neutral. Arousal and valence metrics are also available.",Audio,58 GB,Russian,`Large Raw Emotional Dataset with Aggregation Mechanism `_,Open,`Public license with attribution and conditions reserved `_ `MAFW `_,2022,10045 video-audio clips in the wild.,"11 single-label emotion categories (anger, disgust, fear, happiness, neutral, sadness, surprise, contempt, anxiety, helplessness, and disappointment) and 32 multi-label emotion categories.","Audio, Video",--,--,"`MAFW: A Large-scale, Multi-modal, Compound Affective Database for Dynamic Facial Expression Recognition in the Wild `_",Restricted,Non-commercial research purposes `EMOVIE `_,2021,9724 samples with audio files and its emotion human-labeled annotation.,"Polarity metrics (positive:+1, negative:-1)",Audio,0.572 GB,Chinese (Mandarin),`EMOVIE: A Mandarin Emotion Speech Dataset with a Simple Emotional Text-to-Speech Model `_,Open,`CC BY-NC-SA 2.0 `_ `emoUERJ `_,2021,"Ten sentences from eight actors, equally divided between genders, and they were free to choose the phrases for record audios in four emotions (377 audios). ","happiness, anger, sadness or neutral",Audio,0.1051 GB,Portuguese (Brazilian),--,Open,`CC BY 4.0 `_