Dataset | Year | Content | Emotions | Format | Size | Language | Paper |
---|---|---|---|---|---|---|---|
MELD | 2019 | 1400 dialogues and 14000 utterances from Friends TV series by multiple speakers. | 7 emotions: Anger, Disgust, Sadness, Joy, Neutral, Surprise and Fear. MELD also has sentiment (positive, negative and neutral) annotation for each utterance. | Audio, Video, Text | ~10.1 GB | English | MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in Conversations |
Emov-DB | 2018 | Recordings for 4 speakers- 2 males and 2 females. | The emotional styles are neutral, sleepiness, anger, disgust and amused. | Audio | 5.88 GB | English | |
RAVDESS | 2018 | 7356 recordings by 24 actors. | 7 emotions: calm, happy, sad, angry, fearful, surprise, and disgust | Audio, Video | ~24.8 GB | English | The Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS): A dynamic, multimodal set of facial and vocal expressions in North American English |
JL corpus | 2018 | 2400 recording of 240 sentences by 4 actors (2 males and 2 females). | 5 primary emotions: angry, sad, neutral, happy, excited. 5 secondary emotions: anxious, apologetic, pensive, worried, enthusiastic. | Audio | -- | English | An Open Source Emotional Speech Corpus for Human Robot Interaction Applications |
ANAD | 2018 | 1384 recording by multiple speakers. | 3 emotions: angry, happy, surprised. | Audio | ~2 GB | Arabic | |
MSP-IMPROV | 2017 | 20 sentences by 12 actors. | 4 emotions: angry, sad, happy, neutral, other, without agreement | Audio, Video | -- | English | MSP-IMPROV: An Acted Corpus of Dyadic Interactions to Study Emotion Perception |
CREMA-D | 2017 | 7442 clip of 12 sentences spoken by 91 actors (48 males and 43 females). | 6 emotions: angry, disgusted, fearful, happy, neutral, and sad | Audio, Video | -- | English | CREMA-D: Crowd-sourced Emotional Multimodal Actors Dataset |
EMOVO | 2014 | 6 actors who played 14 sentences. | 6 emotions: disgust, fear, anger, joy, surprise, sadness. | Audio | ~355 MB | Italian | EMOVO Corpus: an Italian Emotional Speech Database |
RECOLA | 2013 | 3.8 hours of recordings by 46 participants. | negative and positive sentiment (valence and arousal). | Audio, Video | -- | -- | Introducing the RECOLA Multimodal Corpus of Remote Collaborative and Affective Interactions |
GEMEP Corpus | 2012 | 10 actors portraying 10 states. | 12 emotions: amusement, anxiety, cold anger (irritation), despair, hot anger (rage), fear (panic), interest, joy (elation), pleasure(sensory), pride, relief, and sadness. Plus, 5 additional emotions: admiration, contempt, disgust, surprise, and tenderness. | Audio, Video | -- | French | Introducing the Geneva Multimodal Expression Corpus for Experimental Research on Emotion Perception |
LEGO corpus | 2012 | 347 dialogs with 9,083 system-user exchanges. | Emotions classified as garbage, non-angry, slightly angry and very angry. | Audio | 1.1 GB | -- | A Parameterized and Annotated Spoken Dialog Corpus of the CMU Let’s Go Bus Information System |
TESS | 2010 | 2800 recording by 2 actresses. | 7 emotions: anger, disgust, fear, happiness, pleasant surprise, sadness, and neutral. | Audio | -- | English | BEHAVIOURAL FINDINGS FROM THE TORONTO EMOTIONAL SPEECH SET |
EEKK | 2007 | 26 text passage read by 10 speakers. | 4 main emotions: joy, sadness, anger and neutral. | -- | ~352 MB | Estonian | Estonian Emotional Speech Corpus |
Keio-ESD | 2006 | A set of human speech with vocal emotion spoken by a Japanese male speaker. | 47 emotions including angry, joyful, disgusting, downgrading, funny, worried, gentle, relief, indignation, shameful, etc. | Audio | -- | Japanese | EMOTIONAL SPEECH SYNTHESIS USING SUBSPACE CONSTRAINTS IN PROSODY |
EMO-DB | 2005 | 800 recording spoken by 10 actors (5 males and 5 females). | 7 emotions: anger, neutral, fear, boredom, happiness, sadness, disgust. | Audio | -- | German | A Database of German Emotional Speech |
DES | 2002 | 4 speakers (2 males and 2 females). | 5 emotions: neutral, surprise, happiness, sadness and anger | -- | -- | Danish | Documentation of the Danish Emotional Speech Database |
yonelyvan / ser-datasets Goto Github PK
View Code? Open in Web Editor NEWThis project forked from superkogito/ser-datasets
A collection of datasets for the purpose of emotions recognition/detection in speech
License: MIT License