- Notifications
You must be signed in to change notification settings - Fork1
ku-nlp/EaST-MELD
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
EaST-MELD is an English-Japanese dataset for emotion-aware speech translation based onMELD.EaST-MELD has the following features:
- Based on subtitles and speech from TV dramas.
- EaST-MELD has text/speech in both English and Japanese and two types of emotion labels (Emotion/Sentiment) annotated for each utterance.
- EaST-MELD has an emotion-aware evaluation set (dev/test)
Emotion | Sentiment | English | Japanese |
---|---|---|---|
surprise | negative | This sounds like a hernia. You have to―you―you go to the doctor! | ヘルニアだな医者へ |
fear | negative | No way! ‘Kay look, if I have to go to the doctor for anything it’s gonna be for this thing sticking out of my Stomach! | 行くもんかこの何かが腹から出てくるまではな |
disgust | negative | That’s a hernia. | 脱腸だって |
anger | negative | Why did I have to start working out again? Damn you 15s! | 運動して失敗した ダンベルのバカ! |
# of utterances | |
---|---|
Train | 9,422 |
Dev | 418 |
Test | 418 |
- train, dev_subtitle, test_subtitle: Subtitles is used as Japanese translation.
- dev_transcription, test_transcription: Speech transcription is used as Japanese translation.
id, dialogue_id, utterance_id, Emotion, Sentiment, Text(En), Text(Ja), Season, Episode, Speaker, Starttime(En), Endtime(En), Starttime(Ja), Endtime(Ja)
- csv data includes utterances that do not have a corresponding Japanese translation.
dialogue_id, Utterance_id, Emotion, Sentiment, Speaker, the name of wav files
- yaml data does not include utterances that do not have a corresponding Japanese translation.
- text only
- The lines in the yaml file and the .en/.ja files are mapped to each other.
Please, note that by downloading the dataset, you agree to the following conditions:
- Do not re-distribute the dataset without our permission.
- The dataset can only be used for research purposes. Any other use is explicitly prohibited.
If you are interested in the speech data of EaST-MELD, please contactyahata@nlp.ist.i.kyoto-u.ac.jp .
S. Poria, D. Hazarika, N. Majumder, G. Naik, E. Cambria, R. Mihalcea. MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in Conversation. ACL 2019.
About
No description, website, or topics provided.
Resources
Uh oh!
There was an error while loading.Please reload this page.
Stars
Watchers
Forks
Releases
No releases published
Packages0
No packages published