ZeweiChu/DiscoEvalPublic

NotificationsYou must be signed in to change notification settings
Fork8
Star43

EMNLP DiscoEval paper

License

View license

43 stars 8 forks Branches Tags Activity

Star

Notifications

You must be signed in to change notification settings

Branches Tags

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
data		data
discoeval		discoeval
examples		examples
train		train
LICENSE		LICENSE
README.md		README.md

Repository files navigation

DiscoEval

This repository contains the code for DiscoEvalEvaluation Benchmarks and Learning Criteria for Discourse-Aware Sentence Representations (EMNLP 2019).

The structure of this repo:

train: the training code
discoeval: the Discourse Evaluation framework
data: the DiscoEval evaluation datasets

The pretrained models with different training signals can be downloaded fromhttps://drive.google.com/file/d/1I0wFkNb2fmoC7kcj-FPxyVkHKyfX-MoX/view?usp=sharing

The training data (generated from Wikipedia) can be downloaded from

https://drive.google.com/open?id=1WPRJylC7PzLtYcg8-PMNX_ZUNNRkO3Bp

Evaluation example code

train/discoeval_example.py

The code is tested under the following environment/versions:

Python 3.6.2
PyTorch 1.0.0
numpy 1.16.0

Some code in this repo is adopted fromSentEval.

Experiments

	SP	BSO	DC	SSP	PDTB-E	PDTB-I	RST	AVG
baseline	47.3	63.8	61.0	77.8	36.5	39.1	56.7	54.6
SDT	45.8	62.9	60.3	78.0	36.6	39.1	55.7	54.1
SPP	48.4	65.3	60.2	78.4	38.1	39.9	56.4	55.2
NL	46.9	64.0	61.0	78.9	37.6	39.9	56.5	55.0
SPP + NL	48.5	64.7	59.9	78.9	37.8	40.5	56.7	55.3
SDT + NL	46.1	63.0	60.8	78.1	36.7	38.1	56.2	54.1
SPP + SDT	46.5	63.9	60.4	77.6	35.2	38.6	56.3	54.1
ALL	46.1	63.7	60.0	78.6	36.3	37.6	55.3	53.9
Skipthought	47.5	64.6	55.2	77.5	39.3	40.2	59.7	54.8
InferSent	45.8	62.9	56.3	62.2	37.3	38.8	52.3	50.8
DisSent	47.7	64.9	54.8	62.2	42.2	40.7	57.8	52.9
ELMo	47.8	65.6	60.7	79.0	41.3	41.8	57.5	56.2
BERT base	53.1	68.5	58.9	80.3	41.9	42.4	58.8	57.7
BERT large	53.8	69.3	59.6	80.4	44.3	43.6	59.1	58.6

You may notice some difference from the above table with our camera-ready version appeared on EMNLP 2019.The differences are: we removed the hidden states in SSP (previously 2000 by mistake), we regenerated the SP dataset (previously the sentence orders were shuffled, now the sentences are in the original order except the first sentence).

Reference

@inproceedings{mchen-discoeval-19,  author    = {Mingda Chen and Zewei Chu and Kevin Gimpel},  title     = {Evaluation Benchmarks and Learning Criteria for Discourse-Aware Sentence Representations},  booktitle = {Proc. of {EMNLP}},  year      = {2019}}

About

EMNLP DiscoEval paper

Languages

Python100.0%

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

License

Folders and files

Latest commit

History

Repository files navigation

DiscoEval

Experiments

Reference

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages

Contributors2

Uh oh!

Languages

Movatterモバイル変換

License

ZeweiChu/DiscoEval

Folders and files

Latest commit

History

Repository files navigation

DiscoEval

Experiments

Reference

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages0

Contributors2

Uh oh!

Languages

Packages