Pervasive Attention: 2D Convolutional Networks for Sequence-to-Sequence Prediction
This is a fork of Fairseq(-py) with implementations of the following models:
An NMT model with two-dimensional convolutions to jointly encode the source and the target sequences.
Pervasive Attention also provides an extensive decoding grid that we leverage to efficiently train wait-k models.
See the Pervasive Attention README.
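To make the joint 2D encoding concrete, here is a minimal, hedged PyTorch sketch of the idea (it is not the repository's actual model, which stacks masked DenseNet-style convolutions over the grid): each cell of a target-by-source grid concatenates one target and one source embedding, 2D convolutions process the grid, and pooling over the source axis yields one representation per target position.

```python
# Toy sketch of 2D joint source/target encoding (not the repository's model):
# each grid cell (t, s) concatenates the embeddings of target token t and source
# token s; a 2D convolution processes the grid and max-pooling over the source
# axis produces one representation per target position for next-token prediction.
import torch
import torch.nn as nn

class Toy2DJointEncoder(nn.Module):
    def __init__(self, vocab_size=1000, embed_dim=64, hidden_dim=64):
        super().__init__()
        self.src_embed = nn.Embedding(vocab_size, embed_dim)
        self.tgt_embed = nn.Embedding(vocab_size, embed_dim)
        # The paper uses a deep DenseNet with masked convolutions; one plain conv keeps the sketch short.
        self.conv = nn.Conv2d(2 * embed_dim, hidden_dim, kernel_size=3, padding=1)
        self.proj = nn.Linear(hidden_dim, vocab_size)

    def forward(self, src_tokens, tgt_tokens):
        src = self.src_embed(src_tokens)   # (B, Ts, E)
        tgt = self.tgt_embed(tgt_tokens)   # (B, Tt, E)
        Ts, Tt = src.size(1), tgt.size(1)
        # Build the (Tt x Ts) grid of concatenated target/source embeddings.
        grid = torch.cat(
            [tgt.unsqueeze(2).expand(-1, -1, Ts, -1),
             src.unsqueeze(1).expand(-1, Tt, -1, -1)],
            dim=-1)                                          # (B, Tt, Ts, 2E)
        feats = self.conv(grid.permute(0, 3, 1, 2))          # (B, H, Tt, Ts)
        pooled = feats.max(dim=-1).values.permute(0, 2, 1)   # pool over source axis -> (B, Tt, H)
        return self.proj(pooled)                             # logits per target position
```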
Transformer Wait-k models (Ma et al., 2019) with unidirectional encoders and with joint training of multiple wait-k paths.
See the wait-k README.
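As a rough illustration of the wait-k policy (stated here from Ma et al., 2019, not taken from this codebase): the decoder emits the t-th target token after reading only the first min(k + t, |source|) source tokens, and multi-path training samples a new k for each batch so a single model can serve several latency regimes at test time.

```python
# Illustrative wait-k helpers (assumptions for exposition, not the repository's API).
import random

def waitk_context_size(k: int, t: int, src_len: int) -> int:
    """Source tokens visible when emitting the t-th target token (0-indexed) under wait-k."""
    return min(k + t, src_len)

def sample_waitk(k_min: int = 1, k_max: int = 9) -> int:
    """Multi-path training: sample a fresh k for each batch (the range is illustrative)."""
    return random.randint(k_min, k_max)
```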
- PyTorch version >= 1.4.0
- Python version >= 3.6
- For training new models, you'll also need an NVIDIA GPU and NCCL (a quick environment check is sketched below)
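Assuming PyTorch is already installed, a quick way to verify these requirements (a sketch, not part of the repository):

```python
# Environment check against the requirements above.
import sys
import torch

print("Python:", sys.version.split()[0])                          # needs >= 3.6
print("PyTorch:", torch.__version__)                               # needs >= 1.4.0
print("CUDA available:", torch.cuda.is_available())                # needed to train new models
print("NCCL available:", torch.distributed.is_nccl_available())    # needed for multi-GPU training
```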
Installing Fairseq
```bash
git clone https://github.com/elbayadm/attn2d
cd attn2d
pip install --editable .
```
fairseq(-py) is MIT-licensed. The license applies to the pre-trained models as well.
For Pervasive Attention, please cite:
```bibtex
@InProceedings{elbayad18conll,
  author    = "Elbayad, Maha and Besacier, Laurent and Verbeek, Jakob",
  title     = "Pervasive Attention: 2D Convolutional Neural Networks for Sequence-to-Sequence Prediction",
  booktitle = "Proceedings of the 22nd Conference on Computational Natural Language Learning",
  year      = "2018",
}
```
For our wait-k models, please cite:
```bibtex
@article{elbayad20waitk,
  title   = {Efficient Wait-k Models for Simultaneous Machine Translation},
  author  = {Elbayad, Maha and Besacier, Laurent and Verbeek, Jakob},
  journal = {arXiv preprint arXiv:2005.08595},
  year    = {2020}
}
```
For Fairseq, please cite:
```bibtex
@inproceedings{ott2019fairseq,
  title     = {fairseq: A Fast, Extensible Toolkit for Sequence Modeling},
  author    = {Myle Ott and Sergey Edunov and Alexei Baevski and Angela Fan and Sam Gross and Nathan Ng and David Grangier and Michael Auli},
  booktitle = {Proceedings of NAACL-HLT 2019: Demonstrations},
  year      = {2019},
}
```