eric-xw/Video-guided-Machine-Translation

Starter code for the VMT task and challenge


This repo contains the starter code for the VATEX Translation Challenge for Video-guided Machine Translation (VMT), which aims to translate a source-language description into the target language using video information as additional spatiotemporal context.

VMT is introduced in our ICCV oral paper "VATEX: A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language Research". VATEX is a new large-scale multilingual video description dataset, which contains over 41,250 videos and 825,000 captions in both English and Chinese; half of these captions are English-Chinese translation pairs. For more details, please check the latest version of the paper: https://arxiv.org/abs/1904.03493.

Prerequisites

  • Python 3.7
  • PyTorch 1.4 (1.0+)
  • nltk 3.4.5

Training

1. Download corpus files and the extracted video features

First, under the vmt/ directory, download the train/val/test json files:

./data/download.sh
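Once downloaded, each entry in the json files pairs a video ID with its English and Chinese captions. A minimal loading sketch (the filename and the videoID/enCap/chCap field names follow the public VATEX data release; adjust them if your download differs):

import json

# Inspect one annotation entry (filename is an assumption from the VATEX release).
with open("data/vatex_training_v1.0.json") as f:
    data = json.load(f)

entry = data[0]
print(entry["videoID"])   # YouTube ID with start/end timestamps
print(entry["enCap"][0])  # one of the English captions
print(entry["chCap"][0])  # one of the Chinese captions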

Then download the I3D video features from https://vatex-feats.s3.amazonaws.com/trainval.zip (trainval) and https://vatex-feats.s3.amazonaws.com/public_test.zip (test):

# set up your DIR/vatex_features for storing large video features
mkdir DIR/vatex_features
wget https://vatex-feats.s3.amazonaws.com/trainval.zip -P DIR/vatex_features
unzip DIR/vatex_features/trainval.zip
wget https://vatex-feats.s3.amazonaws.com/public_test.zip -P DIR/vatex_features
unzip DIR/vatex_features/public_test.zip
cd vmt/
ln -s DIR/vatex_features data/vatex_features
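After unzipping, a quick sanity check on the features; this sketch assumes each video's I3D features are stored as a per-video .npy array of shape (num_segments, 1024), as in the public VATEX feature release (the directory layout below is an assumption):

import glob
import numpy as np

# Count and inspect the extracted feature files (path layout is an assumption).
feat_files = glob.glob("data/vatex_features/trainval/*.npy")
print(len(feat_files), "feature files found")

feats = np.load(feat_files[0])
print(feats.shape)  # expected: (num_segments, 1024) segment-level I3D features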

2. Training

To train the baseline VMT model:

python train.py

The default hyperparameters are set in configs.yaml.
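If you prefer to inspect or override hyperparameters programmatically instead of editing configs.yaml by hand, a minimal PyYAML sketch (the learning_rate key is a hypothetical example, not necessarily the repo's actual schema):

import yaml

# Load the training configuration used by train.py.
with open("configs.yaml") as f:
    cfg = yaml.safe_load(f)
print(cfg)

# Example override (hypothetical key), then write the file back:
# cfg["learning_rate"] = 1e-4
# with open("configs.yaml", "w") as f:
#     yaml.safe_dump(cfg, f)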

Evaluation

Run

python eval.py

Specify the model name in configs.yaml. The script will generate a json file for submission to the VMT Challenge on CodaLab.
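The authoritative submission schema is defined by the challenge page; as an illustrative sketch only, a typical CodaLab workflow maps each test ID to its translated sentence and zips the json for upload (all field names below are assumptions, not the official format):

import json
import zipfile

# Hypothetical payload: test video/segment ID -> translated caption.
submission = {"example_video_id": "example translated sentence"}

with open("submission.json", "w") as f:
    json.dump(submission, f, ensure_ascii=False)

# CodaLab submissions are typically uploaded as a zip archive.
with zipfile.ZipFile("submission.zip", "w") as zf:
    zf.write("submission.json")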

Results

The baseline VMT model achieves the following corpus-level BLEU scores. (The numbers here differ slightly from those in the paper due to different evaluation setups; for a fair comparison, please use the numbers reported here.)

Model     EN -> ZH    ZH -> EN
BLEU-4    31.1        24.6

On the evaluation server, we report the cumulative corpus-level BLEU score (up to 4-gram) and each individual n-gram score for reference, shown as B-1, ..., B-4.

Model performance is evaluated by cumulative BLEU-4 score in the challenge.
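To approximate the server's scoring locally with the nltk prerequisite, the sketch below computes cumulative BLEU-4 and an individual n-gram score via corpus_bleu; note it uses naive whitespace-style token lists, while the official server's tokenizer (especially for Chinese) may differ:

from nltk.translate.bleu_score import corpus_bleu

# Each hypothesis is scored against a list of reference token lists.
references = [[["a", "man", "is", "playing", "the", "guitar"]]]
hypotheses = [["a", "man", "is", "playing", "guitar"]]

# Cumulative BLEU-4, the challenge's ranking metric.
bleu4 = corpus_bleu(references, hypotheses, weights=(0.25, 0.25, 0.25, 0.25))

# Individual 1-gram score, reported as B-1 on the server.
b1 = corpus_bleu(references, hypotheses, weights=(1, 0, 0, 0))
print("BLEU-4: %.3f  B-1: %.3f" % (bleu4, b1))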

Reference

Please cite our paper if you use our code or dataset:

@InProceedings{Wang_2019_ICCV,
  author    = {Wang, Xin and Wu, Jiawei and Chen, Junkun and Li, Lei and Wang, Yuan-Fang and Wang, William Yang},
  title     = {VaTeX: A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language Research},
  booktitle = {The IEEE International Conference on Computer Vision (ICCV)},
  month     = {October},
  year      = {2019}
}
