Movatterモバイル変換

Taking Actions Separately: A Bidirectionally-Adaptive Transfer Learning Method for Low-Resource Neural Machine Translation

Xiaolin Xing,Yu Hong,Minhan Xu,Jianmin Yao,Guodong Zhou

Abstract

Training Neural Machine Translation (NMT) models suffers from sparse parallel data, in the infrequent translation scenarios towards low-resource source languages. The existing solutions primarily concentrate on the utilization of Parent-Child (PC) transfer learning. It transfers well-trained NMT models on high-resource languages (namely Parent NMT) to low-resource languages, so as to produce Child NMT models by fine-tuning. It has been carefully demonstrated that a variety of PC variants yield significant improvements for low-resource NMT. In this paper, we intend to enhance PC-based NMT by a bidirectionally-adaptive learning strategy. Specifically, we divide inner constituents (6 transformers) of Parent encoder into two “teams”, i.e., T1 and T2. During representation learning, T1 learns to encode low-resource languages conditioned on bilingual shareable latent space. Generative adversarial network and masked language modeling are used for space-shareable encoding. On the other hand, T2 is straightforwardly transferred to low-resource languages, and fine-tuned together with T1 for low-resource translation. Briefly, T1 and T2 take actions separately for different goals. The former aims to adapt to characteristics of low-resource languages during encoding, while the latter adapts to translation experiences learned from high-resource languages. We experiment on benchmark corpora SETIMES, conducting low-resource NMT for Albanian (Sq), Macedonian (Mk), Croatian (Hr) and Romanian (Ro). Experimental results show that our method yields substantial improvements, which allows the NMT performance to reach BLEU4-scores of 62.24%, 56.93%, 50.53% and 54.65% for Sq, Mk, Hr and Ro, respectively.

Anthology ID:: 2022.coling-1.395
Volume:: Proceedings of the 29th International Conference on Computational Linguistics
Month:: October
Year:: 2022
Address:: Gyeongju, Republic of Korea
Editors:: Nicoletta Calzolari,Chu-Ren Huang,Hansaem Kim,James Pustejovsky,Leo Wanner,Key-Sun Choi,Pum-Mo Ryu,Hsin-Hsi Chen,Lucia Donatelli,Heng Ji,Sadao Kurohashi,Patrizia Paggio,Nianwen Xue,Seokhwan Kim,Younggyun Hahm,Zhong He,Tony Kyungil Lee,Enrico Santus,Francis Bond,Seung-Hoon Na
Venue:: COLING
SIG:
Publisher:: International Committee on Computational Linguistics
Note:
Pages:: 4481–4491
Language:
URL:: https://aclanthology.org/2022.coling-1.395/
DOI:
Bibkey:
Cite (ACL):: Xiaolin Xing, Yu Hong, Minhan Xu, Jianmin Yao, and Guodong Zhou. 2022.Taking Actions Separately: A Bidirectionally-Adaptive Transfer Learning Method for Low-Resource Neural Machine Translation. InProceedings of the 29th International Conference on Computational Linguistics, pages 4481–4491, Gyeongju, Republic of Korea. International Committee on Computational Linguistics.
Cite (Informal):: Taking Actions Separately: A Bidirectionally-Adaptive Transfer Learning Method for Low-Resource Neural Machine Translation (Xing et al., COLING 2022)
Copy Citation:
PDF:: https://aclanthology.org/2022.coling-1.395.pdf

PDF Cite Search Fix data