CN109492232A - A Transformer-based Mongolian-Chinese machine translation method with enhanced semantic feature information - Google Patents

A Transformer-based Mongolian-Chinese machine translation method with enhanced semantic feature information

Info

Publication number
CN109492232A
CN109492232A (application CN201811231017.2A)
Authority
CN
China
Prior art keywords
layer
similarity
semantic
mongolian
sublayer
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811231017.2A
Other languages
Chinese (zh)
Inventor
苏依拉
张振
高芬
王宇飞
孙晓骞
牛向华
赵亚平
卞乐乐
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Inner Mongolia University of Technology
Original Assignee
Inner Mongolia University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inner Mongolia University of Technology
Priority to CN201811231017.2A
Publication of CN109492232A
Legal status: Pending

Abstract

Translated from Chinese

This paper proposes a Mongolian-Chinese machine translation method with enhanced semantic feature information based on the Transformer model. First, starting from the linguistic characteristics of Mongolian, the present invention identifies the features of stems, affixes and additional case components and incorporates these linguistic features into the training of the model. Second, taking the distributed representation that measures the degree of similarity between two words as the research background, the present invention comprehensively analyzes the influence of depth, density and semantic coincidence on concept semantic similarity. During translation, the present invention adopts a Transformer model, which is a multi-layer encoder-decoder architecture that uses trigonometric functions for positional encoding and is built on an enhanced multi-head attention mechanism, so that it relies entirely on the attention mechanism to draw the global dependencies between input and output, eliminating recurrence and convolution.

Description

A Mongolian-Chinese machine translation method with enhanced semantic feature information based on Transformer
Technical field
The invention belongs to the field of machine translation technology, and in particular relates to a Transformer-based Mongolian-Chinese machine translation method with enhanced semantic feature information.
Background technique
Mongolian is an agglutinative language belonging to the Altaic family. Written Mongolian has a traditional script and a Cyrillic script; the "Mongolian" in the Mongolian-Chinese translation system studied here refers to translation from traditional Mongolian into Chinese. Traditional Mongolian is also an alphabetic script, and its letter forms are not unique: the form of a letter varies with its position in the word, the positions being standing alone, word-initial, word-medial and word-final. Mongolian words are formed as root + suffix, and suffixes fall into two classes: one class attaches after the root and gives the original word a new meaning; these are called derivational suffixes, and a root followed by one or more derivational suffixes forms a stem. The other class attaches after the stem to express grammatical meaning. Mongolian nouns and verbs undergo many changes of tense, number and case, all realized by attaching suffixes, so Mongolian morphology is extremely complex. In addition, Mongolian word order differs greatly from Chinese: the Mongolian verb follows the subject and object and stands at the end of the sentence, whereas in Chinese the verb stands between subject and object.
A one-hot representation differs in only a single dimension of the vector, whereas a distributed representation of words uses low-dimensional dense real-valued vectors. In such a low-dimensional vector space, the degree of similarity between two words can conveniently be measured by distance, angle and similar metrics. On the technical side, against the background of research on statistical language models, Google released Word2vec in 2013, a software tool for training word vectors. Given a corpus, Word2vec can, through an optimized training model, quickly and effectively express a word in vector form, providing a new tool for applied research in natural language processing. Word2vec relies on skip-grams or continuous bag of words (CBOW) to build neural word embeddings. At present, however, Word2vec has certain limitations when computing semantic relatedness. On the one hand, it uses only the local context of the translation to be generated as the basis for predicting the translation and does not use global context information, so contextual information is under-exploited and there is room for improvement in semantic feature extraction. On the other hand, the structure of the framework itself limits parallel computation, so computational efficiency remains to be improved.
Most traditional machine translation systems are based on recurrent neural networks (RNN), long short-term memory (LSTM) or gated recurrent units (GRU). Over the past few years these methods have become the state-of-the-art approaches to sequence modeling and transduction problems such as machine translation. Recurrent models, however, typically factor computation along the symbol positions of the input and output sequences. Aligning the positions with steps in computation time, they generate a sequence of hidden states h_t as a function of the input at position t and the previous hidden state h_{t-1}. This inherently sequential nature precludes parallelization within training examples, which becomes critical at longer sequence lengths because memory constraints limit batching across examples. Recent work has achieved significant improvements in computational efficiency through factorization tricks and conditional computation, while also improving model performance in the latter case. The fundamental constraint of sequential computation, however, remains.
The current encoder-decoder framework is the main model for solving sequence-to-sequence problems. The model uses an encoder to compress the source-language sentence into a representation and a decoder to generate the target-language sentence from that compressed representation. The benefit of this structure is that it models the mapping between the two sentences end to end: all parameter variables of the model are trained under a single objective function, and model performance is good. Fig. 1 illustrates the structure of the encoder-decoder model, a bottom-up machine translation process.
The encoder and decoder can use neural networks of different structures, such as an RNN or a CNN. An RNN compresses the sequence step by step along time. When an RNN is used, a bidirectional structure is generally adopted: one RNN compresses the elements of the sequence from left to right and another compresses them from right to left, and the two representations are concatenated as the final distributed representation of the sequence. In this structure, because the elements of the sequence are processed in order, the interaction distance between two words can be regarded as their relative distance. As sentences grow longer and relative distances increase, there is a clear theoretical upper limit on how well information can be processed.
When a CNN structure is used, multiple layers are generally stacked to realize the process of moving from local representations of the sequence to a global representation. Modeling a sentence with an RNN treats it as a time series, whereas modeling it with a CNN treats it as a structure. Sequence-to-sequence models with an RNN structure mainly include RNNSearch, GNMT and the like; sequence-to-sequence models with a CNN structure mainly include ConvS2S and the like, which embody a local-to-global feature extraction process in which the interaction distance between words is proportional to their relative distance. Words that are far apart can only meet at higher CNN nodes before interacting, and this process may lose more information.
Summary of the invention
In order to overcome the disadvantages of the above prior art, the purpose of the present invention is to provide a Transformer-based Mongolian-Chinese machine translation method with enhanced semantic feature information. The system is based entirely on the attention mechanism and completely eliminates recurrence and convolution. Experiments show that the system is superior in quality, is easier to parallelize, and needs less time to train; it reaches 45.4 BLEU on a translation task over a 1.2-million-sentence-pair Mongolian-Chinese parallel corpus, achieving higher translation quality.
To achieve the above goals, the technical solution adopted by the present invention is: a Transformer-based Mongolian-Chinese machine translation method with enhanced semantic feature information, characterized in that a Transformer model is used in the translation process, the Transformer model being a multi-layer encoder-decoder architecture that uses trigonometric functions for positional encoding and is built on an enhanced multi-head attention mechanism, so that it relies entirely on the attention mechanism to draw the global dependencies between input and output, eliminating recurrence and convolution.
Before translation, to make it easier for the deep learning neural network to extract features, the data are first preprocessed. Preprocessing the data means segmenting and separating the stems, affixes and additional case components in the Mongolian corpus to reduce data sparsity, while performing character-level segmentation on the Chinese side; the linguistic features of Mongolian stems, affixes and additional case components are identified and incorporated into training.
The segmentation and separation include small-granularity affix segmentation, large-granularity stem segmentation and small-scale segmentation of the additional case components.
After the data are preprocessed, the influence of depth, density and semantic coincidence on concept semantic similarity is considered comprehensively, and similarity algorithms based on semantic distance and information content are integrated to build a similarity matrix; principal component analysis is then performed, the similarity matrix is converted into a principal component transformation matrix, the contribution rate of each principal component is calculated and used as a weight, and the weighted result gives the final concept semantic similarity.
The similarity matrix is expressed as
X_sim = (x_i1, x_i2, x_i3, x_i4, x_i5)^T, i = 1, 2, 3, ..., n
The final concept semantic similarity is calculated as
δ_sim = r_1·y_sim1 + r_2·y_sim2 + r_3·y_sim3 + r_4·y_sim4 + r_5·y_sim5
where X_sim denotes the similarity matrix; x_i1 denotes Ds, x_i2 denotes Ks, x_i3 denotes Zs, x_i4 denotes Ss, and x_i5 denotes Is; n is the number of concept-word pairs in the compared concept-pair set; x_i = (Ds_i, Ks_i, Zs_i, Ss_i, Is_i) is a vector in the principal-component input sample set, in which each dimension represents the result of one part of the comprehensive similarity computation: Ds_i denotes the relationship between the semantic distance and similarity of the i-th element, Ks_i the semantic similarity in terms of depth of the i-th element, Zs_i the density impact factor of the concept word c of the i-th element, Ss_i the similarity in terms of semantic coincidence of the i-th element, and Is_i the similarity in terms of information content of the i-th element; δ_sim denotes the concept semantic similarity; y_sim1, y_sim2, y_sim3, y_sim4, y_sim5 are the principal components extracted by principal component analysis of the similarity matrix X_sim; and r_1, r_2, r_3, r_4, r_5 denote the contribution rates of the principal components.
The multi-head attention mechanism is described as mapping a query and a set of key-value pairs to an output, where the query, keys, values and output are all vectors; the output is computed as a weighted sum of the values, and the weight assigned to each value is computed by a compatibility function of the query with the corresponding key.
The encoder is composed of N identical layers, each with two sublayers: the first sublayer is a multi-head attention layer and the second is a feed-forward sublayer. The input and output of each sublayer are joined by a residual connection, and each sublayer is followed by a regularization step to accelerate model convergence;
The decoder is composed of N identical layers, each with three sublayers. The first sublayer is a multi-head attention sublayer controlled by a mask matrix, used to model the target-side sentence generated so far; during training, a mask matrix ensures that each multi-head attention computation only attends to the first t-1 words. The second sublayer is a multi-head attention sublayer forming the attention mechanism between the encoder and the decoder, i.e., it looks for relevant semantic information in the source language. The third sublayer is a feed-forward sublayer, identical to the feed-forward sublayer in the encoder. The input and output of each sublayer are joined by residual connections and followed by a regularization step to accelerate model convergence.
The multi-layer encoder-decoder architecture is constructed as follows:
In the encoder, the output of each sublayer is LayerNorm(x + Sublayer(x)), where LayerNorm() denotes the layer-normalization function, Sublayer() is the function implemented by the sublayer itself with a residual connection based on the multi-head attention mechanism, and x denotes the vector to be input to the current layer. The Mongolian sentence is turned into corresponding vectors using the word2vec vector technique and then used as the input of the first encoder layer. To facilitate the residual connections, all sublayers and the embedding layer produce outputs of dimension d_model = 512.
The feed-forward sublayer of the encoder involves two linear transformations and one ReLU nonlinear activation; the specific calculation formula is as follows:
FFN(x) = γ(0, xW_1 + b_1)W_2 + b_2
where x denotes the encoder input information, W_1 the weight corresponding to the input vector, b_1 the bias factor of the multi-head attention mechanism, (0, xW_1 + b_1) the input-layer information of the feed-forward sublayer, W_2 the weight corresponding to the input vector, b_2 the bias factor of the feed-forward function, and γ the nonlinear activation function of the encoder layer.
Positional encoding with trigonometric functions uses the absolute position as the variable of the trigonometric functions; the formulas are as follows:
PE(pos, 2i) = sin(pos / 10000^(2i/d_model))
PE(pos, 2i+1) = cos(pos / 10000^(2i/d_model))
where pos is the position and i is the dimension, i.e., each dimension of the positional encoding corresponds to a sinusoid, and the wavelengths form a geometric progression from 2π to 10000·2π; d_model is the dimension of the embedding layer after positional encoding, and 2i takes values from a minimum of 0 to a maximum of d_model.
Compared with the prior art, the advantages of the present invention are as follows:
1. The present invention uses a Transformer-based sequence modeling method. The sequence-to-sequence model still follows the classical encoder-decoder structure, but instead of an RNN or CNN as the sequence modeling mechanism it uses the multi-head attention mechanism, making it easier to capture "long-distance dependency information".
2. The present invention segments the stems, affixes and additional case components in the Mongolian corpus. The additional case component is a special affix in Mongolian; its difference from ordinary affixes lies first of all in that it only expresses grammatical meaning and carries no meaning at the semantic level. The present invention segments and separates the additional case components in the corpus, which on the one hand reduces data sparsity and on the other hand better preserves Mongolian stem information.
3. Aiming at the serious data sparsity caused by Mongolian word-formation characteristics, the present invention proposes three word segmentation schemes of different granularity: small-granularity affix segmentation, large-granularity stem segmentation and small-scale segmentation of the additional case components. Experiments show that combining stem segmentation with segmentation of the additional case components improves translation quality the most.
4. Taking the distributed representation that measures the degree of similarity between two words as the research background, the present invention comprehensively analyzes the influence of depth, density and semantic coincidence on concept semantic similarity, integrates them with traditional semantic-distance and information-content similarity algorithms to build a similarity matrix, performs principal component analysis on it to convert the original similarity matrix into a new principal component transformation matrix, calculates the principal component contribution rates, and uses them as weights to obtain the final concept semantic similarity.
Description of drawings
Fig. 1 is the Transformer-based Mongolian-Chinese machine translation framework diagram of the present invention.
Fig. 2 is the diagram of sequence modeling based on the multi-head attention mechanism of the present invention.
Fig. 3 is the "soft" attention model diagram of the present invention.
Fig. 4 is the multi-head attention model diagram of the present invention.
Fig. 5 is the morpheme segmentation flow chart of the present invention.
Fig. 6 is the computation model of weights by the multi-head attention mechanism of the present invention.
Fig. 7 is a schematic diagram of modeling a sequence with a bidirectional RNN.
Fig. 8 is a schematic diagram of modeling a sequence with a multi-layer CNN.
Fig. 9 is the similarity distribution of 65 randomly selected word pairs under the distributed algorithm of comprehensive concept semantic similarity computation of the present invention.
Specific embodiment
Embodiments of the present invention are described in detail below with reference to the accompanying drawings and examples.
In the Transformer-based Mongolian-Chinese machine translation method of the present invention, the Mongolian corpus is first preprocessed. Then, taking the word2vec word-vector models as the research background, a similarity matrix is built by combining the influence of depth, density and semantic coincidence on concept semantic similarity with semantic-distance and information-content similarity algorithms; principal component analysis is then performed, the similarity matrix is converted into a principal component transformation matrix, the principal component contribution rates are calculated and used as weights, and the final concept semantic similarity is obtained. Finally, a Transformer model is used in the translation process, relying entirely on the attention mechanism to draw the global dependencies between input and output and eliminating recurrence and convolution, where the Transformer model is a multi-layer encoder-decoder architecture that uses trigonometric functions for positional encoding and is built on an enhanced multi-head attention mechanism.
Mongolian corpus preprocessing: dictionary-based morpheme segmentation. To perform segmentation, the word-frequency statistics tool OpenNMT.dict is first used to generate a dictionary of the Mongolian corpus. After the dictionary is generated, stems are searched for in the dictionary and collected into a stem table; the part outside the stem table is the corresponding affix part. Based on the stem table and the affix table, a reverse maximum matching algorithm is used to segment each Mongolian word form into morphemes; the segmentation process is shown in Fig. 5. Each Mongolian word to be processed is matched against all dictionary entries one by one; if the word contains a certain entry, it is segmented so that the additional case component is split off, and the Mongolian word is finally separated into two parts: the additional case component as one part and the remainder as the other.
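The following Python sketch illustrates the dictionary-based reverse maximum matching idea described above; the affix table, the romanized tokens and the function name are illustrative assumptions and do not come from the patent's corpus or tooling.

```python
# Minimal sketch of reverse maximum matching against an affix table,
# assuming stem and affix tables have already been built from the corpus
# dictionary. Tokens and table entries are romanized placeholders.

def split_affix(word, affix_table, max_affix_len=6):
    """Split off the longest suffix found in the affix table, if any."""
    upper = min(max_affix_len, len(word) - 1)
    for length in range(upper, 0, -1):        # try the longest suffix first
        suffix = word[-length:]
        if suffix in affix_table:
            return word[:-length], suffix      # (stem part, case/affix part)
    return word, None                          # no match: keep the word whole

affix_table = {"iin", "aas", "dur"}            # hypothetical case components
for token in ["surguuliin", "nomaas", "ger"]:
    stem, affix = split_affix(token, affix_table)
    print(token, "->", stem, "+", str(affix))
```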
After the Mongolian-Chinese bilingual corpus is given a unified encoding, a bilingual dictionary is constructed on this basis. The modeling of the Transformer model comprises building a multi-layer encoder-decoder structure, positional encoding with trigonometric functions and model construction based on the enhanced multi-head attention mechanism, and the training optimization method and regularization strategy of the model are improved.
Regarding the algorithm based on information content, by analyzing the distributed representation of words the present invention finds that the more sub-concepts a concept contains, the less information content the concept carries, and gives a model for computing I for the distributed representation of words, in which h(c) denotes the number of all sub-concept nodes of the concept-word node c, and max_wn is a constant denoting the number of all concept nodes in the semantic classification tree.
The present invention proposes a comprehensive weighting method based on principal component analysis: principal component analysis is introduced into the weight calculation, and the contribution rate of each principal component is used as a weight in the comprehensive weighted similarity calculation, which alleviates the curse of dimensionality and the gradient explosion problem and facilitates fast convergence of the model.
The comprehensive weighting algorithm based on principal component analysis in the present invention consists of three parts: multi-angle similarity computation, similarity matrix extraction, and weight computation based on principal component analysis.
Part I: multi-angle similarity computation
Semantic similarity is analyzed from the aspects of semantic distance, depth and density, semantic coincidence and information content, and the calculation formula of each part of the semantic similarity is given.
(1) Semantic distance
The relationship between semantic distance and similarity is expressed in terms of the following quantities: c_1 and c_2 are the distributed vectors of the two concepts to be compared; a is an adjustable parameter, taken here as the average semantic distance over the compared concept-pair set; and D(c_1, c_2) is the semantic distance between the two concepts, i.e., the shortest path between c_1 and c_2.
(2) Depth and density
The higher the level a node occupies in the semantic tree, the more abstract the concept word it represents; the lower the level, the more specific the concept word. Let the maximum depths of the semantic tree at the nodes of the compared concept words c_1 and c_2 be K_max(c_1) and K_max(c_2), and let the node depths of c_1 and c_2 be K(c_1) and K(c_2); the semantic similarity in terms of depth is computed from these quantities.
In the semantic hierarchy tree, the greater the density of a local region, the more finely that region divides concepts, and the relatively larger the semantic similarity between concept words in the region. The density impact factor of a concept word c is defined using n(c), the number of direct descendants of the concept-word node c taken as a root node, and n(O), the maximum number of direct descendants over the nodes of the sub-semantic-tree O in which node c lies; from these, the calculation formula for the semantic similarity in terms of density between the compared concept words c_1 and c_2 is obtained.
(3) Semantic coincidence
Let the root node of the semantic hierarchy tree be R, and let c_1 and c_2 be any two concept-word nodes. S(c_1) is the number of nodes in the set of nodes passed through from c_1 to the root node R, and S(c_2) is the number of nodes passed through from c_2 to R; S(c_1)∩S(c_2) denotes the set of nodes passed through jointly from c_1 and c_2 to R, and S(c_1)∪S(c_2) denotes the union of the node sets passed through from c_1 to R and from c_2 to R. The similarity in terms of semantic coincidence is then expressed in terms of these two sets.
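As a rough illustration of this component, the Python sketch below scores two concepts by the ratio of their shared ancestor-path nodes to the union of their ancestor-path nodes; this Jaccard-style ratio is an assumption for illustration, built from the sets S(c_1)∩S(c_2) and S(c_1)∪S(c_2) defined above rather than from an exact formula stated in this text.

```python
# Illustrative semantic-coincidence score over a concept hierarchy.
# The ratio |shared ancestors| / |union of ancestors| is an assumed
# Jaccard-style instantiation built from S(c1) and S(c2).

def ancestors(node, parent):
    """Collect the path of nodes from `node` up to the root, inclusive."""
    path = set()
    while node is not None:
        path.add(node)
        node = parent.get(node)
    return path

def coincidence_similarity(c1, c2, parent):
    s1, s2 = ancestors(c1, parent), ancestors(c2, parent)
    return len(s1 & s2) / len(s1 | s2)

# toy hierarchy: R -> animal -> {dog, cat}; R -> plant -> flower
parent = {"animal": "R", "plant": "R", "dog": "animal",
          "cat": "animal", "flower": "plant", "R": None}
print(coincidence_similarity("dog", "cat", parent))     # share "animal" and "R"
print(coincidence_similarity("dog", "flower", parent))  # share only "R"
```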
(4) Information content
In order to define the similarity in terms of information content, an algorithm is proposed to calculate the I value, in which c_1 and c_2 denote the distributed vectors of the two concepts to be compared, I(c_1) denotes the sum of the vector dimensions of all child nodes whose parent node is the concept vector c_1, and I(c_2) denotes the sum of the vector dimensions of all child nodes whose parent node is c_2.
Part II: similarity matrix extraction
Suppose there are n pairs of concept words in the compared concept-pair set, and let x_i = (Ds_i, Ks_i, Zs_i, Ss_i, Is_i) be a vector in the principal-component input sample set, where each dimension represents the result of one part of the comprehensive similarity computation: Ds_i denotes the relationship between the semantic distance and similarity of the i-th element, Ks_i the semantic similarity in terms of depth of the i-th element, Zs_i the density impact factor of the concept word c of the i-th element, Ss_i the similarity in terms of semantic coincidence of the i-th element, and Is_i the similarity in terms of information content of the i-th element.
The similarity matrix is then expressed as
X_sim = (x_i1, x_i2, x_i3, x_i4, x_i5)^T, i = 1, 2, 3, ..., n
Part III: weight computation based on principal component analysis
Principal component analysis is a multivariate statistical method that converts multiple indicators into a few comprehensive indicators while losing little information. The comprehensive indicators obtained are called principal components; each principal component is a linear combination of the original variables, and the principal components are mutually uncorrelated, which gives them certain advantages over the original variables. In the principal component analysis algorithm, the weight of each principal component is assigned according to its contribution rate rather than determined artificially, which overcomes the defect of artificially determined weights in multivariate analysis and makes the result objective and reasonable.
Principal component analysis is performed on the constructed similarity matrix X_sim, and the extracted principal components are
Y = (y_sim1, y_sim2, y_sim3, y_sim4, y_sim5)
With the contribution rates of the principal components (r_1, r_2, r_3, r_4, r_5), the final concept semantic similarity is calculated as
δ_sim = r_1·y_sim1 + r_2·y_sim2 + r_3·y_sim3 + r_4·y_sim4 + r_5·y_sim5
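A compact NumPy sketch of this contribution-rate weighting follows; the toy 65×5 similarity matrix is randomly generated for illustration and is not data from the patent.

```python
# Sketch of contribution-rate-weighted similarity via PCA, assuming an
# n-by-5 similarity matrix whose columns are (Ds, Ks, Zs, Ss, Is).
import numpy as np

def pca_weighted_similarity(X_sim):
    Xc = X_sim - X_sim.mean(axis=0)            # center each similarity column
    cov = np.cov(Xc, rowvar=False)             # 5x5 covariance matrix
    eigval, eigvec = np.linalg.eigh(cov)       # eigenvalues in ascending order
    order = np.argsort(eigval)[::-1]           # sort descending
    eigval, eigvec = eigval[order], eigvec[:, order]
    r = eigval / eigval.sum()                  # contribution rates r_1..r_5
    Y = Xc @ eigvec                            # principal component scores y_sim
    return Y @ r                               # delta_sim = sum_k r_k * y_sim_k

X_sim = np.random.default_rng(0).random((65, 5))   # 65 word pairs, 5 similarity views
print(pca_weighted_similarity(X_sim)[:5])
```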
The multi-layer encoder-decoder structure is constructed as follows:
In the encoder, the output of each sublayer is LayerNorm(x + Sublayer(x)), where LayerNorm() denotes the layer-normalization function, Sublayer() is the function implemented by the sublayer itself with a residual connection based on the multi-head attention mechanism, and x denotes the vector to be input to the current layer. The Mongolian sentence is turned into corresponding vectors using the word2vec vector technique and then used as the input of the first encoder layer. To facilitate the residual connections, all sublayers and the embedding layer produce outputs of dimension d_model = 512.
Fig. 1 illustrates one encoder layer and one decoder layer of the Transformer structure.
Referring to Fig. 1, the Nx on the left represents one encoder layer, which contains two sublayers: the first is a multi-head attention sublayer and the second is a feed-forward sublayer. The input and output of each sublayer are joined by a residual connection, which in theory allows gradients to flow back well. Each sublayer is followed by a regularization step, and the use of regularization accelerates the convergence of the model. The computation of the multi-head attention sublayer is discussed in detail in the model construction based on the enhanced multi-head attention mechanism. The feed-forward sublayer involves two linear transformations and one ReLU nonlinear activation; the specific calculation formula is as follows:
FFN(x) = γ(0, xW_1 + b_1)W_2 + b_2
where x denotes the encoder input information, W_1 the weight corresponding to the input vector, b_1 the bias factor of the multi-head attention mechanism, (0, xW_1 + b_1) the input-layer information of the feed-forward sublayer, W_2 the weight corresponding to the input vector, b_2 the bias factor of the feed-forward function, and γ the nonlinear activation function of the encoder layer. The encoder input information is the vector obtained after adding the positional-encoding information to the embedding-layer information. The input to the feed-forward sublayer is the output of the first sublayer of the encoder.
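A minimal NumPy sketch of this feed-forward sublayer is given below, reading γ(0, ·) as the ReLU max(0, ·); the inner dimension 2048 is an assumed typical setting, not a value stated in the patent.

```python
# Position-wise feed-forward sublayer: two linear maps with a ReLU between,
# reading the patent's gamma(0, x W1 + b1) as ReLU = max(0, x W1 + b1).
import numpy as np

def feed_forward(x, W1, b1, W2, b2):
    hidden = np.maximum(0.0, x @ W1 + b1)   # first linear map + ReLU
    return hidden @ W2 + b2                 # second linear map

d_model, d_ff = 512, 2048                   # d_ff is an assumed default
rng = np.random.default_rng(0)
W1, b1 = rng.normal(size=(d_model, d_ff)) * 0.02, np.zeros(d_ff)
W2, b2 = rng.normal(size=(d_ff, d_model)) * 0.02, np.zeros(d_model)
x = rng.normal(size=(10, d_model))          # 10 token positions
print(feed_forward(x, W1, b1, W2, b2).shape)  # (10, 512)
```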
Referring to Fig. 1, the Nx on the right represents one layer of the decoder, which has three sublayer structures. The first sublayer is a multi-head attention sublayer controlled by a mask matrix, used to model the target-side sentence generated so far; during training a mask matrix is needed so that each multi-head attention computation only attends to the first t-1 words. The second sublayer is a multi-head attention sublayer forming the attention mechanism between the encoder and the decoder, i.e., it looks for relevant semantic information in the source language; this part is computed in the same way as attention in other sequence-to-sequence models, and Transformer uses the dot-product form. The third sublayer is a feed-forward sublayer, identical to the feed-forward sublayer in the encoder. Each sublayer likewise has residual connections and a regularization operation to accelerate model convergence.
The method by which the present invention performs positional encoding with trigonometric functions is as follows:
The way the multi-head attention mechanism models a sequence has neither the sequential character of an RNN nor the structural character of a CNN, but rather the character of a bag of words. Put further, the mechanism treats a sequence as a flat structure: however far apart two words are, their distance in the multi-head attention mechanism appears to be 1. Such a modeling approach actually loses the relative distance relationships between words. For example, the three sentences "the ox ate the grass", "the grass ate the ox" and "ate the grass ox" would be modeled with identical representations for each word.
To alleviate this problem, in the Transformer of the present invention the position of each word in the sentence is mapped to a vector and added into its embedding layer. This idea is not proposed here for the first time; CNN models likewise have the defect of being unable to model relative position (timing information) easily, and Facebook proposed a positional encoding method for them. One direct way is to model the absolute position information directly into the embedding layer, i.e., to map the index i of word W_i to a vector and add it to its embedding layer; the disadvantage of this approach is that it can only model sequences of finite length.
The present invention uses a new way of modeling timing information, namely using the periodicity of trigonometric functions to model the relative positional relationship between words. Specifically, the absolute position is used as the variable of the trigonometric functions, with the following formulas:
PE(pos, 2i) = sin(pos / 10000^(2i/d_model))
PE(pos, 2i+1) = cos(pos / 10000^(2i/d_model))
Here pos is the position and i is the dimension; that is, each dimension of the positional encoding corresponds to a sinusoid, and the wavelengths form a geometric progression from 2π to 10000·2π. The present invention chose this function because it allows the model to learn relative positions easily: for any constant offset k, PE_{pos+k} can be expressed as a linear function of PE_{pos}. d_model is the dimension of the embedding layer after positional encoding, and 2i takes values from a minimum of 0 to a maximum of d_model.
Trigonometric functions have good periodicity: after every certain distance the value of the dependent variable repeats, and this property can be used to model relative distance. On the other hand, the range of trigonometric functions is [-1, 1], which provides suitable values for the embedding-layer elements.
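A short NumPy sketch of this sinusoidal positional encoding follows; the sequence length of 50 is an arbitrary illustrative choice.

```python
# Sinusoidal positional encoding: even dimensions use sin, odd dimensions cos,
# with wavelengths forming a geometric progression from 2*pi to 10000*2*pi.
import numpy as np

def positional_encoding(seq_len, d_model):
    pos = np.arange(seq_len)[:, None]                 # (seq_len, 1)
    i = np.arange(0, d_model, 2)[None, :]             # even dimension indices
    angle = pos / np.power(10000.0, i / d_model)      # (seq_len, d_model/2)
    pe = np.zeros((seq_len, d_model))
    pe[:, 0::2] = np.sin(angle)                       # PE(pos, 2i)
    pe[:, 1::2] = np.cos(angle)                       # PE(pos, 2i+1)
    return pe

pe = positional_encoding(seq_len=50, d_model=512)
print(pe.shape)   # (50, 512): added to the word embeddings
```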
The method of model construction based on the enhanced multi-head attention mechanism is as follows:
Fig. 2 illustrates the sequence modeling method based on the multi-head attention mechanism. Note that, to keep the figure clear, some connecting lines are omitted: every word in the "source-language sentence vector" layer (i.e., the source-language morpheme vectors) is fully connected to the nodes in the first multi-head attention layer, and the nodes between the first and second multi-head attention layers are likewise fully connected. It can be seen that in this modeling method the interaction distance between any two words is 1, regardless of the relative distance between the words. In this way, the semantics of each word are determined by considering its relationship with all the words in the whole sentence, and the multi-head attention mechanism makes this global interaction more elaborate and able to capture more information.
In summary, when modeling sequence problems the multi-head attention mechanism can capture long-distance dependency knowledge and has a better theoretical basis.
The mathematical formulation of the multi-head attention mechanism is described below, starting from the attention mechanism itself.
1. Attention mechanism (model)
When a neural network processes a large amount of input information, it can, borrowing from the attention mechanism of the human brain, select only some key pieces of information for processing in order to improve its efficiency. In current neural network models, max pooling and gating mechanisms can be viewed approximately as bottom-up, saliency-based attention mechanisms. Besides these, top-down focused attention is also an effective information selection method. Take reading comprehension as an example: given a long article, a question is then asked about its content. The question is related to only one or two sentences in a paragraph, and the rest is irrelevant. To reduce the computational burden of the neural network, only the relevant passages need to be picked out and handed to the subsequent network, instead of feeding the entire article into it.
Use x_{1:N} = [x_1, ..., x_N] to denote N pieces of input information. To save computing resources, not all N inputs need to be fed into the neural network for computation; only some task-relevant information needs to be selected from x_{1:N}. Given a task-related query vector q, we use an attention variable z ∈ [1, N] to denote the index position of the selected information, i.e., z = i means the i-th piece of input information is selected. For ease of computation, a "soft" information selection mechanism is adopted: first, given q and x_{1:N}, the probability α_i of selecting the i-th piece of input information is computed as
α_i = softmax(s(x_i, q)) = exp(s(x_i, q)) / Σ_{j=1}^{N} exp(s(x_j, q))
where s(x_i, q) is the scoring function, which can be computed in any of the following three ways:
Additive model: s(x_i, q) = v^T tanh(W x_i + U q)
Dot-product model: s(x_i, q) = x_i^T q
Multiplicative model: s(x_i, q) = x_i^T W q
where W, U and v are learnable network parameters and T denotes matrix transposition.
The attention distribution α_i can be interpreted as the degree to which the i-th piece of information is attended to given the context query q. With this "soft" information selection mechanism, the input information is encoded as the weighted sum att(x_{1:N}, q) = Σ_{i=1}^{N} α_i x_i.
Fig. 3 gives an example of the "soft" attention mechanism.
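A small NumPy sketch of this soft attention selection, using the dot-product scoring function, might look as follows; the toy input sizes are illustrative.

```python
# "Soft" attention over N input vectors: score each input against the query,
# normalize the scores with softmax, and return the weighted sum of inputs.
import numpy as np

def soft_attention(X, q):
    scores = X @ q                              # dot-product scoring s(x_i, q)
    alpha = np.exp(scores - scores.max())       # numerically stable softmax
    alpha = alpha / alpha.sum()                 # attention distribution alpha_i
    return alpha @ X, alpha                     # encoded vector and weights

rng = np.random.default_rng(0)
X = rng.normal(size=(6, 8))                     # N=6 inputs of dimension 8
q = rng.normal(size=8)                          # task-related query vector
encoded, alpha = soft_attention(X, q)
print(alpha.round(3), encoded.shape)
```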
2. Variants of the attention mechanism
2.1 Key-value attention
More generally, the input information can be represented in key-value pair format, where the "key" K is used to compute the attention distribution α_i and the "value" V is used to generate the selected information. With (k, v)_{1:N} = [(k_1, v_1), ..., (k_N, v_N)] denoting the N pieces of input information and q the task-related query vector, the attention function weights each value by the normalized score of its key, att((K, V), q) = Σ_{i=1}^{N} α_i v_i with α_i = softmax(s(k_i, q)),
where s(k_i, q) denotes the scoring function.
Fig. 4 gives an example of the key-value attention mechanism. If k_i = v_i in the key-value mode, it is equivalent to the ordinary attention mechanism.
2.2 Scaled dot-product attention
The scaled dot-product attention algorithm is described in terms of the key-value pairs K-V and the query vector q, which is rather abstract; here we assume that the "key" K and the "value" V in the key-value pairs correspond to the same vectors, i.e., K = V, as shown in Fig. 6, and the query vector q corresponds to the word vectors of the target sentence.
The specific operation has three steps:
1. Each query vector q makes a dot product with the "keys" K.
2. Softmax is then used to normalize them, keeping the probability values within the interval [0, 1].
3. The result is finally multiplied by the "values" V to give the attention vector.
The mathematical expression is as follows:
Attention(Q, K, V) = softmax(QK^T / √d_k) V
where √d_k is the scaling factor and T denotes matrix transposition.
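The following NumPy sketch implements the three steps above; the toy matrix sizes are illustrative.

```python
# Scaled dot-product attention: softmax(Q K^T / sqrt(d_k)) V.
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    d_k = K.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)                          # step 1: scaled dot products
    scores = scores - scores.max(axis=-1, keepdims=True)     # stabilize the softmax
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=-1, keepdims=True)  # step 2: softmax in [0, 1]
    return weights @ V                                       # step 3: weighted sum of values

rng = np.random.default_rng(0)
Q = rng.normal(size=(4, 64))     # 4 target positions (queries)
K = rng.normal(size=(6, 64))     # 6 source positions (keys)
V = rng.normal(size=(6, 64))     # values aligned with the keys
print(scaled_dot_product_attention(Q, K, V).shape)  # (4, 64)
```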
2.3 Multi-head attention
Multi-head attention uses multiple queries q_{1:M} = {q_1, ..., q_M} to select multiple pieces of information from the input in parallel, with each head attending to a different part of the input information.
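A compact sketch of the multi-head computation follows: the model dimension is split across several heads, scaled dot-product attention is run per head, and the head outputs are concatenated. The head count, the random stand-in projection matrices and the omission of the final output projection are simplifying assumptions for illustration.

```python
# Multi-head attention sketch: project Q, K, V into h heads, apply scaled
# dot-product attention in each head, and concatenate the head outputs.
# The projection matrices below are random stand-ins for learned parameters.
import numpy as np

def multi_head_attention(Q, K, V, h=8):
    d_model = Q.shape[-1]
    d_k = d_model // h
    rng = np.random.default_rng(0)
    heads = []
    for _ in range(h):
        Wq, Wk, Wv = (rng.normal(size=(d_model, d_k)) * 0.02 for _ in range(3))
        q, k, v = Q @ Wq, K @ Wk, V @ Wv               # per-head projections
        scores = q @ k.T / np.sqrt(d_k)
        w = np.exp(scores - scores.max(axis=-1, keepdims=True))
        w = w / w.sum(axis=-1, keepdims=True)
        heads.append(w @ v)                            # one head's output
    return np.concatenate(heads, axis=-1)              # concatenate back to d_model

rng = np.random.default_rng(1)
x = rng.normal(size=(5, 512))                          # 5 positions, d_model = 512
print(multi_head_attention(x, x, x).shape)             # (5, 512) self-attention
```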
The method by which the present invention improves the training optimization method and regularization strategy of the model is as follows:
The model is trained with the Adam method, and the present invention adopts a warm-up learning rate adjustment method, as shown in the formula:
lrate = d_model^(-0.5) · min(step_num^(-0.5), step_num · warmup_steps^(-1.5))
The formula means that training requires a hyperparameter warmup_steps to be preset.
A. When the number of training steps step_num is smaller than this value, the learning rate is determined by the second term inside the brackets, which is a linear function of the variable step_num with positive slope.
B. When step_num is greater than warmup_steps, the learning rate is determined by the first term inside the brackets, which is a power function with a negative exponent.
So on the whole the learning rate first rises and then declines, which facilitates the fast convergence of the model.
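A small sketch of this schedule, assuming the standard form reconstructed above and a warm-up of 4000 steps (an assumed typical value):

```python
# Warm-up learning rate schedule: linear growth for the first warmup_steps,
# then decay proportional to the inverse square root of the step number.
def warmup_lr(step_num, d_model=512, warmup_steps=4000):
    step_num = max(step_num, 1)
    return d_model ** -0.5 * min(step_num ** -0.5,
                                 step_num * warmup_steps ** -1.5)

for s in (100, 4000, 20000):
    print(s, round(warmup_lr(s), 6))   # rises, peaks near warmup_steps, then decays
```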
Two important regularization methods are also used in the model. One is the common dropout method, applied after each sublayer and in the attention computation. The other is label smoothing: during training, when the cross entropy is computed, the target is no longer the one-hot standard answer; instead a small non-zero value is also placed at every position that would otherwise be 0. This enhances the robustness of the model and raises its BLEU value.
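A minimal sketch of such a label-smoothed cross entropy is shown below; the smoothing value 0.1 and the tiny vocabulary are illustrative assumptions.

```python
# Label smoothing: replace the one-hot target with (1 - eps) on the correct
# class and eps / (V - 1) on every other class before the cross entropy.
import numpy as np

def smoothed_cross_entropy(logits, target_idx, eps=0.1):
    vocab = logits.shape[-1]
    log_probs = logits - logits.max() - np.log(np.exp(logits - logits.max()).sum())
    target = np.full(vocab, eps / (vocab - 1))   # small mass on wrong classes
    target[target_idx] = 1.0 - eps               # most mass on the true class
    return -(target * log_probs).sum()

logits = np.array([2.0, 0.5, -1.0, 0.1])
print(smoothed_cross_entropy(logits, target_idx=0))
```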
In summary, the Transformer-based sequence modeling method of the present invention still follows the classical encoder-decoder structure of sequence-to-sequence models; the difference is that it uses the multi-head attention mechanism instead of an RNN or CNN as the sequence modeling mechanism. The theoretical advantage of the multi-head attention mechanism is that it captures "long-distance dependency information" more easily. So-called "long-distance dependency information" can be understood as follows: 1) a word is in fact a symbol that can express diverse semantic information (the ambiguity problem); 2) the meaning of a word is determined by the context in which it appears (disambiguation by context); 3) some words may need only a small context window to determine their meaning (short-distance dependency), while others may need a large context window (long-distance dependency).
For example, consider the following two sentences:
"There are many 杜鹃 on the mountain; when spring comes, they bloom all over the hills and are very beautiful."
"There are many 杜鹃 on the mountain; when spring comes, their calls echo all over the hills, very melodious."
In these two sentences, "杜鹃" refers to a flower (azalea) and a bird (cuckoo) respectively. In machine translation, without looking at words far away from it, it is difficult to translate the word "杜鹃" correctly. This example makes the long-distance dependency between words obvious. Of course, the meaning of most words can be determined within a small contextual window, and cases like the example above account for a relatively small proportion of language. What we hope is that the model can learn short-distance dependency knowledge well and can also learn long-distance dependency knowledge.
The multi-head attention mechanism in the Transformer of the present invention can theoretically better capture both long- and short-distance dependency knowledge. Below, the three sequence modeling methods based on RNN, CNN and Transformer are compared in terms of the interaction distance between any two words.
Fig. 7 shows the method of modeling a sequence with a bidirectional RNN. Since the elements of the sequence are processed in order, the interaction distance between two words can be regarded as the relative distance between them; the interaction distance between W1 and Wn is n-1. In theory an RNN model with a gating mechanism can selectively store and forget historical information and performs better than a plain RNN structure, but with a fixed number of gating parameters this ability is limited. As sentences grow longer and relative distances increase, there is a clear theoretical upper limit.
Fig. 8 illustrates the method of modeling a sequence with a multi-layer CNN. The semantic context covered by the CNN units of the first layer is small; the context covered by the second layer becomes larger, and so on: the deeper the CNN unit, the larger the context it covers. A word first interacts with nearby words at the bottom CNN units and then interacts with somewhat more distant words at higher CNN units. So the multi-layer CNN structure embodies a local-to-global feature extraction process in which the interaction distance between words is proportional to their relative distance. Words that are far apart can only meet at higher CNN nodes before interacting, and this process may lose more information.
The sequence modeling method based on the multi-head attention mechanism of the present invention, shown in Fig. 2, is clearly superior to these two approaches and can capture more information.
Below is a specific Mongolian-Chinese translation example.
Experiments were conducted on a 1.2-million-sentence-pair Mongolian-Chinese parallel corpus as the dataset to verify the effect of the present invention.
Aiming at the serious data sparsity that appears in the Mongolian corpus, three processing methods were applied: affix segmentation, stem segmentation and segmentation of the additional case components, where the granularity of affix segmentation is smaller, the granularity of stem segmentation is larger, and the segmentation of the additional case components is similar to stem segmentation but with an even larger granularity.
The present invention tests these three segmentation methods on the corpus separately; the experimental results are shown in Table 1.
Table 1
The experimental results in the table show that all segmentation methods improve translation quality. Stem segmentation raises the BLEU value by 1.02; although the improvement from segmenting the additional case components alone is not obvious, when it acts together with stem segmentation the BLEU gain reaches 1.14. The reason why affix segmentation does not perform as well as stem segmentation is believed to be mainly that affix segmentation is too fine-grained, so that sentence length increases considerably after segmentation, and since neural machine translation is weaker at handling long sentences the effect suffers. After the distributed-representation comprehensive concept semantic similarity computation is added, BLEU improves by 5.88. We then randomly selected 65 word pairs and established a coordinate system with the word pair and the similarity value as coordinates to analyze the distribution of the similarity values computed by the distributed comprehensive concept semantic similarity algorithm; as can be seen from Fig. 9, the continuity obtained by this algorithm is relatively good, which shows that the similarity values calculated by the algorithm here correlate well with manually assigned similarity scores. Finally, after the two preprocessing steps above, the preprocessed data were divided in certain proportions into a training set, a validation set and a test set and fed into the Transformer model for training; the BLEU value improves by 10.16, and the training effect is clearly better than that of an RNN.

Claims (10)

Translated fromChinese
1.一种基于Transformer的增强语义特征信息的蒙汉机器翻译方法,其特征在于,在翻译过程中采用Transformer模型,所述Transformer模型为利用三角函数进行位置编码并基于增强型多头注意力机制构建的多层编码器-解码器架构,从而完全依赖于注意力机制来绘制输入和输出之间的全局依赖关系,消除递归和卷积。1. a Mongolian-Chinese machine translation method based on the enhanced semantic feature information of Transformer, is characterized in that, adopts Transformer model in translation process, and described Transformer model is to utilize trigonometric function to carry out position coding and build based on enhanced multi-head attention mechanism The multi-layer encoder-decoder architecture, which completely relies on the attention mechanism to map the global dependencies between input and output, eliminates recursion and convolution.2.根据权利要求1所述基于Transformer的增强语义特征信息的蒙汉机器翻译方法,其特征在于,在翻译之前,先对数据进行预处理,所述对数据进行预处理是对蒙文语料中的词干、词缀和格的附加成分进行切割分离,以降低数据的稀疏性,同时找出蒙文在词干、词缀以及格的附加成分的语言特征,并将这些语言特征融入到训练之中。2. the Mongolian-Chinese machine translation method based on the enhanced semantic feature information of Transformer according to claim 1, is characterized in that, before translating, data is preprocessed earlier, and described data is preprocessed to be in the Mongolian language corpus. The stems, affixes and additional components of the case are cut and separated to reduce the sparsity of the data, and at the same time, the language features of the Mongolian language in the stems, affixes and additional components of the case are found, and these language features are integrated into the training. .3.根据权利要求2所述基于Transformer的增强语义特征信息的蒙汉机器翻译方法,其特征在于,所述切割分离包括小粒度的词缀切分、大粒度的词干切分以及小规模的格的附加成分切分。3. the Mongolian-Chinese machine translation method based on the enhanced semantic feature information of Transformer according to claim 2, is characterized in that, described cutting and separation comprises the affix segmentation of small granularity, the stem segmentation of large granularity and the lattice of small scale of additional ingredients.4.根据权利要求1所述基于Transformer的增强语义特征信息的蒙汉机器翻译方法,其特征在于,对数据进行预处理后,综合深度、密度、语义重合度对概念语义相似度的影响,集成语义距离与信息内容的相似度算法建立相似度矩阵,然后进行主成分分析,将相似度矩阵转换成主成分变换矩阵,计算主成分贡献率,并将其作为权值进行加权处理,得到最终的概念语义相似度。4. the Mongolian-Chinese machine translation method based on the enhanced semantic feature information of Transformer according to claim 1, it is characterized in that, after data is preprocessed, the impact of comprehensive depth, density, semantic coincidence degree on concept semantic similarity, integrated The similarity algorithm between semantic distance and information content establishes a similarity matrix, and then performs principal component analysis, converts the similarity matrix into a principal component transformation matrix, calculates the principal component contribution rate, and uses it as a weight for weighting to obtain the final Concept semantic similarity.5.根据权利要求4所述基于Transformer的增强语义特征信息的蒙汉机器翻译方法,其特征在于,所述相似度矩阵的公式表示为5. 
the Mongolian-Chinese machine translation method based on the enhanced semantic feature information of Transformer according to claim 4, is characterized in that, the formula of described similarity matrix is expressed asXsim=(xi1,xi2,xi3,xi4,xi5)T,i=1,2,3,…,nXsim =(xi1 ,xi2 ,xi3 ,xi4 ,xi5 )T ,i=1,2,3,...,n所述最终的概念语义相似度计算表示公式为The final concept semantic similarity calculation formula is as followsδsim=r1ysim1+r2ysim2+r3ysim3+r4ysim4+r5ysim5δsim =r1 ysim1 +r2 ysim2 +r3 ysim3 +r4 ysim4 +r5 ysim5其中,Xsim表示相似度矩阵,xi1表示Dsxi2表示Ks,xi3表示Zsxi4表示Ssxi5表示Isn是被比较概念对集合中的概念词的对数,xi=(Dsi,Ksi,Zsi,Ssi,Isi),为主成分输入样本集合中的一个向量,其中每一维变量分别代表综合相似度计算模块中各部分语义相似度计算的结果,Dsi表示向量中第i维元素的语义距离与相似度之间的关系,Ksi表示向量中第i维元素的深度方面的语义相似度,Zsi表示向量中第i维元素的概念词c的密度影响因子,Ssi表示向量中第i维元素的语义重合度方面的相似度,Isi表示向量中第i维元素的信息内容方面的相似度;δsim表示概念语义相似度,ysim1,ysim2,ysim3,ysim4,ysim5为对相似度矩阵Xsim进行主成分分析所提取出的主成分,r1,r2,r3,r4,r5表示各主成分贡献率。Among them, Xsim represents the similarity matrix, xi1 represents Ds , xi2 represents Ks , xi3 represents Zs , xi4 represents Ss , xi5 means Is , n is the logarithm of the concept words in the set of compared concept pairs,xi = (Dsi , Ksi , Zsi , Ssi , Isi ), a vector in the principal component input sample set, where each dimension The variables respectively represent the results of the semantic similarity calculation of each part in the comprehensive similarity calculation module, Dsi represents the relationship between the semantic distance and similarity of the i-th dimension element in the vector, and Ksi represents the depth aspect of the i-th dimension element in the vector The semantic similarity of , Zsi represents the density influence factor of the concept word c of the ith dimension element in the vector, Ssi represents the similarity in terms of semantic coincidence of the ith dimension element in the vector, Isi represents the ith dimension element in the vector The similarity in terms of information content; δsim represents the conceptual semantic similarity, ysim1 , ysim2 , ysim3 , ysim4 , ysim5 are the principal components extracted from the principal component analysis of the similarity matrix Xsim , r1 , r2 , r3 , r4 , and r5 represent the contribution rate of each principal component.6.根据权利要求1所述基于Transformer的增强语义特征信息的蒙汉机器翻译方法,其特征在于,所述多头注意力机制描述为查询和一组键值对映射到输出,其中查询、键、值和输出都是向量,输出被计算为值的加权和,分配给每个值的权重由查询与相应密钥的兼容性函数计算得到。6. the Mongolian-Chinese machine translation method based on the enhanced semantic feature information of Transformer according to claim 1, is characterized in that, described multi-head attention mechanism is described as query and a group of key-value pairs are mapped to output, wherein query, key, Both the value and the output are vectors, the output is computed as a weighted sum of the values, and the weight assigned to each value is computed by the query's compatibility function with the corresponding key.7.根据权利要求1所述基于Transformer的增强语义特征信息的蒙汉机器翻译方法,其特征在于,7. 
the Mongolian-Chinese machine translation method based on the enhanced semantic feature information of Transformer according to claim 1, is characterized in that,所述编码器由N个相同的层组成,每层有两个子层,第一个子层是多头注意力子层,第二个子层是前向传播子层,每个子层的输入和输出都存在着残差连接,每个子层的后面跟着一步正则化操作,以加快模型收敛;The encoder consists of N identical layers, each with two sub-layers, the first sub-layer is a multi-head attention sub-layer, the second is a forward-propagation sub-layer, and the input and output of each sub-layer are There are residual connections, and each sublayer is followed by a step of regularization to speed up model convergence;所述解码器由N个相同的层组成,每层有三个子层,第一个子层是mask矩阵控制的多头注意力子层,用来建模已经生成的目标端句子,在训练的过程中,以一个mask矩阵控制每次多头注意力计算时只计算到前t-1个词;第二个子层是多头注意力子层,是编码器和解码器之间的注意力机制,即去源语言中找相关的语义信息;第三个子层是前向传播子层,与编码器中的前向传播子层完全一致,每个子层的输入和输出都存在着残差连接,并后跟一步正则化操作,以加快模型收敛。The decoder consists of N identical layers, each of which has three sub-layers. The first sub-layer is a multi-head attention sub-layer controlled by the mask matrix, which is used to model the generated target-side sentences. During the training process , use a mask matrix to control each multi-head attention calculation to only calculate the first t-1 words; the second sub-layer is the multi-head attention sub-layer, which is the attention mechanism between the encoder and the decoder, that is, the source Find relevant semantic information in the language; the third sublayer is the forward propagation sublayer, which is exactly the same as the forward propagation sublayer in the encoder. The input and output of each sublayer have residual connections, followed by a step of regularization. operation to speed up model convergence.8.根据权利要求1或7所述基于Transformer的增强语义特征信息的蒙汉机器翻译方法,其特征在于,通过如下方法构建多层编码器-解码器架构:8. the Mongolian-Chinese machine translation method based on the enhanced semantic feature information of Transformer according to claim 1 or 7, is characterized in that, builds multi-layer encoder-decoder architecture by the following method:编码器中,每个子层的输出是LayerNorm(x+Sublayer(x)),其中LayerNorm()表示层归一化函数,Sublayer()使用基于多头注意力机制的残差连接的子层本身实现的函数,x表示当前层要输入的向量,将蒙语句子利用word2vec向量技术生成相对应的向量,然后作为第一层编码器的输入,即Sublayer(x)是由基于多头注意力机制的子层本身实现的功能,为了促进残差连接,所有子层以及嵌入层产生维度dmodel=512的输出。In the encoder, the output of each sublayer is LayerNorm(x+Sublayer(x)), where LayerNorm() represents the layer normalization function, and Sublayer() is implemented by the sublayer itself using the residual connection based on the multi-head attention mechanism Function, x represents the vector to be input by the current layer, and the corresponding vector is generated by using the word2vec vector technology for the Mongolian sentence, and then used as the input of the first layer encoder, that is, Sublayer(x) is composed of a sublayer based on a multi-head attention mechanism. A function implemented by itself, in order to facilitate residual connections, all sub-layers as well as the embedding layer produce outputs of dimensiondmodel = 512.9.根据权利要求1或7所述基于Transformer的增强语义特征信息的蒙汉机器翻译方法,其特征在于,9. the Mongolian-Chinese machine translation method based on the enhanced semantic feature information of Transformer according to claim 1 or 7, is characterized in that,所述编码器的前向传播子层实现中有两次线性变换,一次Relu非线性激活,具体计算公式如下:There are two linear transformations and one Relu nonlinear activation in the forward propagation sublayer implementation of the encoder. 
9. The Mongolian-Chinese machine translation method based on enhanced semantic feature information of Transformer according to claim 1 or 7, characterized in that the forward-propagation sub-layer of the encoder performs two linear transformations with one ReLU non-linear activation, computed as

FFN(x) = γ(0, xW_1 + b_1)W_2 + b_2

where x is the encoder input, W_1 is the weight applied to the input vector, b_1 is the bias factor of the multi-head attention mechanism, (0, xW_1 + b_1) is the input to the forward-propagation sub-layer, W_2 is the weight applied to the corresponding input vector, b_2 is the bias factor of the forward-propagation function, and γ is the non-linear activation function of the encoder layer.

10. The Mongolian-Chinese machine translation method based on enhanced semantic feature information of Transformer according to claim 1 or 7, characterized in that positional encoding with trigonometric functions takes the absolute position as the variable of the trigonometric functions, according to the formulas

PE(pos, 2i) = sin(pos / 10000^(2i / d_model))
PE(pos, 2i+1) = cos(pos / 10000^(2i / d_model))

where pos is the position and i is the dimension, i.e. each dimension of the positional encoding corresponds to a sinusoid, with wavelengths forming a geometric progression from 2π to 10000·2π; d_model is the dimension of the embedding layer after positional encoding, and 2i ranges from a minimum of 0 to a maximum of d_model.
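A small NumPy sketch of the two formulas above, for illustration only; the activation γ is assumed to be ReLU, the positional encoding is assumed to follow the standard sinusoidal form of the cited Transformer paper, and the weights are random placeholders rather than trained parameters:

```python
# Minimal sketch of the forward-propagation sub-layer (claim 9) and the
# sinusoidal positional encoding (claim 10). All names are illustrative.
import numpy as np

def ffn(x, W1, b1, W2, b2):
    # FFN(x) = γ(0, x·W1 + b1)·W2 + b2, with γ realised as ReLU = max(0, ·)
    return np.maximum(0.0, x @ W1 + b1) @ W2 + b2

def positional_encoding(max_len, d_model=512):
    pos = np.arange(max_len)[:, None]              # positions 0 .. max_len-1
    i = np.arange(0, d_model, 2)[None, :]          # even dimensions 2i
    angle = pos / np.power(10000.0, i / d_model)   # wavelengths from 2π to 10000·2π
    pe = np.zeros((max_len, d_model))
    pe[:, 0::2] = np.sin(angle)                    # PE(pos, 2i)   = sin(pos / 10000^(2i/d_model))
    pe[:, 1::2] = np.cos(angle)                    # PE(pos, 2i+1) = cos(pos / 10000^(2i/d_model))
    return pe

d_model, d_ff = 512, 2048
rng = np.random.default_rng(0)
x = rng.standard_normal((10, d_model))             # 10 token vectors
W1, b1 = 0.02 * rng.standard_normal((d_model, d_ff)), np.zeros(d_ff)
W2, b2 = 0.02 * rng.standard_normal((d_ff, d_model)), np.zeros(d_model)
print(ffn(x, W1, b1, W2, b2).shape)                # (10, 512)
print(positional_encoding(50).shape)               # (50, 512)
```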
CN201811231017.2A | 2018-10-22 | 2018-10-22 | A kind of illiteracy Chinese machine translation method of the enhancing semantic feature information based on Transformer | Pending | CN109492232A (en)

Priority Applications (1)

Application Number | Priority Date | Filing Date | Title
CN201811231017.2A (CN109492232A) | 2018-10-22 | 2018-10-22 | A kind of illiteracy Chinese machine translation method of the enhancing semantic feature information based on Transformer

Applications Claiming Priority (1)

Application Number | Priority Date | Filing Date | Title
CN201811231017.2A (CN109492232A) | 2018-10-22 | 2018-10-22 | A kind of illiteracy Chinese machine translation method of the enhancing semantic feature information based on Transformer

Publications (1)

Publication Number | Publication Date
CN109492232A | 2019-03-19

Family

ID=65692441

Family Applications (1)

Application Number | Status | Publication | Title
CN201811231017.2A | Pending | CN109492232A (en) | A kind of illiteracy Chinese machine translation method of the enhancing semantic feature information based on Transformer

Country Status (1)

Country | Link
CN (1) | CN109492232A (en)

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
CN105957518A (en)* | 2016-06-16 | 2016-09-21 | 内蒙古大学 | Mongolian large vocabulary continuous speech recognition method
CN107967262A (en)* | 2017-11-02 | 2018-04-27 | 内蒙古工业大学 | A kind of neutral net covers Chinese machine translation method
CN108681539A (en)* | 2018-05-07 | 2018-10-19 | 内蒙古工业大学 | A kind of illiteracy Chinese nerve interpretation method based on convolutional neural networks

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Ashish Vaswani et al., "Attention Is All You Need", 31st Conference on Neural Information Processing Systems (NIPS 2017) *
Wang Tong et al., "A comprehensive concept semantic similarity calculation method in WordNet" (WordNet中的综合概念语义相似度计算方法), Journal of Beijing University of Posts and Telecommunications (北京邮电大学学报) *

Legal Events

Date | Code | Title | Description
| PB01 | Publication |
| SE01 | Entry into force of request for substantive examination |
| RJ01 | Rejection of invention patent application after publication | Application publication date: 2019-03-19
