CN120012790A

Movatterモバイル変換

Info

Publication number: CN120012790A
Application number: CN202510494841.0A
Authority: CN
Inventors: 胡俊勇; 王亚平; 刘卫刚; 李文龙; 马满庄; 杨金林; 宋粉; 李娜
Original assignee: Shaanxi Tirain Technology Co ltd
Current assignee: Shaanxi Tirain Technology Co ltd
Priority date: 2025-04-21
Filing date: 2025-04-21
Publication date: 2025-05-16
Anticipated expiration: 2045-04-21
Also published as: CN120012790B

Abstract

Translated fromChinese

本申请公开了一种集成人工智能与多语言音节切分的高精度地名翻译方法，涉及地名翻译技术领域，其首先将输入地名拆分为子词Token序列，利用词嵌入模型的上下文编码模型实现待翻译地名语义上下文关联编码；随后通过局部语义关联性重构强化的方式建模待翻译地名Token上下文语义中的内部结构化信息和依赖关系，识别复合词、黏着语中的隐性音节关联，并对音节边界附近的局部形态特征进行增强，以解决不规则拼写与跨语言干扰问题，进而解码输出音节切分后的地名字符串来进行地名翻译，生成相应的地名目标语言翻译文本，实现从原始地名到目标译名的端到端有效转换。

The present application discloses a high-precision place name translation method integrating artificial intelligence and multilingual syllable segmentation, which relates to the field of place name translation technology. The method first splits the input place name into a subword Token sequence, and uses the context encoding model of the word embedding model to realize the semantic context association encoding of the place name to be translated; then, the internal structured information and dependency relationship in the context semantics of the place name Token to be translated are modeled by means of local semantic association reconstruction and reinforcement, the implicit syllable association in compound words and agglutinative languages is identified, and the local morphological features near the syllable boundary are enhanced to solve the problems of irregular spelling and cross-language interference, and then the place name character string after the output syllable segmentation is decoded to perform place name translation, and the corresponding place name target language translation text is generated, so as to realize the end-to-end effective conversion from the original place name to the target translated name.