CN118536572B

Movatterモバイル変換

Info

Publication number: CN118536572B
Application number: CN202410942681.7A
Authority: CN
Inventors: 戴毅; 李玉; 丁振强; 曹其顺; 岳杨
Original assignee: Shenzhen Xunfang Technology Co ltd
Current assignee: Shenzhen Xunfang Technology Co ltd
Priority date: 2024-07-15
Filing date: 2024-07-15
Publication date: 2024-11-15
Anticipated expiration: 2044-07-15
Also published as: CN118536572A

Abstract

Translated fromChinese

本申请实施例涉及智能对话技术领域，公开了一种训练对话模型的方法，该方法包括：获取若干组对话样本，对话样本包括至少一个问题语句和至少一个回复语句，其中，最后一个回复语句标注为真实标签；从对话样本中分层面获取全局层面的全局文本和说话者层面的说话者文本；将全局文本和说话者文本输入生成网络进行先编码后解码处理，得到预测标签；计算预测标签和真实标签之间的损失，并根据若干组对话样本对应的损失和，对生成网络进行迭代训练，直至收敛，得到对话模型。如此，使得生成网络更好地理解对话样本的上下文逻辑关系，能够有效提高训练得到的对话模型回复的准确性和连贯性。

The embodiment of the present application relates to the field of intelligent dialogue technology, and discloses a method for training a dialogue model, which includes: obtaining several groups of dialogue samples, the dialogue samples include at least one question sentence and at least one reply sentence, wherein the last reply sentence is marked as a true label; obtaining the global text at the global level and the speaker text at the speaker level from the dialogue samples in layers; inputting the global text and the speaker text into the generation network for encoding and then decoding to obtain the predicted label; calculating the loss between the predicted label and the true label, and iteratively training the generation network according to the sum of the losses corresponding to several groups of dialogue samples until convergence to obtain a dialogue model. In this way, the generation network can better understand the contextual logical relationship of the dialogue sample, and can effectively improve the accuracy and coherence of the dialogue model response obtained by training.