CN119336900B

Movatterモバイル変換

Info

Publication number: CN119336900B
Application number: CN202411883290.9A
Authority: CN
Inventors: 彭晗; 阮日青; 周语杰; 黄宇凡; 张金传; 刘星宝; 李沁; 任剑
Original assignee: Xiangjiang Laboratory
Current assignee: Xiangjiang Laboratory
Priority date: 2024-12-19
Filing date: 2024-12-19
Publication date: 2025-03-11
Anticipated expiration: 2044-12-19
Also published as: CN119336900A

Abstract

Translated fromChinese

本发明实施例中提供了一种基于层级专家路由模型与CoT推理的检索优化方法，属于数据处理技术领域，具体包括：创建分层知识库；提取分层知识库中每层各个片段摘要并聚类，创建分层语义库；使用分层语义库中各个中心摘要构建层级索引库并结合智能路由机制形成层级专家路由模型；利用层级专家路由模型确定层级索引库中与问题匹配的目标层级索引；将向量化后的问题与目标层级索引进行相似度计算；返回相似度最高的分层语义库下原始文档的片段；利用CoT推理驱动层级切换，最终合并片段返回大模型，生成问题对应的答案。通过本发明的方案，提高了用户查询的响应速度和准确性，实现了检索路径的自适应调整，提升了复杂查询的检索效率和适应性。

The embodiment of the present invention provides a retrieval optimization method based on a hierarchical expert routing model and CoT reasoning, which belongs to the field of data processing technology, and specifically includes: creating a hierarchical knowledge base; extracting and clustering the abstracts of each fragment in each layer of the hierarchical knowledge base to create a hierarchical semantic base; using the central abstracts in the hierarchical semantic base to build a hierarchical index base and combining it with an intelligent routing mechanism to form a hierarchical expert routing model; using the hierarchical expert routing model to determine the target hierarchical index that matches the question in the hierarchical index base; calculating the similarity between the vectorized question and the target hierarchical index; returning the fragment of the original document under the hierarchical semantic base with the highest similarity; using CoT reasoning to drive hierarchical switching, and finally merging the fragments to return to the large model to generate the answer corresponding to the question. Through the scheme of the present invention, the response speed and accuracy of user queries are improved, the adaptive adjustment of the retrieval path is realized, and the retrieval efficiency and adaptability of complex queries are improved.