CN118964527B


Info

Publication number: CN118964527B
Application number: CN202411433147.XA
Authority: CN
Inventors: 陈宇斯
Original assignee: Newchuan Technology Co ltd
Current assignee: Newchuan Technology Co ltd
Priority date: 2024-10-15
Filing date: 2024-10-15
Publication date: 2025-01-17
Anticipated expiration: 2044-10-15
Also published as: CN118964527A

Abstract


The present application relates to the field of natural language processing, and specifically to a method and device for analyzing the content of electronic case files in a smart court. The method comprises: obtaining an order difference measure for each keyword shared between different electronic case files, based on the differences between the sentences that contain that keyword in each file; constructing a measurement coefficient for each keyword in each electronic case file by combining this measure with the keyword's TF-IDF value and part of speech; constructing a weight function from the keyword measurement coefficients and calculating the weight of each keyword; and clustering the electronic case files according to their fingerprints and keyword measurement coefficients, then obtaining the core keyword set of each cluster. The method improves the accuracy of grouping cases of the same type, of keyword extraction, and of measuring the similarity between electronic case files, which in turn helps improve the accuracy of electronic case file content analysis and case type classification.
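The keyword-weighting and fingerprinting steps can be illustrated with a minimal sketch. The abstract does not give the formulas, so everything here is an assumption: plain TF-IDF stands in for the keyword weight (the order difference measure and part-of-speech terms are omitted), the fingerprint is assumed to be a SimHash-style hash, and the names `tf_idf`, `simhash`, and `hamming` are hypothetical. The clustering step is not shown.

```python
import hashlib
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: tf-idf} dict per tokenized, non-empty document.

    Assumption: plain TF-IDF is used as a stand-in for the patent's
    keyword weight, which also involves terms not modeled here.
    """
    n = len(docs)
    # Document frequency: number of documents containing each term.
    df = Counter(term for doc in docs for term in set(doc))
    scores = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc)
        scores.append(
            {t: (c / total) * math.log(n / df[t]) for t, c in counts.items()}
        )
    return scores


def simhash(weights, bits=64):
    """Weighted SimHash-style fingerprint of a {term: weight} dict.

    Assumption: the abstract only says "fingerprint"; SimHash is one
    common choice and is used here purely for illustration.
    """
    acc = [0.0] * bits
    for term, w in weights.items():
        digest = hashlib.md5(term.encode("utf-8")).digest()
        h = int.from_bytes(digest[:8], "big")  # 64-bit hash of the term
        for i in range(bits):
            # Each term votes +w or -w on every bit position.
            acc[i] += w if (h >> i) & 1 else -w
    return sum(1 << i for i in range(bits) if acc[i] > 0)


def hamming(a, b):
    """Number of differing bits between two fingerprints."""
    return bin(a ^ b).count("1")


if __name__ == "__main__":
    # Toy tokenized case files (hypothetical data).
    files = [
        ["court", "contract", "breach", "damages"],
        ["court", "contract", "breach", "damages"],
        ["court", "theft", "sentence", "appeal"],
    ]
    prints = [simhash(w) for w in tf_idf(files)]
    print(hamming(prints[0], prints[1]), hamming(prints[0], prints[2]))
```

In this sketch, a smaller Hamming distance between two fingerprints suggests more similar case files, so the distances could feed a clustering step. A word that appears in every file, such as "court" in the toy data, gets a TF-IDF of zero and does not influence the fingerprint.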