CN120067237A

Movatterモバイル変換

Info

Publication number: CN120067237A
Application number: CN202510533948.1A
Authority: CN
Inventors: 王燕玲; 沈昕; 许佳
Original assignee: Guangdong Bowei Chuangyuan Technology Co ltd
Current assignee: Guangdong Bowei Chuangyuan Technology Co ltd
Priority date: 2025-04-27
Filing date: 2025-04-27
Publication date: 2025-05-30
Anticipated expiration: 2045-04-27
Also published as: CN120067237B

Abstract

Translated fromChinese

本申请涉及文本处理技术领域，尤其涉及一种基于大数据的法律文档处理方法及系统，方法包括：对案情描述进行分词，得到多个描述词；在案例库中计算各描述词的匹配有效性；将归一化后的匹配有效性与所述描述词的TF‑IDF值的乘积作为加权系数对各描述词的语义向量加权求和，得到案情特征，依据案情特征和历史案例的案例特征间的相似度，得到案情描述的相似案例。通过本申请的技术方案，能够提高相似案例检索结果的准确性。

The present application relates to the field of text processing technology, and in particular to a method and system for processing legal documents based on big data, the method comprising: segmenting the case description to obtain multiple description words; calculating the matching validity of each description word in the case library; taking the product of the normalized matching validity and the TF-IDF value of the description word as the weighting coefficient to weight and sum the semantic vectors of each description word to obtain case features, and obtaining similar cases of the case description based on the similarity between the case features and the case features of historical cases. The technical solution of the present application can improve the accuracy of similar case retrieval results.