chinese-text-segmentation
Here are 44 public repositories matching this topic...
SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
- Updated Nov 5, 2025 - C#
Deep-learning Chinese word segmentation
- Updated May 18, 2018 - C++
"Jieba" (結巴, Chinese for "to stutter") Chinese text segmentation: built to be the best PHP Chinese word segmentation module.
- Updated Dec 16, 2025 - PHP
Jcseg is a lightweight NLP framework developed in Java. It provides CJK and English segmentation based on the MMSEG algorithm, along with keyword extraction, key-sentence extraction, and summary extraction based on the TextRank algorithm. Jcseg has a built-in HTTP server and search modules for Lucene, Solr, Elasticsearch, and OpenSearch.
- Updated Sep 18, 2023 - Java
Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
- Updated Nov 28, 2025 - Python
zhparser is a PostgreSQL extension for Chinese full-text search.
- Updated Oct 21, 2025 - C
Chinese text segmentation with R. (Documentation updated 🎉: https://qinwenfeng.com/jiebaR/)
- Updated Jul 13, 2020 - C++
Chinese text classification and sequence labeling toolkit (PyTorch). Supports multi-class and multi-label classification of long and short Chinese texts, text similarity, and sequence labeling tasks such as Chinese named entity recognition (NER), part-of-speech tagging, word segmentation, and extractive text summarization.
- Updated Jul 18, 2024 - Python
HanLP Chinese word segmentation plugin for Lucene, supporting Lucene-based systems including Solr.
- Updated Oct 13, 2020 - Java
Tokenizer supporting Lucene 5/6/7/8/9+ versions, LTS
- Updated Dec 18, 2023 - Java
Chinese word segmentation implemented with deep learning.
- Updated Jul 30, 2017 - Python
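Open-source Chinese word segmentation toolkit: Chinese word segmentation Web API, Chinese word segmentation for Lucene, and mixed Chinese-English segmentation.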
- Updated Dec 8, 2020 - Scala
Mandarin Chinese text segmentation and mobile dictionary Android app
- Updated Feb 1, 2022 - Java
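Built for text entry on the Chinese Text Project (《中國哲學書電子化計劃》): faster typing and typesetting and a better input experience; one treasure of the study that beats the traditional four; C# and Word VBA tools for literature and history, written by a PhD in Chinese.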
- Updated Dec 15, 2025 - C#
Chinese word segmentation based on deep learning and an LSTM neural network
- Updated Nov 22, 2016 - Python
A performance-optimized version of Jiebago that supports loading dictionaries from an io.Reader.
- Updated Dec 3, 2022 - Go
ik-analyzer for Rust; Chinese tokenizer for tantivy
- Updated Jan 19, 2024 - Rust
Chinese adaptation of ChatterBot, supporting search over segmented Chinese text and Chinese stop words.
- Updated Mar 28, 2020 - Python
An unsupervised Chinese word segmentation tool.
- Updated May 13, 2017 - C++
A Chinese word segmentation plugin based on jieba-rs.
- Updated Nov 1, 2025 - TypeScript