Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings
#

word-segmentation

Here are 145 public repositories matching this topic...

Unsupervised text tokenizer for Neural Network-based text generation.

  • UpdatedDec 16, 2025
  • C++

百度NLP:分词,词性标注,命名实体识别,词重要性

  • UpdatedMay 25, 2021
  • C++

Unsupervised text tokenizer focused on computational efficiency

  • UpdatedMar 29, 2024
  • C++

Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).

  • UpdatedJun 2, 2025
  • Python
Kiwinagisa

A Japanese tokenizer based on recurrent neural networks

  • UpdatedOct 29, 2025
  • Python

中文文本分类、序列标注工具包(pytorch),支持中文长文本、短文本的多类、多标签分类任务,支持中文命名实体识别、词性标注、分词、抽取式文本摘要等序列标注任务。 Chinese text classification and sequence labeling toolkit, supports multi class and multi label classification, text similsrity, text summary and NER.

  • UpdatedJul 18, 2024
  • Python
kiwipiepy

This repository is archived! The maintained MeCab can be foundhttps://github.com/shogo82148/mecab

  • UpdatedOct 15, 2024
  • C++
monpa

MONPA 罔拍是一個提供正體中文斷詞、詞性標註以及命名實體辨識的多任務模型

  • UpdatedFeb 20, 2025
  • Python

Improve this page

Add a description, image, and links to theword-segmentation topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with theword-segmentation topic, visit your repo's landing page and select "manage topics."

Learn more


[8]ページ先頭

©2009-2025 Movatter.jp