stopwords
Here are 297 public repositories matching this topic...
Language:All
Sort:Most stars
jieba analysis plugin for elasticsearch 7.0.0, 6.4.0, 6.0.0, 5.4.0,5.3.0, 5.2.2, 5.2.1, 5.2, 5.1.2, 5.1.1
- Updated
Jan 3, 2024 - Java
A collection of languages stemmers and stopwords for Lunr Javascript library
- Updated
Mar 9, 2025 - JavaScript
Largest list of Arabic stop words on Github. أكبر قائمة لمستبعدات الفهرسة العربية على جيت هاب
- Updated
Mar 27, 2024
Default English stopword lists from many different sources
- Updated
Apr 6, 2023 - Python
Persian (Farsi) Stop Words List
- Updated
Dec 27, 2021 - Python
This repository consists of a complete guide on natural language processing (NLP) in Python where we'll learn various techniques for implementing NLP including parsing & text processing and understand how to use NLP for text feature engineering.
- Updated
Jul 4, 2022 - Jupyter Notebook
Removes most frequent words (stop words) from a text content. Based on a Curated list of language statistics.
- Updated
Jul 20, 2023 - Go
🍊 📄 Text Mining add-on for Orange3
- Updated
Sep 4, 2025 - Python
A data package containing lexicons and dictionaries for text analysis
- Updated
Oct 12, 2021 - R
PHP | A collection of stop words for e.g. search-functions.
- Updated
Feb 8, 2025 - PHP
the list of ~2000 ukrainian stopwords (with numbers)
- Updated
May 20, 2021 - Python
A collection of Persian stopwords - فهرست کلمات ایست فارسی
- Updated
Oct 16, 2021
This project employs emotion detection in textual data, specifically trained on Twitter data comprising tweets labeled with corresponding emotions. It seamlessly takes text inputs and provides the most fitting emotion assigned to it.
- Updated
Oct 29, 2024 - Jupyter Notebook
📒 An Aho-Corasick algorithm based string-searching utility for Go. It supports tokenization, ignoring case, replacing text. So you can use it to find keywords in an article, filter sensitive words, etc.
- Updated
Sep 14, 2022 - Go
📒 An Aho-Corasick algorithm based string-searching utility for Java. It supports tokenization, ignoring case, replacing text. So you can use it to find keywords in an article, filter sensitive words, etc.
- Updated
Sep 14, 2022 - Java
Improve this page
Add a description, image, and links to thestopwords topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thestopwords topic, visit your repo's landing page and select "manage topics."