nlp-resources
Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.
Here are 133 public repositories matching this topic...
Language:All
Sort:Most stars
A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
- Updated
Jun 12, 2025 - Python
Portuguese pre-trained BERT models
- Updated
Jun 17, 2024 - Python
The hands-on NLTK tutorial for NLP in Python
- Updated
Feb 20, 2026 - Jupyter Notebook
A curated list of Open Information Extraction (OIE) resources: papers, code, data, etc.
- Updated
Oct 25, 2022
chinese NLP corpus of chinese science fiction,chinese science fiction corpus : About 4675 Chinese science fiction novels 大约有4675本科幻小说,中文科幻小说自然语言处理语料库,中文科幻小说文本语料库,中文科幻小说文本数据库,科幻小说语料
- Updated
Oct 22, 2022
Projects and useful articles / links
- Updated
Nov 25, 2024 - Jupyter Notebook
A curated list of beginner resources in Natural Language Processing
- Updated
Jan 10, 2017
This repository contains code and datasets related to entity/knowledge papers from the VERT (Versatile Entity Recognition & disambiguation Toolkit) project, by the Knowledge Computing group at Microsoft Research Asia (MSRA).
- Updated
Mar 16, 2024 - Python
A lexicon for Sudachi
- Updated
Jan 20, 2026 - Python
A curated list of NLP resources for Hungarian
- Updated
Jan 22, 2026
A Dutch RoBERTa-based language model
- Updated
Apr 8, 2024 - Jupyter Notebook
TriggerNER: Learning with Entity Triggers as Explanations for Named Entity Recognition (ACL 2020)
- Updated
Jun 15, 2022 - Python
summaries of all the papers I read
- Updated
Feb 20, 2026
chinese NLP corpus of chinese science fiction, chinese science fiction corpus: Archive of the Ark Plan of Ula Science Fiction Website 乌拉科幻小说网方舟计划存档,中文科幻小说自然语言处理语料库,中文科幻小说文本语料库,中文科幻小说文本数据库,科幻小说语料
- Updated
Oct 22, 2022
Natural Language Processing (NLP). Covering topics such as Tokenization, Part Of Speech tagging (POS), Machine translation, Named Entity Recognition (NER), Classification, and Sentiment analysis.
- Updated
Jan 4, 2024 - Python
A modular annotation system that supports complex, interactive annotation graphs embedded on top of sequences of text.
- Updated
Dec 21, 2021 - JavaScript
An open information extraction system that provides compact extractions
- Updated
Feb 26, 2022 - Java
- Updated
Jan 18, 2023 - HTML
Created by Alan Turing
- Followers
- 25.8k followers
- Website
- github.com/topics/nlp
- Wikipedia
- Wikipedia