nlp-resources
Natural language processing (NLP) is a field of computer science that studies how computers and humans interact through language. In 1950, Alan Turing published an article, "Computing Machinery and Intelligence," that proposed a measure of intelligence now called the Turing test. More modern techniques, such as deep learning, have produced strong results in language modeling, parsing, and other natural-language tasks.
Here are 134 public repositories matching this topic...
A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
- Updated Jun 12, 2025 - Python
This is a continuously updated handbook for readers to easily track the latest Text-to-SQL techniques in the literature and provide practical guidance for researchers and practitioners.
- Updated Dec 10, 2025 - Python
Portuguese pre-trained BERT models
- Updated Jun 17, 2024 - Python
The hands-on NLTK tutorial for NLP in Python
- Updated May 28, 2024 - Jupyter Notebook
A curated list of Open Information Extraction (OIE) resources: papers, code, data, etc.
- Updated Oct 25, 2022
Chinese NLP corpus of Chinese science fiction: about 4,675 Chinese science fiction novels. A Chinese science fiction natural language processing corpus, text corpus, and text database.
- Updated Oct 22, 2022
Projects and useful articles / links
- Updated Nov 25, 2024 - Jupyter Notebook
A curated list of beginner resources in Natural Language Processing
- Updated Jan 10, 2017
This repository contains code and datasets related to entity/knowledge papers from the VERT (Versatile Entity Recognition & disambiguation Toolkit) project, by the Knowledge Computing group at Microsoft Research Asia (MSRA).
- Updated Mar 16, 2024 - Python
A lexicon for Sudachi
- Updated Nov 4, 2025 - Python
A curated list of NLP resources for Hungarian
- Updated Aug 11, 2025
A Dutch RoBERTa-based language model
- Updated Apr 8, 2024 - Jupyter Notebook
TriggerNER: Learning with Entity Triggers as Explanations for Named Entity Recognition (ACL 2020)
- Updated Jun 15, 2022 - Python
Summaries of all the papers I read
- Updated Dec 14, 2025
Chinese NLP corpus of Chinese science fiction: archive of the Ark Plan of the Ula Science Fiction Website. A Chinese science fiction natural language processing corpus, text corpus, and text database.
- Updated Oct 22, 2022
Natural language processing (NLP) material covering topics such as tokenization, part-of-speech (POS) tagging, machine translation, named entity recognition (NER), classification, and sentiment analysis.
- Updated Jan 4, 2024 - Python
A modular annotation system that supports complex, interactive annotation graphs embedded on top of sequences of text.
- Updated Dec 21, 2021 - JavaScript
- Updated Jan 18, 2023 - HTML
Created by Alan Turing
- Followers: 25.8k
- Website: github.com/topics/nlp