nlp-machine-learning
Natural language processing (NLP) is a field of computer science that studies how computers and humans interact through natural language. In 1950, Alan Turing published an article, "Computing Machinery and Intelligence," that proposed a measure of machine intelligence now called the Turing test. More recent techniques, such as deep learning, have produced strong results in language modeling, parsing, and other natural-language tasks.
Here are 6,498 public repositories matching this topic...
An open source library for deep learning end-to-end dialog systems and chatbots.
- Updated Jul 4, 2025 - Python
An Open-Source Framework for Prompt-Learning.
- Updated Jul 16, 2024 - Python
Structured data extraction and instruction calling with ML, LLM and Vision LLM
- Updated Jul 4, 2025 - Python
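MNBVC (Massive Never-ending BT Vast Chinese corpus): an ultra-large-scale Chinese corpus, benchmarked against the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures and even "Martian script" internet slang. It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, scripted dialogue, forum posts, wiki pages, classical poetry, song lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs, and more.
- Updated Jul 9, 2025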
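ChatGPT has made chatbots popular, and the mainstream trend has shifted toward GPT-style models. This project is keeping pace and will release a GPT-style version in the near future. With this project and your own corpus, you can train the chatbot you want, for use in intelligent customer service, online Q&A, casual chat, and similar scenarios.
- Updated Jun 26, 2024 - Python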
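A curated collection of learning materials on machine learning, NLP, image recognition, deep learning, and other areas of artificial intelligence, plus organized resources on the architecture and algorithms of search, recommendation, and advertising systems. Includes a compilation of notes from algorithm experts.
- Updated Apr 15, 2024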
Datasets, tools, and benchmarks for representation learning of code.
- Updated Jan 31, 2022 - Jupyter Notebook
Text Classification Algorithms: A Survey
- Updated Apr 1, 2025 - Python
Tika-Python is a Python binding to the Apache Tika™ REST services, allowing Tika to be called natively from Python.
- Updated Apr 14, 2025 - Python
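Papers in the field of natural language processing (with reading notes), along with reproduced models and data processing (code provided in both TensorFlow and PyTorch versions).
- Updated Jan 5, 2024 - Python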
The most accurate natural language detection library for Go, suitable for short text and mixed-language text
- Updated Feb 6, 2025 - Go
A Python package for running contextualized topic models (CTMs). CTMs combine contextualized embeddings (e.g., BERT) with topic models to produce coherent topics. Published at EACL and ACL 2021 (Bianchi et al.).
- Updated Feb 4, 2025 - Python
End-to-end neural table-text understanding models.
- Updated Jul 22, 2024 - Python
A deep dive into embeddings starting from fundamentals
- Updated Nov 18, 2024 - Jupyter Notebook
Python AI assistant 🧠
- Updated Nov 17, 2024 - Python
The most accurate natural language detection library for Rust, suitable for short text and mixed-language text
- Updated Jul 10, 2025 - Rust
Rasa UI is a frontend for the Rasa Framework
- Updated Dec 30, 2022 - JavaScript
skweak: A software toolkit for weak supervision applied to NLP tasks
- Updated Sep 2, 2024 - Python
A new model designed for the code generation task. According to its authors, its test accuracy on the HumanEval base dataset surpasses that of GPT-4 Turbo (April 2024) and GPT-4o.
- Updated Jul 6, 2024 - Python
Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.
- Updated Oct 25, 2024 - Python
Created by Alan Turing
- Followers: 25.8k
- Website: github.com/topics/nlp
- Wikipedia