dataset
Here are 13,717 public repositories matching this topic...
Language:All
Sort:Most stars
A collective list of free APIs
- Updated
Feb 19, 2026 - Python
Label Studio is a multi-type data labeling and annotation tool with standardized output format
- Updated
Feb 20, 2026 - TypeScript
Faker is a Python package that generates fake data for you.
- Updated
Feb 6, 2026 - Python
pix2tex: Using a ViT to convert images of equations into LaTeX code.
- Updated
Jan 18, 2025 - Python
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
- Updated
Feb 20, 2026 - Python
A powerful tool for creating datasets for LLM fine-tuning 、RAG and Eval
- Updated
Feb 20, 2026 - JavaScript
A MNIST-like fashion product database. Benchmark 👇
- Updated
Jun 13, 2022 - Python
Open source annotation tool for machine learning practitioners.
- Updated
Feb 17, 2026 - Python
Techniques for deep learning with satellite & aerial imagery
- Updated
Feb 19, 2026
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
- Updated
Feb 6, 2026
Curated list of Machine Learning, NLP, Vision, Recommender Systems Project Ideas
- Updated
Mar 13, 2023
Documentation on how to access and use the Quick, Draw! Dataset.
- Updated
Mar 11, 2025
CSGHub is a brand-new open-source platform for managing LLMs, developed by the OpenCSG team. It offers both open-source and on-premise/SaaS solutions, with features comparable to Hugging Face. Gain full control over the lifecycle of LLMs, datasets, and agents, with Python SDK compatibility with Hugging Face. Join us! ⭐️
- Updated
Feb 4, 2026 - Vue
Browser compatibility data for Web technologies as displayed on MDN
- Updated
Feb 20, 2026 - JSON
TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
- Updated
Feb 18, 2026 - Python
Transformer: PyTorch Implementation of "Attention Is All You Need"
- Updated
Jul 15, 2025 - Python
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
- Updated
Oct 19, 2025 - Python
SQL Translator is a tool for converting natural language queries into SQL code using artificial intelligence. This project is 100% free and open source.
- Updated
Jul 6, 2025 - TypeScript
Improve this page
Add a description, image, and links to thedataset topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thedataset topic, visit your repo's landing page and select "manage topics."