huggingface-datasets
Here are 189 public repositories matching this topic...
Language:All
Sort:Most stars
Curate High Quality Datasets, Train, Evaluate and Ship! 🚀
- Updated
Dec 18, 2025 - Python
Generic template to bootstrap your PyTorch project.
- Updated
Oct 12, 2023 - Python
[EMNLP 2022] Unifying and multi-tasking structured knowledge grounding with language models
- Updated
Aug 22, 2023 - Python
An open-source alternative to Yahoo Finance's market data APIs with higher reliability.
- Updated
Dec 8, 2025 - Python
Comprehensive Vector Data Tooling. The universal interface for all vector database, datasets and RAG platforms. Easily export, import, backup, re-embed (using any model) or access your vector data from any vector databases or repository.
- Updated
Dec 15, 2025 - Jupyter Notebook
Forecast evaluation library
- Updated
Dec 16, 2025 - Python
中文医学多模态大模型 Large Chinese Language-and-Vision Assistant for BioMedicine
- Updated
May 22, 2024 - Python
Pytorch-like dataloaders for JAX.
- Updated
Dec 16, 2025 - Jupyter Notebook
Translate large dataset to any language with google translation api and multithreads processing, no key required!
- Updated
Oct 31, 2025 - Python
A gopeed-extension for downloading models and datasets from huggingface, hf-mirror and modelscope. Huggingface download
- Updated
Nov 4, 2025 - JavaScript
Automate Fashion Image Captioning using BLIP-2. Automatic generating descriptions of clothes on shopping websites, which can help customers without fashion knowledge to better understand the features (attributes, style, functionality etc.) of the items and increase online sales by enticing more customers.
- Updated
Aug 7, 2023 - Jupyter Notebook
使用LLaMA-Factory微调多模态大语言模型的示例代码 Demo of Finetuning Multimodal LLM with LLaMA-Factory
- Updated
Sep 8, 2024 - Python
Mount remote repositories, models and datasets managed by Git LFS instantly.
- Updated
Jun 21, 2025 - Go
Retrieves parquet files from Hugging Face, identifies and quantifies junky data, duplication, contamination, and biased content in dataset using pandas
- Updated
Jul 6, 2023 - Python
huggingface-go : 高速下载 huggingface 的模型和数据集
- Updated
Aug 27, 2025 - Go
🤗 AeroPath: An airway segmentation benchmark dataset with challenging pathology
- Updated
Aug 1, 2025 - Jupyter Notebook
Getting started with Hugging Face
- Updated
Mar 13, 2025 - Jupyter Notebook
A collection of Italian benchmarks for LLM evaluation
- Updated
Dec 2, 2025 - Python
NLP model that predicts subreddit based on the title of a post
- Updated
Mar 22, 2023 - Jupyter Notebook
PySpark custom data source for Hugging Face Datasets
- Updated
Aug 12, 2025 - Python
Improve this page
Add a description, image, and links to thehuggingface-datasets topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thehuggingface-datasets topic, visit your repo's landing page and select "manage topics."