open-datasets
Here are 152 public repositories matching this topic...
Language:All
Sort:Most stars
Community Datasets added by users and made available for use at large
- Updated
Nov 29, 2025 - HTML
DataHub commons. Wiki catalog of interesting and important datasets
- Updated
Mar 11, 2025
A review of change detection methods, including codes and open data sets for deep learning. From paper: change detection based on artificial intelligence: state-of-the-art and challenges.
- Updated
Jul 30, 2021
Open Public Domain Exercise Dataset in JSON format
- Updated
Feb 16, 2025 - TypeScript
A comprehensive, global, open source database of power plants
- Updated
Jan 26, 2022 - HTML
Data search & enrichment library for Machine Learning → Easily find and add relevant features to your ML & AI pipeline from hundreds of public and premium external data sources, including open & commercial LLMs
- Updated
Nov 20, 2025 - Python
Chronic Disease Prediction Using Medical Notes
- Updated
Sep 26, 2019 - Python
Geojson and topojson files for all municipalities, by regions and provinces
- Updated
May 14, 2024 - Shell
Metadata and versioning details for the Common Voice dataset
- Updated
Oct 9, 2025 - JavaScript
Open Data Portals and Sites around the world
- Updated
Nov 21, 2025 - Nunjucks
A publicly-editable collection of open science resources, including tools, datasets, meta-resources, etc.
- Updated
Aug 12, 2022
Index of Clojure libraries available on github.
- Updated
Nov 23, 2025 - Clojure
Code for our DLS'21 paper - BODMAS: An Open Dataset for Learning based Temporal Analysis of PE Malware. BODMAS is short for Blue Hexagon Open Dataset for Malware AnalysiS.
- Updated
Mar 31, 2024 - Python
YOLOv7 to detect bone fractures on X-ray images
- Updated
Apr 3, 2023 - Python
Data repository of JSON files that are filed by US Senators on efdsearch.senate.gov where they must report their stock trades. This is the same data as on senatestockwatcher.com
- Updated
Mar 16, 2021 - Python
A Dataset for Cover Song Identification and Understanding
- Updated
Feb 23, 2023 - Python
Vehicular trajectories processing for Didi GAIA Open Data Set
- Updated
Apr 4, 2023 - Jupyter Notebook
🔬 Code-free deep segmentation for computational pathology
- Updated
Jun 14, 2024 - Jupyter Notebook
Database with information about Nuclear Power Plants worldwide.
- Updated
Mar 3, 2024
Historical data on COVID-19 vaccination doses administered in Germany, per state.
- Updated
May 12, 2022 - HTML
Improve this page
Add a description, image, and links to theopen-datasets topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with theopen-datasets topic, visit your repo's landing page and select "manage topics."