- Notifications
You must be signed in to change notification settings - Fork0
Datasets Used in AI4DB Research Work
License
Wind-Gone/ai4db-datasets
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
ai4db-datasets is a curated collection of benchmarks and datasets used in various database-related research papers. This repository aims to facilitate research in database systems, query optimization, indexing, and machine learning for databases by providing ready-to-use dataset import instructions and references to academic work.
Each dataset entry includes:
- A brief description of the dataset and its research significance.
- How to obtain and load the dataset into database systems (e.g., PostgreSQL, MySQL, etc.).
- References to research papers that have used the dataset.
- Additional setup or preprocessing steps, if applicable.
This repository welcomes contributions from researchers and practitioners who use and maintain datasets for database-related studies.
Important
I believe the most important aspect is not just providing information, but ensuring that the provider has successfully achieved data importation themselves.
We encourage researchers and developers to contribute new datasets following the template below:
Each new dataset entry should include:
- A short description of the dataset.
- The purpose and research areas where it has been used.
Links to official sources, repositories, or dataset downloads.
Commands or scripts to load the dataset into a database system.
- A list of academic papers or projects utilizing the dataset.
To contribute, fork the repository, add your dataset following the template, and submit a pull request.
This repository is open for public contributions. Ensure compliance with the licenses of individual datasets before using them in research.
For questions or dataset requests, feel free to open an issue or submit a pull request!