synthetic-data
Here are 689 public repositories matching this topic...
Language:All
Sort:Most stars
Code for Machine Learning for Algorithmic Trading, 2nd edition.
- Updated
Aug 18, 2024 - Jupyter Notebook
Mimesis is a robust data generator for Python that can produce a wide range of fake data in multiple languages.
- Updated
Mar 18, 2025 - Python
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
- Updated
Mar 21, 2025 - Python
Open source data anonymization and synthetic data platform for developers. Anonymize your production data and sync it across your environments so that developers can safely use it.
- Updated
Mar 21, 2025 - Go
The easiest tool for fine-tuning LLM models, synthetic data generation, and collaborating on datasets.
- Updated
Mar 19, 2025 - Python
A procedural Blender pipeline for photorealistic training image generation
- Updated
Jan 13, 2025 - Python
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
- Updated
Mar 17, 2025 - Python
Synthetic data generation for tabular data
- Updated
Mar 21, 2025 - Python
Synthetic Patient Population Simulator
- Updated
Mar 6, 2025 - Java
SDG is a specialized framework designed to generate high-quality structured tabular data.
- Updated
Mar 6, 2025 - Python
UnrealCV: Connecting Computer Vision to Unreal Engine
- Updated
Mar 12, 2025 - C++
Synthetic data generators for tabular and time-series data
- Updated
Mar 12, 2025 - Jupyter Notebook
The Declarative Data Generator
- Updated
Sep 27, 2024 - Rust
Conditional GAN for generating synthetic tabular data.
- Updated
Mar 17, 2025 - Python
PostgreSQL database anonymization and synthetic data generation tool
- Updated
Mar 20, 2025 - Go
Synthetic data curation for post-training and structured data extraction
- Updated
Mar 21, 2025 - Python
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤
- Updated
Feb 2, 2025 - Python
A framework for comprehensive diagnosis and optimization of agents using simulated, realistic synthetic interactions
- Updated
Mar 11, 2025 - Python
A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.
- Updated
Feb 28, 2025 - Python
Curated list of open source tooling for data-centric AI on unstructured data.
- Updated
Nov 15, 2023
Improve this page
Add a description, image, and links to thesynthetic-data topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thesynthetic-data topic, visit your repo's landing page and select "manage topics."