data-transformation
Here are 815 public repositories matching this topic...
Language:All
Sort:Most stars
☄️ Python's nested data operator (and CLI), for all your declarative restructuring needs. Got data? Glom it! ☄️
- Updated
Jan 12, 2025 - Python
🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
- Updated
Dec 2, 2024 - Python
Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.
- Updated
Dec 17, 2025 - Go
Logical Replication extension for PostgreSQL 17, 16, 15, 14, 13, 12, 11, 10, 9.6, 9.5, 9.4 (Postgres), providing much faster replication than Slony, Bucardo or Londiste, as well as cross-version upgrades.
- Updated
Aug 23, 2025 - C
A block-based API for NSValueTransformer, with a growing collection of useful examples.
- Updated
Oct 1, 2021 - Objective-C
Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.
- Updated
Jun 8, 2024 - Go
Advanced and Fast Data Transformation in R
- Updated
Dec 9, 2025 - C
Microsoft Program Synthesis using Examples SDK is a framework of technologies for the automatic generation of programs from input-output examples. This repo includes samples and sample data for the Microsoft Program Synthesis using Example SDK.
- Updated
Nov 19, 2025 - C#
💄 Durable and asynchronous data imports for consuming data at scale and publishing testable SDKs.
- Updated
Feb 19, 2025 - PHP
Official Repository of "LLM × DATA" Survey Paper
- Updated
Nov 2, 2025
Like awk, but with SQL and table joins
- Updated
Nov 25, 2024 - Tcl
An Extensible Suite of High-Performance and Low-Dependency Packages for Statistical Computing and Data Manipulation in R
- Updated
Dec 13, 2025 - R
📄 Concise selector to extract JSON from HTML.
- Updated
Jul 6, 2024 - TypeScript
O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian
- Updated
Jun 26, 2023 - Python
Clojure Query: A Command-line Data Processor for JSON, YAML, EDN, XML and more
- Updated
Dec 9, 2025 - Clojure
A curated list of Clojure resources for dealing with domain-specific languages.
- Updated
Jul 30, 2024
A simple Spark-powered ETL framework that just works 🍺
- Updated
Oct 2, 2025 - Scala
Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University
- Updated
Dec 4, 2025 - HTML
Data transformation and utility functions for R
- Updated
Jul 30, 2025 - R
🤖 An automated machine learning framework for audio, text, image, video, or .CSV files (50+ featurizers and 15+ model trainers). Python 3.6 required.
- Updated
Apr 2, 2025 - Python
Improve this page
Add a description, image, and links to thedata-transformation topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thedata-transformation topic, visit your repo's landing page and select "manage topics."