etl-automation
Here are 153 public repositories matching this topic...
Language:All
Sort:Most stars
superglue (YC W25) builds integrations and tools from natural language. Get production-grade tools for long tail and enterprise systems.
- Updated
Nov 28, 2025 - TypeScript
Structured Data Extractor for AI Agents. Search your documents or the web for specific data and get it back in JSON or Markdown in a single tool call.
- Updated
Mar 29, 2025 - Python
an app engine for your business. Seamlessly implement business logic with a powerful API. Out of the box CMS, blog, forum and email functionality. Developer friendly & easily extendable for your next SaaS/XaaS project. Built with Rails 6, Devise, Sidekiq & PostgreSQL
- Updated
Nov 26, 2025 - Ruby
Treats CSV and JSON files as SQL tables, and exports SQL SELECTs back to CSV or JSON.
- Updated
Oct 22, 2025 - Kotlin
[EOL] Real-Time Event Streaming & Change Data Capture
- Updated
Nov 4, 2025 - Shell
The Virtual Data Warehouse is a code generation and template management tool. It is part of the data solution automation ecosystem - the 'engine' for data solution automation.
- Updated
Jul 13, 2025 - Handlebars
- Updated
Oct 28, 2024 - HTML
Generic interface exchange format for Data Warehouse Automation and ETL generation.
- Updated
Jul 8, 2024 - C#
The goal of this project is to illustrate Extract Transform Load (ETL) using Python and SQL. ETL is a process commonly done in computing, which takes raw data, cleans it and stores it for later use. The extraction phase targets and retrieves the data. Transform manipulates and cleans the data. Then load stores the data, typically in a data wareh…
- Updated
Sep 7, 2021 - Jupyter Notebook
DIRECT, the Data Integration Run-time Execution Control Tool, is a data logistics control framework that can be used to monitor, log, audit and control data integration / ETL processes.
- Updated
Nov 25, 2025 - TSQL
Data Engineering portfolio projects, resources used to study data tools...
- Updated
Mar 25, 2024 - Jupyter Notebook
proof of concept to generate Airbyte low-code YAML connectors from API documentation
- Updated
Mar 11, 2024 - Python
BETL. Meta data driven ETL generation using T-SQL
- Updated
Jun 29, 2022
Automatically download and transform Hetzner invoices.
- Updated
May 28, 2020 - Python
Amazon Redshift Serverless RSQL ETL Framework
- Updated
Apr 1, 2025 - TypeScript
This is a PHP project which combines ETL with different strategies to extract data from multiple databases, files, and services, transform it and load it into multiple destinations.
- Updated
Oct 22, 2025 - PHP
An ASP NET MVC 6 Web GUI (Net core) for easy reports generation using ReportGenerator
- Updated
Apr 28, 2023 - C#
End-to-end data engineering processes for the NIGERIA Health Facility Registry (HFR). The project leveraged Selenium, Pandas, PySpark, PostgreSQL and Airflow
- Updated
Sep 1, 2022 - Python
Improve this page
Add a description, image, and links to theetl-automation topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with theetl-automation topic, visit your repo's landing page and select "manage topics."