etl
Here are 4,916 public repositories matching this topic...
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
- Updated Jul 18, 2025 - Python
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
- Updated Jul 18, 2025 - Python
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
- Updated Jul 18, 2025 - Python
An orchestration platform for the development, production, and observation of data assets.
- Updated Jul 18, 2025 - Python
🧙 Build, run, and manage data pipelines for integrating and transforming data.
- Updated Jul 18, 2025 - Python
Fancy stream processing made operationally mundane
- Updated Jul 18, 2025 - Go
Zero-ETL, infinite possibilities. Live query APIs, code & more with SQL. No DB required.
- Updated Jul 17, 2025 - Go
Flink CDC is a streaming data integration tool
- Updated Jul 10, 2025 - Java
The developer-first cloud governance platform
- Updated Jul 18, 2025 - Go
Privacy and Security focused Segment-alternative, in Golang and React
- Updated Jul 18, 2025 - Go
Build data pipelines, the easy way 🛠️
- Updated Jun 6, 2023 - TypeScript
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
- Updated Jul 14, 2025 - Python
Open Source Data Security Platform for Developers to Monitor and Detect PII, Anonymize Production Data and Sync it across environments.
- Updated Jul 18, 2025 - Go
Spreadsheet with AI, Code, Connections
- Updated Jul 18, 2025 - Rust
Maestro: Netflix’s Workflow Orchestrator
- Updated Jul 11, 2025 - Java
A curated list with resources about node-based UIs
- Updated Jun 29, 2025
Python scripts for ETL (extract, transform, and load) jobs for Ethereum blocks, transactions, ERC20 / ERC721 tokens, transfers, receipts, logs, contracts, and internal transactions. Data is available in Google BigQuery: https://goo.gl/oY5BCQ
- Updated Apr 30, 2025 - Python
Apache DevLake is an open-source dev data platform to ingest, analyze, and visualize the fragmented data from DevOps tools, extracting insights for engineering excellence, developer experience, and community growth.
- Updated Jul 18, 2025 - Go
A fast, simple, and cost-effective tool to replicate data from Postgres to data warehouses, queues, and storage
- Updated Jul 18, 2025 - Go
Scalable and efficient data transformation framework - backwards compatible with dbt.
- Updated Jul 18, 2025 - Python