etl
Here are 5,861 public repositories matching this topic...
Language:All
Sort:Most stars
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
- Updated
Dec 17, 2025 - Python
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
- Updated
Dec 17, 2025 - Python
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
- Updated
Dec 18, 2025 - Python
An orchestration platform for the development, production, and observation of data assets.
- Updated
Dec 18, 2025 - Python
🧙 Build, run, and manage data pipelines for integrating and transforming data.
- Updated
Dec 17, 2025 - Python
Fancy stream processing made operationally mundane
- Updated
Dec 17, 2025 - Go
Zero-ETL, infinite possibilities. Live query APIs, code & more with SQL. No DB required.
- Updated
Dec 16, 2025 - Go
Flink CDC is a streaming data integration tool
- Updated
Dec 16, 2025 - Java
Data pipelines for cloud config and security data. Build cloud asset inventory, CSPM, FinOps, and vulnerability management solutions. Extract from AWS, Azure, GCP, and 70+ cloud and SaaS sources.
- Updated
Dec 17, 2025 - Go
High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale
- Updated
Dec 18, 2025 - Rust
Privacy and Security focused Segment-alternative, in Golang and React
- Updated
Dec 17, 2025 - Go
Build data pipelines, the easy way 🛠️
- Updated
Jun 6, 2023 - TypeScript
Open Source Data Security Platform for Developers to Monitor and Detect PII, Anonymize Production Data and Sync it across environments.
- Updated
Aug 30, 2025 - Go
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
- Updated
Dec 15, 2025 - Python
Data transformation framework for AI. Ultra performant, with incremental processing. 🌟 Star if you like it!
- Updated
Dec 17, 2025 - Rust
Spreadsheet with AI, Code, Connections
- Updated
Dec 18, 2025 - TypeScript
Maestro: Netflix’s Workflow Orchestrator
- Updated
Dec 4, 2025 - Java
A curated list with resources about node-based UIs
- Updated
Jun 29, 2025
A system for agentic LLM-powered data processing and ETL
- Updated
Nov 29, 2025 - Python
Python scripts for ETL (extract, transform and load) jobs for Ethereum blocks, transactions, ERC20 / ERC721 tokens, transfers, receipts, logs, contracts, internal transactions. Data is available in Google BigQueryhttps://goo.gl/oY5BCQ
- Updated
Aug 27, 2025 - Python
Improve this page
Add a description, image, and links to theetl topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with theetl topic, visit your repo's landing page and select "manage topics."