etl
Here are 6,623 public repositories matching this topic...
Language:All
Sort:Most stars
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
- Updated
Feb 20, 2026 - Python
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
- Updated
Feb 20, 2026 - Python
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
- Updated
Feb 20, 2026 - Python
An orchestration platform for the development, production, and observation of data assets.
- Updated
Feb 20, 2026 - Python
🧙 Build, run, and manage data pipelines for integrating and transforming data.
- Updated
Feb 20, 2026 - Python
Fancy stream processing made operationally mundane
- Updated
Feb 20, 2026 - Go
Zero-ETL, infinite possibilities. Live query APIs, code & more with SQL. No DB required.
- Updated
Feb 20, 2026 - Go
Flink CDC is a streaming data integration tool
- Updated
Feb 13, 2026 - Java
Data pipelines for cloud config and security data. Build cloud asset inventory, CSPM, FinOps, and vulnerability management solutions. Extract from AWS, Azure, GCP, and 70+ cloud and SaaS sources.
- Updated
Feb 20, 2026 - Go
Data transformation framework for AI. Ultra performant, with incremental processing. 🌟 Star if you like it!
- Updated
Feb 20, 2026 - Rust
High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale
- Updated
Feb 20, 2026 - Rust
Privacy and Security focused Segment-alternative, in Golang and React
- Updated
Feb 20, 2026 - Go
Open Source Data Security Platform for Developers to Monitor and Detect PII, Anonymize Production Data and Sync it across environments.
- Updated
Aug 30, 2025 - Go
Build data pipelines, the easy way 🛠️
- Updated
Jun 6, 2023 - TypeScript
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
- Updated
Feb 20, 2026 - Python
Spreadsheet with AI, Code, Connections
- Updated
Feb 20, 2026 - Rust
Maestro: Netflix’s Workflow Orchestrator
- Updated
Feb 18, 2026 - Java
A system for agentic LLM-powered data processing and ETL
- Updated
Feb 2, 2026 - Python
A curated list with resources about node-based UIs
- Updated
Jun 29, 2025
Python scripts for ETL (extract, transform and load) jobs for Ethereum blocks, transactions, ERC20 / ERC721 tokens, transfers, receipts, logs, contracts, internal transactions. Data is available in Google BigQueryhttps://goo.gl/oY5BCQ
- Updated
Jan 25, 2026 - Python
Improve this page
Add a description, image, and links to theetl topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with theetl topic, visit your repo's landing page and select "manage topics."