data-ingestion
Here are 171 public repositories matching this topic...
Language:All
Sort:Most stars
SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.
- Updated
Mar 24, 2025 - Java
ingestr is a CLI tool to copy data between any databases with a single command seamlessly.
- Updated
Mar 24, 2025 - Python
Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.
- Updated
Mar 25, 2025 - Java
Concurrent and multi-stage data ingestion and data processing with Elixir
- Updated
Mar 18, 2025 - Elixir
Pravega - Streaming as a new software defined storage primitive
- Updated
Mar 2, 2025 - Java
Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.
- Updated
Mar 24, 2025 - Go
Copy to/from Parquet in S3 or Azure Blob Storage from within PostgreSQL
- Updated
Mar 24, 2025 - Rust
Orbital automates integration between data sources (APIs, Databases, Queues and Functions). BFF's, API Composition and ETL pipelines that adapt as your specs change.
- Updated
Mar 21, 2025 - TypeScript
Use SQL to build ELT pipelines on a data lakehouse.
- Updated
May 25, 2022 - JavaScript
A Python library that enables ML teams to share, load, and transform data in a collaborative, flexible, and efficient way 🌰
- Updated
Mar 22, 2025 - Python
The Data Engineering Book - หนังสือวิศวกรรมข้อมูล ของคนไทย เพื่อคนไทย
- Updated
Oct 28, 2023 - JavaScript
Apache Paimon Rust The rust implementation of Apache Paimon.
- Updated
Oct 1, 2024 - Rust
- Updated
Feb 11, 2020 - Java
Sample code for the AWS Big Data Blog Post Building a scalable streaming data processor with Amazon Kinesis Data Streams on AWS Fargate
- Updated
Mar 25, 2025 - Python
Enables custom tracing of Java applications in Dynatrace
- Updated
Sep 3, 2024 - Java
Download and warehouse historical trading data
- Updated
Mar 17, 2023 - Elixir
The Data Integration Library project provides a library of generic components based on a multi-stage architecture for data ingress and egress.
- Updated
Mar 3, 2025 - Java
Improve this page
Add a description, image, and links to thedata-ingestion topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thedata-ingestion topic, visit your repo's landing page and select "manage topics."