datapipeline
Here are 238 public repositories matching this topic...
Language:All
Sort:Most stars
大数据采集,抽取平台,zdh_web是zdh系列服务的可视化管理平台,包含数据采集,调度,权限,审批流,私域营销等模块
- Updated
Nov 15, 2025 - Java
Roadmap for Data Engineering
- Updated
Jun 20, 2024 - Java
Simple stream processing pipeline
- Updated
Jun 17, 2024 - Python
Build super simple end-to-end data & ETL pipelines for your vector databases and Generative AI applications
- Updated
Oct 5, 2024 - Python
High Performance Tensorflow Data Pipeline with State of Art Augmentations and low level optimizations.
- Updated
Apr 22, 2022 - Python
Step by step instructions to create a production-ready data pipeline
- Updated
Dec 23, 2024 - Jupyter Notebook
Tensorflow 2 Tutorials (use tensorflow and keras in a better way!)
- Updated
May 16, 2024 - Jupyter Notebook
Terraform module designed to easily backup EFS filesystems to S3 using DataPipeline
- Updated
Oct 14, 2025 - HCL
Awesome list for datapipeline
- Updated
Feb 6, 2023
Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌
- Updated
May 15, 2020 - Scala
kedro cli plugin for generating a static kedro viz site (html, css, js) that can be deployed on many serverless tools.
- Updated
Jan 6, 2023 - Python
Ethereum client written in Go, modified for full-hierarchy data exports and block specimen production
- Updated
Nov 20, 2025 - Go
Building Json data pipeline within Snowflake using Streams and Tasks
- Updated
Nov 15, 2019 - TSQL
Domain-specific language to help build and maintain AWS Data Pipelines
- Updated
Aug 22, 2018 - Scala
This course is designed to provide learners with the fundamental skills needed for data engineering using Python. The objective is to introduce anyone interested in the topic to Python's data engineering-related features.
- Updated
Aug 15, 2024 - Jupyter Notebook
A GitHub Action to lint, test, build-docs, package, and run your kedro pipelines. Supports any Python version you'll give it (that is also supported by pyenv).
- Updated
Oct 12, 2025 - Shell
Go library that provides easy-to-use interfaces and tools for TensorFlow users, in particular allowing to train existing TF models on .tar and .tgz datasets
- Updated
Mar 13, 2024 - Go
A data pipeline project build on databricks and azure to demostrate lifecycle of a cloud data project.
- Updated
Jan 12, 2022 - Jupyter Notebook
High speed message passing between various queues and services
- Updated
Aug 2, 2017 - Go
Modeling tool like DBT to use SQL Alchemy core with a DataFrame interface like
- Updated
Apr 23, 2023 - Python
Improve this page
Add a description, image, and links to thedatapipeline topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thedatapipeline topic, visit your repo's landing page and select "manage topics."