data-pipeline-automation
Here are 11 public repositories matching this topic...
End-to-end data pipeline transforming Olist e-commerce data through Azure cloud services. Implements medallion architecture (Bronze-Silver-Gold) with multi-source ingestion, Spark-based processing, and OLTP-to-OLAP optimization for analytics-ready datasets.
- Updated Nov 4, 2025 - Jupyter Notebook
FabricEngineer is a comprehensive Python package designed specifically for Microsoft Fabric developers to streamline data transformation workflows and automate complex ETL processes. This package provides enterprise-grade solutions for building robust data pipelines with minimal boilerplate code.
- Updated Aug 27, 2025 - Jupyter Notebook
A full-stack financial application featuring real-time forex rates and machine learning-powered price predictions across multiple currency pairs. Built with automated data pipelines and Prophet time series forecasting.
- Updated Oct 29, 2025 - Jupyter Notebook
A complete reference implementation of a local-first ecosystem for AI-powered analytics. This repository contains the source code for the SBDK.dev website, the central hub for the SBDK suite of open-source tools.
- Updated Nov 25, 2025 - TypeScript
End-to-End PV Monitoring & Streaming Pipeline with Delta Lake
- Updated Aug 12, 2025
An end-to-end, multi-container solution for collecting and visualizing stock market data
- Updated Nov 21, 2025 - Python
Data automation involves automating the extraction, transformation, and loading (ETL) processes to streamline data workflows. GitHub Actions enables automated execution of tasks, such as building, testing, and deploying code, in response to events. This integration simplifies continuous deployment and ensures repeatable data pipeline operations.
- Updated Feb 25, 2025 - HTML
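The extract, transform, and load steps named in the description above can be sketched as three small functions. This is a minimal illustration and not code from the listed repository: the sample data, the `prices` table, and all function names are hypothetical, and an in-memory SQLite database stands in for whatever store a real pipeline would target.

```python
import csv
import io
import sqlite3

# Hypothetical raw input; a real pipeline would read from a file, API, or scraper.
RAW_CSV = """item,price,date
A, 1.59 ,2025-01-01
B,,2025-01-01
C,1.62,2025-01-01
"""


def extract(text):
    """Parse CSV text into a list of dict rows."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Drop rows with a missing price and convert the rest to typed tuples."""
    cleaned = []
    for row in rows:
        price = row["price"].strip()
        if not price:
            continue  # skip incomplete records
        cleaned.append((row["item"].strip(), float(price), row["date"].strip()))
    return cleaned


def load(conn, records):
    """Upsert records so that re-running the pipeline does not duplicate rows."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS prices ("
        "item TEXT, price REAL, date TEXT, PRIMARY KEY (item, date))"
    )
    conn.executemany("INSERT OR REPLACE INTO prices VALUES (?, ?, ?)", records)
    conn.commit()
    return conn.execute("SELECT COUNT(*) FROM prices").fetchone()[0]


def run_pipeline(conn, text=RAW_CSV):
    """Run extract, transform, and load in order; return the stored row count."""
    return load(conn, transform(extract(text)))


if __name__ == "__main__":
    connection = sqlite3.connect(":memory:")
    print(run_pipeline(connection))
```

Keying the table on `(item, date)` and using `INSERT OR REPLACE` keeps the load step idempotent, which matters when a scheduler such as a GitHub Actions workflow may run the same job more than once.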
Automated data pipeline that scrapes fuel prices daily, applies transformations, and loads the data into a PostgreSQL database.
- Updated Dec 4, 2025 - Python
End-to-end data engineering pipeline for real-time weather data using Python, PostgreSQL, and Streamlit.
- Updated Nov 14, 2025 - Python
This project is an end-to-end machine learning pipeline for predicting diamond prices based on various features. It includes data preprocessing, model training, and prediction scripts, and is designed for easy setup and use by anyone familiar with Python and data science.
- Updated Jul 26, 2025 - Jupyter Notebook
Automated pipeline to scrape and compare gaming gear prices & ratings from Amazon and eBay, including laptops, mice, Nintendo Switch, PS5, and Xbox. Outputs daily CSVs, SQL-ready datasets, and marketplace insights for competitive analysis.
- Updated Sep 20, 2025 - Jupyter Notebook