medallion-architecture
Here are 204 public repositories matching this topic...
A comprehensive guide to building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.
- Updated Apr 23, 2025 - TSQL
A cloud-native data pipeline and visualization project analyzing Formula 1 racing data using Azure, Databricks, Delta Lake, Tableau, and Python for insightful EDA and interactive dashboards.
- Updated Jul 23, 2025 - Jupyter Notebook
🦆 Batch data pipeline with Airflow, DuckDB, Delta Lake, Trino, MinIO, and Metabase. Full observability and data quality.
- Updated Nov 5, 2025 - Python
A production-ready PySpark project template with medallion architecture, Python packaging, unit tests, integration tests, CI/CD automation, Databricks Asset Bundles, and DQX data quality framework.
- Updated Nov 13, 2025 - Python
Arcane Insight is a data analytics project designed to harness the power of SQLMesh & DuckDB to collect, transform, and analyze data from Blizzard’s Hearthstone API. Focused on card statistics and attributes, this project reveals detailed insights into card mechanics, strengths, and trends to support BI and strategic analysis.
- Updated Jan 23, 2025 - Python
Databricks DLT Apparel Pipeline Project: Learn medallion architecture, streaming, and data engineering with Delta Live Tables. Includes synthetic data, step-by-step guide, and certification prep.
- Updated Nov 4, 2025 - Python
Building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.
- Updated May 3, 2025 - TSQL
'Talk to Your Factory' demo leveraging Edge (Azure IoT Operations), Cloud (Microsoft Fabric), and a Factory Agent (Azure OpenAI), to streamline factory operations. It allows real-time, natural language communication with factory systems, helping operators quickly identify issues, boost efficiency, and minimize downtime.
- Updated Apr 16, 2025 - Python
Revolutionary AI ETL with Medallion Architecture: Zero-touch autonomous & HITL pipelines on Databricks
- Updated Aug 7, 2025 - Jupyter Notebook
This project implements a Lakehouse Medallion Architecture using modern data stack tools such as Fivetran, Snowflake, and dbt. The fictitious organization is an e-commerce company.
- Updated Sep 30, 2024 - Python
Unified Data Foundation with Microsoft Fabric with Options to Integrate with Azure Databricks and Microsoft Purview
- Updated Dec 11, 2025 - Jupyter Notebook
Building a modern data warehouse with SQL Server, including ETL processes, data modeling and analytics
- Updated Oct 1, 2025 - TSQL
This repo provides a step-by-step approach to building a modern data warehouse using PostgreSQL. It covers the ETL (Extract, Transform, Load) process, data modeling, exploratory data analysis (EDA), and advanced data analysis techniques.
- Updated Mar 7, 2025 - PLpgSQL
Development scaffold for test-driven PySpark Structured Streaming with fast local testing.
- Updated Nov 21, 2025 - Python
Building a modern data warehouse with Microsoft SQL Server, including ETL processes, data modeling, and analytics.
- Updated Dec 3, 2025 - TSQL
Building a Data Lakehouse using the Medallion architecture.
- Updated Sep 1, 2024 - Jupyter Notebook
End-to-end data pipeline transforming Olist e-commerce data through Azure cloud services. Implements medallion architecture (Bronze-Silver-Gold) with multi-source ingestion, Spark-based processing, and OLTP-to-OLAP optimization for analytics-ready datasets.
- Updated Nov 4, 2025 - Jupyter Notebook
Enterprise-grade Data Platform for NYC Taxi Analytics. Orchestrated with Airflow (Astro) & dbt, served via FastAPI & Power BI. Features Medallion Architecture, Data Quality Observability (Slack), and Star Schema modeling.
- Updated Dec 10, 2025 - Python
Extracts data from multiple databases in the Labor, Invalids and Social Affairs sectors, converts it to an appropriate structure and format, and loads it into a shared data warehouse and data mart, so that staff at state agencies can easily retrieve and analyze the compiled data.
- Updated Sep 5, 2024 - PLpgSQL
End-to-end Azure Data Engineering project using Medallion Architecture, Databricks, Synapse, and Power BI.
- Updated May 5, 2025 - Jupyter Notebook