data-lineage
Here are 78 public repositories matching this topic...
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.
Updated Dec 18, 2025 - TypeScript
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Updated Dec 16, 2025 - HTML
Collect, aggregate, and visualize a data ecosystem's metadata
Updated Dec 11, 2025 - Java
SQL Lineage Analysis Tool powered by Python
Updated Dec 17, 2025 - Python
First open-source data discovery and observability platform. We make life easy for data practitioners so you can focus on your business.
Updated Dec 16, 2025 - Java
One framework to develop, deploy and operate data workflows with Python and SQL.
Updated Dec 8, 2025 - Python
This dbt package captures metadata, artifacts, and test results so you can detect anomalies, monitor data quality, and build metadata tables. It powers Elementary OSS and feeds the wider context layer used by Elementary Cloud’s full Data & AI Control Plane.
Updated Dec 17, 2025 - Python
Metrics Observability & Troubleshooting
Updated Feb 29, 2024 - HTML
Generate and visualize data lineage from query history
Updated Aug 4, 2023 - Python
Main repo including core data model, data marts, data quality tests, and terminology sets.
Updated Dec 18, 2025 - HTML
Enterprise Information Service
Updated Dec 11, 2025 - Java
A data framework for biology. Makes your data queryable, traceable, reproducible, and FAIR. One API: lakehouse, lineage, feature store, ontologies, LIMS, ELN.
Updated Dec 17, 2025 - Python
Reference implementation for real-time data lineage tracking for BigQuery using Audit Logs, ZetaSQL, and Dataflow.
Updated Jun 3, 2024 - Java
Visualize column-level data lineage in Spark SQL
Updated May 13, 2022 - Scala
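Data lineage for Hive, Sqoop, HBase, Spark, and others: lineage events are sent to Kafka, then parsed and processed, with Neo4j used to generate the lineage graph.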
Updated Aug 6, 2021 - Java
🦆 Batch data pipeline with Airflow, DuckDB, Delta Lake, Trino, MinIO, and Metabase. Full observability and data quality.
Updated Nov 5, 2025 - Python
End-to-end DataOps platform deployed by Terraform.
Updated Mar 22, 2025 - Python
A boilerplate solution for processing image and PDF documents for regulated industries, with lineage and pipeline operations metadata services.
Updated Oct 25, 2021 - Python