data-testing
Here are 34 public repositories matching this topic...
⚡ Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
- Updated Dec 17, 2025 - Python
re_data - fix data issues before your users & CEO would discover them 😊
- Updated Apr 30, 2024 - HTML
Code review for data in dbt
- Updated Jan 3, 2025 - Python
Data validation toolkit for assessing and monitoring data quality.
- Updated Dec 2, 2025 - Python
Various files useful for manual testing and test automation etc.
- Updated Jul 25, 2022
Great Expectations Airflow operator
- Updated Dec 5, 2025 - Python
re_data - fix data issues before your users & CEO would discover them 😊
- Updated May 6, 2024 - Python
A simple and easy to use Data Validation library for Python.
- Updated Dec 5, 2025 - Python
Test data management tool for any data source, batch or real-time. Generate, validate and clean up data all in one tool.
- Updated Dec 11, 2025 - Scala
DataOps Data Quality TestGen is part of DataKitchen's Open Source Data Observability. DataOps TestGen delivers simple, fast data quality test generation and execution by data profiling, new dataset hygiene review, AI generation of data quality validation tests, ongoing testing of data refreshes, & continuous anomaly monitoring
- Updated Dec 16, 2025 - Python
Soda Spark is a PySpark library that helps you test your data in Spark DataFrames.
- Updated Jun 22, 2022 - Python
Develop a data science project using historical sales data to build a regression model that accurately predicts future sales. Preprocess the dataset, conduct exploratory analysis, select relevant features, and employ regression algorithms for model development. Evaluate model performance, optimize hyperparameters, and provide actionable insights.
- Updated Jul 14, 2023 - Jupyter Notebook
⚡ Prevent downstream data quality issues by integrating the Soda Library into your CI/CD pipeline.
- Updated Oct 1, 2024 - Python
This library is inspired by the Great Expectations library. It makes the various expectations found in Great Expectations available through Python's built-in unittest assertions.
- Updated Feb 3, 2022 - Python
Spark Data Test - A PySpark-based automation testing utility to compare Spark DataFrames
- Updated Oct 20, 2025 - Python
Example API implementation for Data Caterer
- Updated Aug 8, 2025 - Scala
DataBridge Quality Control
- Updated Aug 25, 2025 - Go
Data and pipeline testing with and for SQL
- Updated Apr 17, 2025 - Python
Simple DB Fixtures for Sails.js v1 (fake data for testing).
- Updated Nov 16, 2024 - JavaScript
Data generation and validation tool for any data source
- Updated Feb 20, 2024 - Scala