databricks
Here are 1,376 public repositories matching this topic...
Language:All
Sort:Most stars
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
- Updated
Dec 15, 2025 - Python
📊 Cube Core is open-source semantic layer and LookML alternative for AI, BI and embedded analytics
- Updated
Dec 17, 2025 - Rust
🏆 实时 零代码、全功能、强安全 ORM 库 🚀 后端接口和文档零代码,前端(客户端) 定制返回 JSON 的数据和结构 🏆 Real-Time coding-free, powerful and secure ORM 🚀 providing APIs and Docs without coding by Backend, and the returned JSON of API can be customized by Frontend(Client) users
- Updated
Dec 7, 2025 - Java
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
- Updated
Jun 30, 2023 - Python
Simple and Distributed Machine Learning
- Updated
Dec 15, 2025 - Scala
A native Rust library for Delta Lake, with bindings into Python
- Updated
Dec 16, 2025 - Rust
.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
- Updated
Sep 24, 2025 - C#
🔥🔥🔥 Open source Reverse ETL - alternative to hightouch and census.
- Updated
Dec 17, 2025 - Ruby
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
- Updated
Dec 17, 2025 - Java
DataOps for Microsoft Data Platform technologies.https://aka.ms/dataops-repo
- Updated
Dec 16, 2025 - Shell
This repo provides a customizable stack for starting new ML projects on Databricks that follow production best-practices out of the box.
- Updated
Dec 12, 2025 - Python
Synmetrix – production-ready open source semantic layer on Cube
- Updated
Feb 7, 2025 - JavaScript
Databricks Terraform Provider
- Updated
Dec 17, 2025 - Go
🧱 Databricks CLI eXtensions - aka dbx is a CLI tool for development and advanced Databricks workflows management.
- Updated
Apr 22, 2025 - Python
Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POCs, and other uses in Databricks environments including in Delta Live Tables pipelines
- Updated
Dec 16, 2025 - Python
Compare MLOps Platforms. Breakdowns of SageMaker, VertexAI, AzureML, Dataiku, Databricks, h2o, kubeflow, mlflow...
- Updated
Nov 10, 2022
Drop-in replacement for Apache Spark UI
- Updated
Dec 1, 2025 - TypeScript
Databricks framework to validate Data Quality of pySpark DataFrames and Tables
- Updated
Dec 16, 2025 - Python
Improve this page
Add a description, image, and links to thedatabricks topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thedatabricks topic, visit your repo's landing page and select "manage topics."