rdd
Here are 210 public repositories matching this topic...
Language:All
Sort:Most stars
C# and F# language binding and extensions to Apache Spark
- Updated
Dec 11, 2025 - C#
O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian
- Updated
Jun 26, 2023 - Python
Spark RDD with Lucene's query and entity linkage capabilities
- Updated
Sep 8, 2025 - Scala
Data cleaning, pre-processing, and Analytics on a million movies using Spark and Scala.
- Updated
May 19, 2021 - Scala
A framework for Spatio-Temporal Data Analytics on Spark
- Updated
May 4, 2021 - Scala
Pyspark in Google Colab: A simple machine learning (Linear Regression) model
- Updated
Apr 15, 2019 - Jupyter Notebook
Code/Notes for the Data Engineering Zoomcamp by DataTalksClub
- Updated
Mar 16, 2023 - Jupyter Notebook
Causal Inference Using Quasi-Experimental Methods
- Updated
Jan 15, 2021
Spark access to Common Information Model (CIM) files
- Updated
Jul 6, 2023 - Scala
Guide to Clojure REPL Driven Development with Emacs Doom
- Updated
May 12, 2025 - HTML
Sentiment Analysis and Data Visualization
- Updated
May 20, 2018 - Python
openmrs - mysql - debezium - kafka - spark - scala
- Updated
Mar 18, 2020 - TSQL
rddapp: Regression Discontinuity Design Application
- Updated
Sep 2, 2025 - HTML
A bunch of low-level basic methods for data processing and monitoring with Scala Spark
- Updated
Jun 29, 2018 - Scala
Improve this page
Add a description, image, and links to therdd topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with therdd topic, visit your repo's landing page and select "manage topics."