spark-kafka-integration
Here are 12 public repositories matching this topic...
Sort:Most stars
Apache Spark is a fast, in-memory data processing engine with elegant and expressive development API's to allow data workers to efficiently execute streaming, machine learning or SQL workloads that require fast iterative access to datasets.This project will have sample programs for Spark in Scala language .
- Updated
Nov 16, 2022 - Scala
A structured streaming was applied to the robot data from ROS-Gazebo simulation environment using Apache Spark. Data is collected in Kafka, analyzed by Apache Spark and stored in Cassandra.
- Updated
Feb 6, 2022 - Python
Example for Data Reading from and Writing to from Kafka Topic using Apache Spark DataFrame and DataSet
- Updated
Oct 13, 2017 - Scala
Use this project to join data from multiple csv files. Currently in this project we support one to one and one to many join. Along with this you can find how to use kafka producer efficiently with spark.
- Updated
Jul 1, 2022 - Java
spark-kafka-integration
- Updated
Mar 20, 2019 - Scala
Apache Spark and Apache Kafka integration and Spark Analytics over DataFrame.
- Updated
Feb 18, 2017 - Java
A Log Analytics demo based on Spark Structured Streaming + Kafka
- Updated
Aug 8, 2019 - Python
- Updated
Jan 4, 2019 - Scala
- Updated
May 20, 2018 - Java
Dumps events stored in Kafka to any Hadoop supported file system using Spark Streaming
- Updated
May 14, 2018 - Scala
Improve this page
Add a description, image, and links to thespark-kafka-integration topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thespark-kafka-integration topic, visit your repo's landing page and select "manage topics."