big-data-projects
Here are 22 public repositories matching this topic...
Data cleaning, pre-processing, and analytics on a million movies using Spark and Scala.
- Updated
May 19, 2021 - Scala
Big data projects implemented by Maniram Yadav
- Updated
May 5, 2018 - PigLatin
Drools processor for Apache NiFi
- Updated
Oct 23, 2019 - Java
COMP90024 - Cluster and Cloud Computing - 2020S1 - Assignment 2
- Updated
Nov 25, 2020 - HTML
Eskimo is a state-of-the-art Big Data Infrastructure and Management Web Console to build, manage, and operate Big Data 2.0 Analytics clusters on Kubernetes. This is the git repository of Eskimo Community Edition.
- Updated
Sep 14, 2023 - Java
This repo contains a Big Data project: "Real Time Twitter Sentiment Analysis via Kafka, Spark Streaming, MongoDB and Django Dashboard".
- Updated
May 20, 2024 - Jupyter Notebook
This project aims to predict smartphone prices using a combination of batch and stream processing techniques in a Big Data environment. The architecture follows the Lambda Architecture pattern, providing both real-time and batch processing capabilities to users.
- Updated
Apr 15, 2024 - Python
Personal solutions to big data problem scenarios using Scala
- Updated
Feb 11, 2018 - Scala
COMP90024 - Cluster and Cloud Computing - 2020S1 - Assignment 1
- Updated
Apr 24, 2020 - Python
A Multilevel Streaming Data Analytics Infrastructure for Predictive Analytics
- Updated
Feb 12, 2022
College coursework for the Topics in Big Data course, written in Python (not yet finished). The code specifically reads the spreadsheet of Brazilian federative units in a state of food insecurity.
- Updated
Nov 13, 2024 - Python
Supporting code for big-data analysis in linguistics
- Updated
Apr 23, 2023 - Jupyter Notebook
Simplified Hadoop Setup and Configuration Automation
- Updated
Sep 2, 2023 - Shell
As the final project for the Big Data Systems & Analysis class, we designed and implemented a system that, once integrated with a live chat (Twitch, YouTube, Zoom, Twitter, etc.), groups similar questions together so that the host can better address them.
- Updated
Jan 13, 2023 - Jupyter Notebook
Welcome to the Big Data Lab Google Colab repository! 🚀
- Updated
Oct 16, 2023 - Jupyter Notebook
This repository contains my weekly projects from the Big Data Analytics II course at my college
- Updated
May 31, 2022 - SAS
A next-generation hybrid data pipeline architecture that combines Microsoft Fabric, Azure Cloud, and Power BI. The pipeline is designed for real-time data ingestion, multi-layered processing, and analytics that deliver business-critical insights.
- Updated
Dec 29, 2024 - Python
- Updated
May 2, 2018 - Jupyter Notebook
This repository contains my weekly projects from the Big Data Analytics II course at my college
- Updated
May 29, 2022 - SAS