mapreduce-python
Here are 97 public repositories matching this topic...
Sort:Most stars
Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University
- Updated
Dec 3, 2024 - HTML
《大数据挖掘技术》@复旦 课程项目,试图从搜狗实验室用户查询日志数据(2008)中找出搜索记录中有较高支持度关键词的频繁二项集。在实现层面上,我搭建了一个由五台服务器组成的微型 Hadoop 集群,并且用 Python 实现了 Parallel FP-Growth 算法中的三个 MapReduce 过程。
- Updated
Mar 29, 2021 - Python
KMeans, Cure and Canpoy algorithms are demonstrated using Pyspark.
- Updated
May 19, 2021 - Jupyter Notebook
基于Item-based CF和XGBRegressor完成的用户对商品的推荐系统
- Updated
Jul 6, 2021 - Python
Using hadoop to utilize data from an automobile tracking platform that tracks the history of important incidents after the initial sale of a new vehicle.
- Updated
Feb 19, 2022 - Python
Lambda to start EMR and run a map reduce job
- Updated
Aug 16, 2019 - Python
This repository have codes that extracts meaningful information from News headline data-set.
- Updated
Apr 28, 2019 - Python
A REST-based service that translates the SQL query into MapReduce and Spark jobs. It runs these jobs and provides the JSON object. SQL to MapReduce and Spark translator.
- Updated
Sep 30, 2020 - Python
⚡️공개용 맵리듀스 플랫폼인 Spark를 사용하여 데이터마이닝을 해보자⚡️
- Updated
Dec 11, 2020 - Jupyter Notebook
Pulled 10GB ofYelp Business data through the terminal via Kaggle API. The data was then pushed to and AWS S3 Bucket bucket for storage and analyzed on a Elastic MapReduce Cluster on a Jupyter Notebook using PySpark
- Updated
Aug 21, 2023 - Jupyter Notebook
A repository containing the source codes for the assignments done as a part of the Big Data course (UE18CS322) at PES University.
- Updated
Jan 14, 2021 - Python
Market basket analysis of finding frequent itemsets using SON algorithm in Spark
- Updated
Oct 6, 2018 - Python
MapReduce is a programming model and an associated implementation for processing and generating large data sets. Users specify a map function that processes a key/value pair to generate a set of intermediate key/value pairs, and a reduce function that merges all intermediate values associated with the same intermediate key. Many real-world tasks…
- Updated
Apr 24, 2024 - Python
Lightweight and extensible library to execute MapReduce-like jobs in Python
- Updated
Jan 2, 2025 - Python
Performing Map reduce to get the page rank on the WDC data.
- Updated
Nov 24, 2023 - Python
Programs for MapReduce written in java with least complexity!
- Updated
Feb 3, 2024 - Java
Understand how map reduce works for parsing a text data with parallel processing of sub tasks using multi threading
- Updated
Jan 15, 2021 - Python
Improve this page
Add a description, image, and links to themapreduce-python topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with themapreduce-python topic, visit your repo's landing page and select "manage topics."