
mapreduce-python

Here are 97 public repositories matching this topic...

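Course project for "Big Data Mining Techniques" at Fudan University. It attempts to find frequent 2-itemsets of keywords with high support in the search records of the Sogou Labs user query log data (2008). On the implementation side, I built a small Hadoop cluster of five servers and implemented the three MapReduce stages of the Parallel FP-Growth algorithm in Python.

  • Updated Mar 29, 2021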
  • Python

K-Means, CURE, and Canopy algorithms demonstrated using PySpark.

  • Updated May 19, 2021
  • Jupyter Notebook

A user-to-item recommender system built with item-based CF and XGBRegressor.

  • Updated Jul 6, 2021
  • Python

Uses Hadoop to process data from an automobile tracking platform that tracks the history of important incidents after the initial sale of a new vehicle.

  • Updated Feb 19, 2022
  • Python

A Lambda function that starts EMR and runs a MapReduce job.

  • Updated Aug 16, 2019
  • Python

This repository contains code that extracts meaningful information from a news headline dataset.

  • Updated Apr 28, 2019
  • Python

A REST-based SQL-to-MapReduce and SQL-to-Spark translator. The service translates a SQL query into MapReduce and Spark jobs, runs them, and returns the result as a JSON object.

  • Updated Sep 30, 2020
  • Python

⚡️ Let's try data mining with Spark, an open MapReduce platform ⚡️

  • Updated Dec 11, 2020
  • Jupyter Notebook

Pulled 10 GB of Yelp business data through the terminal via the Kaggle API. The data was then pushed to an AWS S3 bucket for storage and analyzed on an Elastic MapReduce cluster in a Jupyter Notebook using PySpark.

  • Updated Aug 21, 2023
  • Jupyter Notebook

Hadoop MapReduce Python

  • Updated Dec 21, 2022
  • Python

A repository containing the source code for the assignments done as part of the Big Data course (UE18CS322) at PES University.

  • Updated Jan 14, 2021
  • Python

Market basket analysis: finding frequent itemsets using the SON algorithm in Spark.

  • Updated Oct 6, 2018
  • Python

MapReduce is a programming model and an associated implementation for processing and generating large data sets. Users specify a map function that processes a key/value pair to generate a set of intermediate key/value pairs, and a reduce function that merges all intermediate values associated with the same intermediate key. Many real-world tasks…

  • Updated Apr 24, 2024
  • Python
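
The map and reduce roles described in the entry above can be illustrated with a minimal single-process sketch. This is not code from that repository: `map_reduce`, `word_count_mapper`, and `word_count_reducer` are hypothetical names, and word counting is an assumed example task. A real framework such as Hadoop would distribute the same phases across machines.

```python
from collections import defaultdict


def map_reduce(records, mapper, reducer):
    """Run a map phase, group by intermediate key, then run a reduce phase.

    Hypothetical helper for illustration only; everything runs in one process.
    """
    # Map phase: each input (key, value) pair yields intermediate pairs,
    # which are grouped by intermediate key (the "shuffle" step).
    groups = defaultdict(list)
    for key, value in records:
        for out_key, out_value in mapper(key, value):
            groups[out_key].append(out_value)

    # Reduce phase: merge all intermediate values that share a key.
    return {out_key: reducer(out_key, values) for out_key, values in groups.items()}


def word_count_mapper(_doc_id, text):
    # Emit (word, 1) for every word in the document text.
    for word in text.lower().split():
        yield word, 1


def word_count_reducer(_word, counts):
    # Sum the per-occurrence counts emitted for one word.
    return sum(counts)


# Assumed sample input: (document id, document text) pairs.
documents = [("doc1", "map reduce map"), ("doc2", "reduce the data")]
counts = map_reduce(documents, word_count_mapper, word_count_reducer)
print(counts)
```

Grouping the intermediate pairs in a dictionary stands in for the shuffle step that a distributed implementation performs between the two phases.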

Uses MapReduce to compute PageRank on the WDC data.

  • Updated Nov 24, 2023
  • Python

MapReduce programs written in Java with minimal complexity!

  • Updated Feb 3, 2024
  • Java

Modified from big-data-europe/docker-hadoop

  • Updated Feb 14, 2024
  • Python

MapReduce presentation

  • Updated Sep 26, 2022

Understand how MapReduce works by parsing text data with parallel processing of subtasks using multithreading.

  • Updated Jan 15, 2021
  • Python
