Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings
#

apache-spark

spark logo

Apache Spark is an open source distributed general-purpose cluster-computing framework. It provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.

Here are 2,077 public repositories matching this topic...

mlflow

The open source developer platform to build AI/LLM applications and models with confidence. Enhance your AI applications with end-to-end tracking, observability, and evaluations, all in one integrated platform.

  • UpdatedNov 6, 2025
  • Python
SynapseMLlakeFS

酷玩 Spark: Spark 源代码解析、Spark 类库等

  • UpdatedMay 18, 2022
  • Scala

Interactive and Reactive Data Science using Scala and Spark.

  • UpdatedMay 16, 2023
  • JavaScript

Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.

  • UpdatedNov 5, 2025
  • Go

BigDL: Distributed TensorFlow, Keras and PyTorch on Apache Spark/Flink & Ray

  • UpdatedOct 14, 2025
  • Jupyter Notebook

Apache Spark docker image

  • UpdatedApr 21, 2023
  • Shell

A curated list of awesome Apache Spark packages and resources.

  • UpdatedOct 24, 2024
  • Shell

Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning

  • UpdatedAug 16, 2021
  • Java

SQL data analysis & visualization projects using MySQL, PostgreSQL, SQLite, Tableau, Apache Spark and pySpark.

  • UpdatedJul 18, 2022
  • Jupyter Notebook

The Internals of Apache Spark

  • UpdatedJul 5, 2025
goodreads_etl_pipeline

This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]

  • UpdatedJan 28, 2025
  • Scala

PySpark + Scikit-learn = Sparkit-learn

  • UpdatedDec 31, 2020
  • Python
graphframes

GraphFrames is a package for Apache Spark which provides DataFrame-based Graphs

  • UpdatedOct 28, 2025
  • Scala

(Deprecated) Scikit-learn integration package for Apache Spark

  • UpdatedDec 3, 2019
  • Python

Created by Matei Zaharia

Released May 26, 2014

Followers
433 followers
Repository
apache/spark
Website
github.com/topics/spark
Wikipedia
Wikipedia

Related topics

hadoop scala

[8]ページ先頭

©2009-2025 Movatter.jp