Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings
#

big-data

Here are 5,304 public repositories matching this topic...

awesome-scalability

ClickHouse® is a real-time analytics database management system

  • UpdatedFeb 7, 2026
  • C++

Apache Spark - A unified analytics engine for large-scale data processing

  • UpdatedFeb 7, 2026
  • Scala

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

  • UpdatedMar 20, 2024
  • Python

Apache Flink

  • UpdatedFeb 7, 2026
  • Java
thingsboard

Open-source IoT Platform - Device management, data collection, processing and visualization.

  • UpdatedFeb 7, 2026
  • Java
presto

The official home of the Presto distributed SQL query engine for big data

  • UpdatedFeb 7, 2026
  • Java

The Data Engineering Cookbook

  • UpdatedJan 17, 2026
  • Python

PredictionIO, a machine learning server for developers and ML engineers.

  • UpdatedJan 9, 2021
  • Scala

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

  • UpdatedFeb 7, 2026
  • Java

A distributed, fast open-source graph database featuring horizontal scalability and high availability

  • UpdatedOct 22, 2025
  • C++

CMAK is a tool for managing Apache Kafka clusters

  • UpdatedAug 2, 2023
  • Scala
kafka-ui

The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance for multi-dimensional analytics, real-time analytics, and ad-hoc queries. A Linux Foundation project.

  • UpdatedFeb 7, 2026
  • Java
quickwit

Cloud-native search engine for observability. An open-source alternative to Datadog, Elasticsearch, Loki, and Tempo.

  • UpdatedFeb 6, 2026
  • Rust

The most widely used Python to C compiler

  • UpdatedFeb 7, 2026
  • Cython

A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.

  • UpdatedFeb 7, 2026
  • C++

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

  • UpdatedFeb 7, 2026
  • Scala

Improve this page

Add a description, image, and links to thebig-data topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with thebig-data topic, visit your repo's landing page and select "manage topics."

Learn more


[8]ページ先頭

©2009-2026 Movatter.jp