
hadoop

Here are 3,570 public repositories matching this topic...

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

  • Updated Mar 20, 2024
  • Python

Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization, etc. It also comes with Hadoop support built in.

  • Updated May 16, 2025
  • Python

🏆 Real-time, coding-free, powerful and secure ORM 🚀 The backend provides APIs and docs without coding, and frontend (client) users can customize the data and structure of the returned JSON

  • Updated Jul 8, 2025
  • Java
presto

The official home of the Presto distributed SQL query engine for big data

  • Updated Jul 18, 2025
  • Java

Apache Hadoop

  • Updated Jul 18, 2025
  • Java

Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also includes samediff: a pytorch/tensorflow like library for running deep learn...

  • Updated Jul 18, 2025
  • Java

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

  • Updated Jul 18, 2025
  • Java

Focused on big data study and interview preparation: start down the road to big data mastery. Flink/Spark/Hadoop/Hbase/Hive...

  • Updated Aug 7, 2023
school-of-sre

At LinkedIn, we are using this curriculum for onboarding our entry-level talents into the SRE role.

  • Updated Aug 13, 2024
  • HTML

H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.

  • Updated Jul 18, 2025
  • Jupyter Notebook

Alluxio, data orchestration for analytics and machine learning in the cloud

  • Updated Apr 29, 2025
  • Java

1000+ DevOps Bash Scripts - AWS, GCP, Kubernetes, Docker, CI/CD, APIs, SQL, PostgreSQL, MySQL, Hive, Impala, Kafka, Hadoop, Jenkins, GitHub, GitLab, BitBucket, Azure DevOps, TeamCity, Spotify, MP3, LDAP, Code/Build Linting, pkg mgmt for Linux, Mac, Python, Perl, Ruby, NodeJS, Golang, Advanced dotfiles: .bashrc, .vimrc, .gitconfig, .screenrc, tmux..

  • Updated Jul 8, 2025
  • Shell

Apache Hive

  • Updated Jul 18, 2025
  • Java

Apache Calcite

  • Updated Jul 16, 2025
  • Java

Example source code accompanying O'Reilly's "Hadoop: The Definitive Guide" by Tom White

  • Updated Mar 17, 2020
  • Makefile

DataSphereStudio is a one-stop data application development and management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.

  • Updated Apr 2, 2025
  • Java

Apache Nutch is an extensible and scalable web crawler

  • Updated Jul 16, 2025
  • Java

Big data learning: learn big data from scratch, with study videos for each stage of learning and interview materials

  • Updated Jun 6, 2025
