Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

Interactive and Reactive Data Science using Scala and Spark.

License

NotificationsYou must be signed in to change notification settings

spark-notebook/spark-notebook

Repository files navigation

Gitter

The Spark Notebook is the open source notebook aimed at enterprise environments, providing Data Scientists and Data Engineers with an interactive web-based editor that can combine Scala code, SQL queries, Markup and JavaScript in a collaborative manner to explore, analyse and learn from massive data sets.

notebook intro

The Spark Notebook allows performingreproducible analysis with Scala, Apache Spark and the Big Data ecosystem.

Features Highlights

Apache Spark

Apache Spark is available out of the box, and is simply accessed by the variablesparkContext orsc.

Multiple Spark Context Support

One of the top most useful feature brought by the Spark Notebook is its separation of the running notebooks.Each started notebook will spawn a new JVM with its own SparkSession instance. This allows a maximal flexibility for:

  • dependencies without clashes
  • access different clusters
  • tune differently each notebook
  • external scheduling (on the roadmap)

Metadata-driven configuration

We achieve maximum flexibility with the availability of multiplesparkContexts by enablingmetadata driven configuration.

Scala

The Spark Notebook supports exclusively the Scala programming language, theUnpredicted Lingua Franca for Data Science and extensibly exploits the JVM ecosystem of libraries to drive an smooth evolution of data-driven software from exploration to production.

The Spark Notebook is available for *NIX and Windows systems in easy to use ZIP/TAR, Docker and DEB packages.

Reactive

All components in the Spark Notebook are dynamic and reactive.

The Spark Notebook comes with dynamic charts and most (if not all) components can be listened for and can react to events. This is very helpful in many cases, for example:

  • data entering the system live at runtime
  • visually plots of events
  • multiple interconnected visual componentsDynamic and reactive components mean that you don't have write the html, js, server code just for basic use cases.

Quick Start

Go toQuick Start for our 5-minutes guide to get up and running with the Spark Notebook.

C'mon on toGitterto discuss things, to get some help, or to start contributing!

Learn more

Testimonials

Spark Notebook gives us a clean, useful way to mix code and prose when we demo and explain our tech to customers. The Spark ecosystem needed this.

It allows our analysts and developers (15+ users) to run ad-hoc queries, to perform complex data analysis and data visualisations, prototype machine learning pipelines. In addition, we use it to power our BI dashboards.

Adopters

NameLogoURLDescription
KensuKensuwebsiteLifting Data Science to the Enterprise level
Agile LabAgile LabwebsiteThe only Italian Spark Certified systems integrator
CloudPhysicsCloudPhysicswebsiteData-Driven Inisghts for Smarter IT
AliyunAlibaba - Aliyun ECSproductSpark runtime environment on ECS and management tool of Spark Cluster running on Aliyun ECS
EMBL European Bioinformatics InstituteEMBL - EBIwebsiteEMBL-EBI provides freely available data from life science experiments, performs basic research in computational biology and offers an extensive user training programme, supporting researchers in academia and industry.
MetailMetailwebsiteThe best body shape and garment fit company in the world. To create and empower everyone’s online body identity.
kt NexRkt NexRwebsitethe kt NexR is one of the leading BigData company in the Korea from 2007.
SkymindwebsiteAt Skymind, we’re tackling some of the most advanced problems in data analysis and machine intelligence. We offer start-of-the-art, flexible, scalable deep learning for industry.
AminowebsiteA new way to get the facts about your health care choices.
VintedVintedwebsiteOnline marketplace and a social network focused on young women’s lifestyle.
VingleVinglewebsiteVingle is the community where you can meet someone like you.
47 Degreeswebsite47 Degrees is a global consulting firm and certified Typesafe & Databricks Partner specializing in Scala & Spark.
BarclaysBarclayswebsiteBarclays is a British multinational banking and financial services company headquartered in London.
SwisscomSwisscomwebsiteSwisscom is the leading mobile service provider in Switzerland.
KnoldusknolduswebsiteKnoldus is a global consulting firm and certified "Select" Lightbend & Databricks Partner specializing in Scala & Spark ecosystem.

[8]ページ先頭

©2009-2025 Movatter.jp