Spark Notebook

Originally forked from the amazing scala-notebook, almost entirely refactored for Massive Dataset Analysis using Apache Spark.

The tool allows performing reproducible analysis with Scala, Apache Spark and more.

This is achieved through an interactive web-based editor that can combine Scala code, SQL queries, Markup or even JavaScript in a collaborative manner.

Spark is available out of the box and is accessed through the variable sparkContext.
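As a minimal sketch of what this looks like in practice, the cell below assumes it is run inside a Spark Notebook, where sparkContext is already defined. The value names (numbers, evenSum) are illustrative only and do not come from the project.

```scala
// Assumption: this runs in a Spark Notebook cell, where `sparkContext`
// is provided by the notebook and does not need to be created.

// Distribute a small local range across the cluster as an RDD.
val numbers = sparkContext.parallelize(1 to 100)

// Keep the even numbers and add them up with a distributed reduce.
val evenSum = numbers.filter(_ % 2 == 0).reduce(_ + _)

// Show the result in the cell output.
println(s"Sum of even numbers from 1 to 100: $evenSum") // 2550
```

Outside the notebook, the same code would first need a SparkContext to be created explicitly, which is the step the notebook takes care of.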

Quick Start

Want to try out Spark Notebook? Follow these steps:

  • Go to spark-notebook.io.
  • Download one of the builds (master is the latest, but unstable).
  • Extract the file somewhere convenient.
  • Open a terminal/command window.
  • Change to the root directory of the expanded distribution.
  • Execute the command bin/spark-notebook (*NIX) or bin\spark-notebook (Windows).
  • Open your browser to localhost:9000.

For details and configuration options, see Launch.

Run straight from sources (for geeks)

Check out the sources and run:

$ sbt run

Learn more

Come on over to Gitter to discuss things, get some help, or start contributing!

The documentation is being rewritten. Meanwhile, read the slightly outdated docs to learn about the advanced features and configuration and the different cluster deployment options (Amazon EMR, Mesos, YARN), and to find answers to FAQs.

Testimonials

Skymind - The Deeplearning4j

Spark Notebook gives us a clean, useful way to mix code and prose when we demo and explain our tech to customers. The Spark ecosystem needed this.

It allows our analysts and developers (15+ users) to run ad-hoc queries, to perform complex data analysis and data visualisations, prototype machine learning pipelines. In addition, we use it to power our BI dashboards.

Adopters

Name | Description
Data Fellas | Mad Data Science and Scalable Computing
Agile Lab | The only Italian Spark Certified systems integrator
CloudPhysics | DATA-DRIVEN INSIGHTS FOR SMARTER IT
Aliyun | Spark runtime environment on ECS and management tool of Spark Cluster running on Aliyun ECS
EMBL European Bioinformatics Institute | EMBL-EBI provides freely available data from life science experiments, performs basic research in computational biology and offers an extensive user training programme, supporting researchers in academia and industry.
Metail | The best body shape and garment fit company in the world. To create and empower everyone’s online body identity.
kt NexR | kt NexR has been one of the leading big data companies in Korea since 2007.
Skymind | At Skymind, we’re tackling some of the most advanced problems in data analysis and machine intelligence. We offer state-of-the-art, flexible, scalable deep learning for industry.
Amino | A new way to get the facts about your health care choices.
Vinted | Online marketplace and a social network focused on young women’s lifestyle.
Vingle | Vingle is the community where you can meet someone like you.
47 Degrees | 47 Degrees is a global consulting firm and certified Typesafe & Databricks Partner specializing in Scala & Spark.
Barclays | Barclays is a British multinational banking and financial services company headquartered in London.
Swisscom | Swisscom is the leading mobile service provider in Switzerland.
Knoldus | Knoldus is a global consulting firm and certified "Select" Lightbend & Databricks Partner specializing in the Scala & Spark ecosystem.

