Sparkly Notebook: Interactive Analysis and Visualization with Spark


Details
Now with Zeppelin and Lightning! Seating is limited, so please RSVP!
How would you explore your rich data with Spark? In this presentation we will walk through how to set up, configure, and start running reproducible, collaborative, richly interactive notebooks and visualize your Big Data.
This session will be heavy on demos and walkthroughs, all running on real code. We will show a few cool visualizations, discuss several recent open source projects, and give technology overviews of the components involved. The talk is applicable to Data Scientists, Data Infrastructure Engineers, and Developers alike.
Avvo (http://www.avvo.com/) will sponsor food and drinks (all ages & 21+). There will also be Avvo swag!
Agenda:
6:00pm: Come to Avvo, socialize over food, drinks and beer!
6:30pm: Introduction
6:35pm: Part 1: Notebooks
- REPL to notebook
- IPython with PySpark
- ecosystem of visuals powered by your cluster
- Zeppelin - polyglot, reproducible results and collaboration
7:05pm: Part 2: Streaming machine learning
- Spark's Streaming k-means
- Lightning - visualizing streaming clusters
7:35pm: Q&A
This presentation will include code examples in Python, Scala, and SQL.
Parking: Lower-cost or free parking is available on the street. The Union Station Garage is under the building (http://www.yelp.com/biz/union-station-parking-garage-seattle-2 ; enter on 4th Ave S, heading north, before S Jackson St). It costs $7 for 0-1 hour and $9 for 1-2 hours, and closes at 9pm. Uwajimaya Village Parking is across the street at $7.50 (free with Uwajimaya purchases).
