Spark Double Header


Details
Please join us at Endgame for pizza and beer for a Spark double header. Info below on our two talks and speakers:
Classical Distributed Computing Studies: Can Catalyst save us from Amdahl's Law? (Sorry, no.)
A quick look at a couple of relevant results from computer science theory and the role they play in designing data processing systems with Spark.
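For anyone who wants a refresher before the talk: Amdahl's Law bounds the speedup of a job by the fraction of it that cannot be parallelized. The snippet below is a minimal sketch of that formula only. It is not taken from the talk, and the function name `amdahl_speedup` and the 95% figure are our own illustrative assumptions.

```python
def amdahl_speedup(parallel_fraction: float, workers: int) -> float:
    """Theoretical speedup under Amdahl's Law.

    parallel_fraction: share of the runtime that can be spread across workers (0..1).
    workers: number of workers (cores, executors, ...) running the parallel part.
    """
    if not 0.0 <= parallel_fraction <= 1.0:
        raise ValueError("parallel_fraction must be between 0 and 1")
    if workers < 1:
        raise ValueError("workers must be at least 1")
    serial_fraction = 1.0 - parallel_fraction
    # The serial part takes the same time no matter how many workers we add.
    return 1.0 / (serial_fraction + parallel_fraction / workers)


if __name__ == "__main__":
    # Hypothetical job in which 95% of the work parallelizes.
    for workers in (1, 10, 100, 1000):
        print(workers, round(amdahl_speedup(0.95, workers), 2))
```

With 5% of the work stuck on a single thread, the speedup can never exceed 1 / 0.05 = 20, however large the cluster.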
Rich Seymour is a Senior Data Scientist at Endgame working with a full stack of assorted tools, from Apache Spark to React.js and everything in between. Prior to Endgame, Rich spent a large chunk of time earning a PhD in Materials Science, running molecular dynamics simulations on high-performance computing clusters.
Exploratory Analysis with Spark SQL and Zeppelin
Apache Zeppelin and similar analytics notebooks are becoming increasingly popular for exploratory analysis and prototyping data pipelines. In this talk I’ll cover how Spark SQL DataFrames and Zeppelin together provide a simple yet powerful way to explore and visualize your data.
We’ll cover:
· Loading data from built-in and third-party data sources
· Visualizing your queries with Zeppelin’s charts
· Optimizing your queries for faster response times
· Performing more advanced queries with user-defined functions (UDFs) and window functions
Silvio Fiorito started his career in software development at the height of the dot-com boom in Northern Virginia, where he worked on some of the earliest web properties. For the last few years his focus has been on security and big data development. Disappointed with the limitations of existing frameworks, he started experimenting with Spark early on, at version 0.6, and has been hooked ever since. His passion is creating more powerful and insightful user experiences for exploratory analysis with tools like Spark. Silvio is a Databricks Certified Spark Developer and MetiStream Spark Instructor.
Parking and Metro:
There is a public garage below Endgame's building, and several more within a few blocks. Endgame is directly across from the Clarendon Metro stop on the Orange Line.
