Skip to content

Spark Deep Dive

Photo of Brian Husted
Hosted By
Brian H.
Spark Deep Dive

Details

Introduction

Please join us for an exciting evening that will dive deep into Spark best practices, design philosophies, Dataframes and product integration with Couchbase. This will be an excellent opportunity to gain a better understanding of how Spark is being leveraged from engineers that use it everyday. The Jailbreak brewery will provide us with a casual environment along with WIFI, Pizza, and an opportunity to sample some uniquely crafted beers. Pizza will be provided at no charge and there will be a cash bar for beers. We look forward to seeing everyone there!

A big thank you to our meetup sponsor Raytheon (http://www.raytheon.com/).

Agenda

5:00 - 5:30 - Networking, Pizza, and Beers

5:30 - 6:15 - Couchbase on Spark

6:15 - 7:00 - Spark Dataframes, by Markus Dale, US Govt

7:00 - 7:45 - Spark Deep Dive, by Corey Nolet, Tetra Concepts (http://tetraconcepts.com/)

7:45 - 8:00 - Discussion and networking

More about the Talks

Spark Deep Dive

This deep dive attempts to "de-mystify" Spark by touching on some of the main design philosophies and diving into some of the more advanced features that make it such a flexible and powerful cluster computing framework. Corey will discuss some common pitfalls and best practices for building, configuring, and deploying Spark applications.

Spark Dataframes

The DataFrame abstraction, introduced in Apache Spark 1.3, is a distributed collection of data organized into named columns. It gives developers a DSL to expressively manipulate that data while the underlying Catalyst query optimizer ensures performance. Markus will provide an overview of the DataFrames API and some sample use cases/comparisons to RDDs.

Couchbase on Spark

Matt Ingenthron from Couchbase will talk about how you can use core Spark components and higher level integrations like Spark streaming and Spark SQL together with services from Couchbase such as the N1QL query language and DCP. Come ready to learn as there will be plenty of code and demos!

Our Speakers

Matt Ingenthron

Matt is an Engineering Director and one of the leaders Couchbase development with a deep software development background. He had previously helped scale Java, Ruby on Rails and AMP web applications for large scale web deployments. He has been a contributor to the memcached project, one of the maintainers of the Java spymemcached client and a core developer on Couchbase. He is currently heading up Couchbase's efforts in helping developers be most effective with Couchbase and continues to lead the team in getting the right bits needed for Node.js, Java, .NET and PHP developers (among others).

Markus Dale is a senior software developer with DoD where he focuses on large scale data processing. He also developed and taught a Hadoop for developers class for UMBC Training. His blog is http://uebercomputing.com (http://uebercomputing.com/).

Corey Nolet is a senior software engineer with Tetra Concepts. He is an active proponent of open source and has been involved in several Apache projects. Corey is a committer on Apache Accumulo and has a passion for helping drive the big data community toward more real-time data exploration & processing capabilities. Heholds a bachelor’s degree in Computer Science from the University of Maryland, Baltimore County. He is currently finishing up his masters degree in the same field with a focus on big data and data science.

Photo of Distributed Computing Maryland group
Distributed Computing Maryland
See more events
Jailbreak Brewing Company
9445 Washington Blvd N Ste F · Laurel, MD