Apache Spark

Hadoop disrupted decades of data management practices and technologies by introducing an open source, massively parallel processing framework. The Hadoop community and the component ecosystem it has developed have been an unqualified success.

The widely anticipated Apache Spark project is the newest addition to that ecosystem.

"The Spark buzz keeps increasing; almost everybody I talk with expects Spark to win big, probably across several use cases."

-- Monash Research 3/17/14


"Spark is on the rise, to an even greater degree than I thought last month"

-- Monash Research 4/30/14

The Spark software stack includes:

Spark - the core data-processing engine

Shark - interface for interactive querying

Spark Streaming - for streaming data analysis

MLlib - for machine learning

GraphX - for graph analysis

Spark is quickly establishing itself as a leading environment for doing fast, iterative in-memory and streaming analysis.

This talk will introduce the Spark stack and explain how Spark achieves lightning-fast results and how it complements your existing Apache Hadoop investment.

We're pleased to welcome back our good friend Keys Botzum for this talk. Keys is a Senior Principal Technologist with MapR Technologies, where he wears many hats. His primary responsibility is interacting with customers in the field, but he also teaches classes, contributes to documentation, and works with engineering teams. He has over 15 years of experience in large-scale distributed system design. Previously, he was a Senior Technical Staff Member with IBM and a respected author of many articles on the WebSphere Application Server, as well as a book.


6:00 - Food, socializing, networking...

6:30 - Presentation

8:00 - More networking at a location TBD


  • Keys B.

    Here are the slides I presented last week: http://www.slideshare.net/MapRTechnologies/spark-overviewjune2014

    If you have any questions, please feel free to contact me.

    July 11

  • Michael G.

    Great talk!

    June 24

  • Karthikeyan M.

    Guys, I couldn't make it due to some other urgent work at the office. Would be happy to get a copy of the materials. Thanks!

    June 24

  • Becky


    For everyone attending the PhillyDB meet up on Tuesday, June 24th please email me your first and last name and phone number to [masked]. I need to provide your contact information to building security so you can be allowed access in the building.

    Thomson Reuters address is:
    1500 Spring Garden Street, Philadelphia, PA 19130
    * Please enter the building at the door located between Salad Works and Dunkin Donuts and sign in with the front desk on the first floor.

    Parking:
    * You may be able to find street parking.
    * There is a parking lot at 14th & 15th on Spring Garden Street.

    I'm looking forward to seeing everyone next Tuesday.

    Thank you,
    Becky Goldich

    June 19

    • Hao

      In case Becky needs help with network connectivity—if you are presenting, please send her and me ([masked]) your name and email address; I plan on attending and can help set you up.

      June 20

    • Martin D.

      Sent email with my info. Can't arrive until after 7, hope that's ok.

      June 24
