addressalign-toparrow-leftarrow-rightbackbellblockcalendarcameraccwcheckchevron-downchevron-leftchevron-rightchevron-small-downchevron-small-leftchevron-small-rightchevron-small-upchevron-upcircle-with-checkcircle-with-crosscircle-with-pluscontroller-playcrossdots-three-verticaleditemptyheartexporteye-with-lineeyefacebookfolderfullheartglobegmailgooglegroupshelp-with-circleimageimagesinstagramFill 1light-bulblinklocation-pinm-swarmSearchmailmessagesminusmoremuplabelShape 3 + Rectangle 1ShapeoutlookpersonJoin Group on CardStartprice-ribbonprintShapeShapeShapeShapeImported LayersImported LayersImported Layersshieldstartickettrashtriangle-downtriangle-uptwitteruserwarningyahoo

Joint Spark London and Machine Learning Meetup

We are hosting a joint meetup between Spark London and Machine Learning London. Given the excitement in the machine learning community around Spark at the moment a joint meetup is in order!

Michael Armbrust from the Apache Spark core team will be flying over from the States to give us a talk in person. Thanks to our sponsors, Cloudera, MapR and Databricks for helping make this happen.

The first part of the talk will be about MLlib, the machine learning library for Spark, and the second part, on Spark SQL.

Don't sign up if you have already signed up on the Spark London page though!


Abstract for part one:

In this talk, we’ll introduce Spark and show how to use it to build fast, end-to-end machine learning workflows. Using Spark’s high-level API, we can process raw data with familiar libraries in Java, Scala or Python (e.g. NumPy) to extract the features for machine learning. Then, using MLlib, its built-in machine learning library, we can run scalable versions of popular algorithms. We’ll also cover upcoming development work including new built-in algorithms and R bindings.


Abstract for part two: 

In this talk, we'll examine Spark SQL, a new Alpha component that is part of the Apache Spark 1.0 release. Spark SQL lets developers natively query data stored in both existing RDDs and external sources such as Apache Hive. A key feature of Spark SQL is the ability to blur the lines between relational tables and RDDs, making it easy for developers to intermix SQL commands that query external data with complex analytics. In addition to Spark SQL, we'll explore the Catalyst optimizer framework, which allows Spark SQL to automatically rewrite query plans to execute more efficiently.



Join or login to comment.

  • Stefano

    Very informative, great demo and speaker

    1 · June 20, 2014

  • Jencir L.

    The speaker was energetic, vivid, clear and hands-on -- top-rated.

    1 · June 19, 2014

  • Christopher

    Hi, is there anyway of knowing my position in the waiting list, to get an idea whether to turn up "just in case" people drop out?

    June 19, 2014

  • Shaswar B.

    Somebody removed me by mistake?

    June 18, 2014

  • Peter M.

    How big is big enough?

    June 7, 2014

    • martin g.

      I'll be making an announcement on venue change early next week.

      1 · June 7, 2014

Our Sponsors

People in this
Meetup are also in:

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy