addressalign-toparrow-leftarrow-rightbackbellblockcalendarcameraccwcheckchevron-downchevron-leftchevron-rightchevron-small-downchevron-small-leftchevron-small-rightchevron-small-upchevron-upcircle-with-checkcircle-with-crosscircle-with-pluscrossdots-three-verticaleditemptyheartexporteye-with-lineeyefacebookfolderfullheartglobegmailgooglegroupshelp-with-circleimageimagesinstagramlinklocation-pinm-swarmSearchmailmessagesminusmoremuplabelShape 3 + Rectangle 1ShapeoutlookpersonJoin Group on CardStartprice-ribbonShapeShapeShapeShapeImported LayersImported LayersImported Layersshieldstartickettrashtriangle-downtriangle-uptwitteruserwarningyahoo

PyData Amsterdam: the H20 edition

Hi everybody,

It's PyData time again.

In an unexpected twist, a flood of H2O folks are visiting town, bringing goodies and knowledge and they asked us if we wanted to know the latest things boiling in the H2O cauldron. eBay/Marktplaats were kind enough to offer a location for the meetup. Thank you folks!


Melanie, Vincent, Marcel, Gabriele, and Giovanni

18:00 doors open up 
19:05 doors close - be on time! 
19:15 first talk - Sparkling Water 2.0
20:00 break
20:30 second talk - Deep Water

Sparkling Water 2.0 - Jakub Háva (45 mins)

Sparkling Water integrates the H2O open source distributed machine learning platform with the capabilities of Apache Spark. It allows users to leverage H2O’s machine learning algorithms with Apache Spark applications via Scala, Python, R or H2O’s Flow GUI which makes Sparkling Water a great enterprise solution. Sparkling Water 2.0 was built to coincide with the release of Apache Spark 2.0 and introduces several new features. These include the ability to use H2O frames as Apache Spark’s SQL datasource, transparent integration into Apache Spark machine learning pipelines, the power to use Apache Spark algorithms via the Flow GUI and easier deployment of Sparkling Water in a Python environment. In this talk we will introduce the basic architecture of Sparkling Water and provide an overview of the new features available in Sparkling Water 2.0. The talk will also include a live demo showing how to integrate H2O algorithms into Apache Spark pipelines – no terminal needed!

About Jakub

Jakub (or “Kuba”) finished his bachelors degree in computer science at Charles University in Prague, and is currently finishing his master’s in software engineering as well. As a bachelors thesis, Kuba wrote a small platform for distributed computing of tasks of any type. On his current masters studies he’s developing a cluster monitoring tool for JVM based languages which should make debugging and reasoning about performance of distributed systems easier using a concept called distributed stack traces. At H2O, Kuba mostly works on Sparkling Water project.

Project “Deep Water” (H2O integration with other deep learning libraries) - Arno Candel (30 mins)

The “Deep Water" project is about integrating our H2O platform with other open-source deep learning libraries such as TensorFlow, mxnet and Caffe. I will talk about the motivation and potential benefits of this project and then carry out a live demo using mxnet as the GPU backend.

About Arno

Dr. Arno Candel is the Chief Architect at Arno is also the main author of H2O’s Deep Learning and key contributor to H2O's GBM and DRF algorithms. Arno spent the last 5 years designing and implementing high-performance machine-learning algorithms. Previously, he spent a decade in high-performance computing and ran his code on the world’s largest supercomputers as a staff scientist at SLAC National Accelerator Laboratory, where he participated in US DOE scientific computing initiatives and collaborated with CERN on next-generation particle accelerators.

Join or login to comment.

  • Massimiliano U.

    Enjoyable presentations and the crowd was in a good mood. Great to meet new people.

    November 29

  • Vincent Damian W.

    One slidedeck has been found: Will try to retreive the link for the other one.

    4 · November 17

  • Petar J.

    Great Meetup! It was nice to learn something about H2O.

    1 · November 17

  • Vincent Damian W.

    A new record, PyData has been filled up in less than 3 hours! Much excitement! We'll look into the possibility of expanding, but mainly a friendly reminder; please please please ensure that you can actually show up. If not; please remove the RSVP asap.

    4 · November 8

  • Vincent Damian W.

    Hi all, I just updated the schedule. 18:00 doors open up
    19:05 doors close - be on time!
    19:15 first talk - Sparkling Water 2.0
    20:00 break
    20:30 second talk - Deep Water

    Please come on time. We cannot guarantee somebody can open up the doors for you after 19:05 because security leaves the building.

    November 16

  • Jan M.

    Unfortunately I cannot come, I released my spot.

    2 · November 15

  • anna

    hi all! Anyone who has rvsp'd but not coming, can you PLEASE update your status? I am, along with many others, on the waiting list, desperately looking for a spot to become available :D THANKS!

    1 · November 15

  • Vincent Damian W.

    This will be another public announcement to please be friendly and make sure if you can make it. If you cannot make it, please remove yourself from the RSVP so that somebody else might.

    November 14

Our Sponsors

People in this
Meetup are also in:

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy