Past Meetup

Data Science Study Session with H2O @ WeWork

80 people went

Details

This event will take place at WeWork SOHO Grand. Please use the Lafayette Street entrance.

**Please RSVP with your first and last name.**

You are invited to join us for a data science study session. On Wednesday, April 13th, Erin LeDell from H2O will present and lead a machine learning workshop on the H2O R and Python packages.

Schedule

6:30 PM - 6:45 PM Food, Drinks, Settling Down

6:45 PM - 7:00 PM Short Introductions

7:00 PM - 8:30 PM Erin's Keynote Presentation & Workshop

8:30 PM - 9:00 PM Networking

Food and drinks will be provided, courtesy of H2O.

Summary

The focus of this workshop is machine learning using the H2O R and Python packages. H2O is an open source distributed machine learning platform designed for big data, with the added benefit that it is easy to use on a laptop as well as on a multi-node Hadoop or Spark cluster. The core machine learning algorithms of H2O are implemented in high-performance Java; however, fully featured APIs are available for R, Python, Scala, and REST/JSON, along with a web interface.

Because H2O's algorithm implementations are distributed, the software can scale to very large datasets that may not fit into RAM on a single machine. H2O currently features distributed implementations of generalized linear models, gradient boosting machines, random forest, deep neural nets, dimensionality reduction methods (PCA, GLRM), clustering algorithms (K-means), and anomaly detection methods, among others. The ability to create stacked ensembles, or "super learners," from a collection of supervised base learners is provided via the h2oEnsemble R package.

R and Python Jupyter notebooks with H2O machine learning code examples will be demoed live and made available on GitHub for attendees to follow along on their laptops. For those interested in running the code on a multi-node Amazon EC2 cluster, an H2O AMI is also available.

Biography

Erin LeDell is a Statistician and Machine Learning Scientist at H2O.ai, the company that produces the open source machine learning platform, H2O. She is the author of a handful of machine learning-related software packages, including the h2oEnsemble R package for ensemble learning with H2O. Erin received her Ph.D. in Biostatistics with a Designated Emphasis in Computational Science and Engineering from UC Berkeley. Before joining H2O.ai, she was the Principal Data Scientist at Wise.io and Marvin Mobile Security (acquired by Veracode in 2012) and the founder of DataScientific, Inc.