Skip to content

Hands-on: Data Science at Scale with HAWQ and MADlib and Hadoop

Photo of Alex Zeltov
Hosted By
Alex Z.
Hands-on: Data Science at Scale with HAWQ and MADlib and Hadoop

Details

6:00 PM- 6:30 PM: drinks, mingling

6:30 PM - 8:30PM: Hands-on: Data Science at Scale with HAWQ and MADlib and Hadoop

In this Meetup we’ll learn about Apache HAWQ (http://hawq.incubator.apache.org/), the elastic, parallel processing query engine that operates on all your data directly within Hadoop. We’ll also learn about Apache MADlib (http://madlib.incubator.apache.org/), the big data machine-learning library that provides commonly used data science algorithms capable of leveraging the parallel processing capabilities of HAWQ.

The main part of this event will be a guided hands-on where we use Apache Zeppelin as the notebook to perform a data science investigation of our data in Hadoop by invoking MADlib functions in Python, R, and directly with SQL.

Feel free to come watch the extended demonstration. If you want to play-along with your own sandbox, please bring a system that meets these minimum requirements. The software will be distributed by a USB drive:

· VirtualBox 4.2 or later, or VMWare 5.0 or later installed Pre-downloaded Sandbox VM with HAWQ

· 15 GBs free disk space

This meetup will be at new location @ WEWORK MARKET ST.

1601 Market Street Philadelphia PA 19103 (19th floor)

About our sponsor:

WeWork is a community for creators. We transform buildings into

beautiful, collaborative workspaces and provide the infrastructure, services,

events and technology so our members can focus on doing what they love.

WeWork currently has 111 locations in 29 cities across the world with over

70,000 members. Book a tour at wework.com now!

Photo of Future of Data: Philadelphia group
Future of Data: Philadelphia
See more events