addressalign-toparrow-leftarrow-rightbackbellblockcalendarcameraccwchatcheckchevron-downchevron-leftchevron-rightchevron-small-downchevron-small-leftchevron-small-rightchevron-small-upchevron-upcircle-with-checkcircle-with-crosscircle-with-pluscrossdots-three-verticaleditemptyheartexporteye-with-lineeyefacebookfolderfullheartglobegmailgoogleimageimagesinstagramlinklocation-pinmagnifying-glassmailminusmoremuplabelShape 3 + Rectangle 1outlookpersonplusprice-ribbonImported LayersImported LayersImported Layersshieldstartickettrashtriangle-downtriangle-uptwitteruseryahoo

January Meeting – Building Data Products

  • Jan 28, 2013 · 5:30 PM
  • Orbitz Worldwide Headquarters

We're excited to welcome Cloudera's Director of Data Science, Josh Wills, for our January meeting. Josh will be talking about best practices for creating analytical applications with large data sets. Please join us for what's sure to be an interesting and informative talk – more info is below.

Please note: space will be limited for this meeting and we want to make sure we can accommodate everybody who's interested in coming. Please only RSVP if you're certain you'll be able to make it.

Title: Building Data Products


Data scientists – the analytical professionals who straddle the line between statistician and software engineer – are in demand like never before. Due to the scarcity of data science talent, it has become increasingly important for data scientists to spend less time answering one-off questions and more time building data products that enable a broad class of users to interact with large data sets, ask detailed questions, and make valid inferences. In this talk, we will give an overview of the current best practices around creating analytical applications on Hadoop, including dashboards, ETL pipelines, data APIs, and machine learning models.


Josh Wills is Cloudera’s Director of Data Science, working with customers and engineers to develop Hadoop-based solutions across a wide-range of industries. Prior to joining Cloudera, Josh worked at Google, where he worked on the ad auction system and then led the development of the analytics infrastructure used in Google+. He earned his Bachelor’s degree in Mathematics from Duke University and his Master’s in Operations Research from The University of Texas – Austin.




Join or login to comment.

  • Barbara K.

    Could you suggest some companies for an MIT Math and CS freshman to apply to a summer internship?

    February 8, 2013

    • Thomas J M.

      SapientNitro/Iota..ask for TJ.

      February 8, 2013

  • Rob L.

    January 29, 2013

  • Anil S.

    Overall Impression:
    - Excellent Organization by Rob and Jon.
    - Thanks to Josh for taking the time to make a presentation.
    - Time well spent listening the presentation.

    I am familiar with Hadoop, Map Reduce, Machine Learning Algorithms etc.

    Things I really liked/learned from the presentation:
    a) Historical context to Google developing GFS, Map Reduce etc. This is something only a Googler knows. (Thanks Josh for sharing)
    b) Application of Classification, K-Means and Random Forest algorithms to different situations.

    Suggestions to the presenter:
    The introduction to Hadoop, Hive, Impala is great. This would cater to folks who are new to Hadoop and its ecosystem.
    It would be nice to get your perspectives on how to build data products in different verticals such as Finance, Healthcare etc. You shared your experiences from Google which was valuable. Please share your field experiences on different data products from your Cloudera engagements.

    1 · January 29, 2013

  • Robert P.

    Excellent Presentation, speaker was incredibly knowledgeable and friendly. I look forward to learning more next time!

    January 29, 2013

  • A former member
    A former member

    Great talk. Would love to see Josh come back for a Cloudera Data Science class in Chicago.

    January 29, 2013

  • Sanjay K.

    Can Jon or someone upload video/presentation online?

    January 29, 2013

  • John H.

    Outstanding, entertaining. Especially enjoyed the early history of google and how that influenced the technology. Very enlightening-and funny. Kudos to Rob and Jon for a great speaker, great night.

    January 29, 2013

  • Lou D.

    Excellent meetup. Dynamic presentation!

    January 28, 2013

  • Matthew L.

    Top notch speaker and very informative subject material.

    January 28, 2013

Our Sponsors

  • Orbitz Worldwide

    A leading global online travel company and technology innovator.

  • Cloudera

    The leader in Apache Hadoop-based software and services.

  • HortonWorks

    A leading provider of support and services for Apache Hadoop.

  • TechNexus

    Chicago’s first collaborative ecosystem for tech entrepreneurs.

  • Oracle

    Industry leading hardware and software solutions for data management.

  • Couchbase

    Open source NoSQL for mission-critical systems.

  • Terracotta

    In-memory data management for the enterprise.

People in this
Meetup are also in:

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy