addressalign-toparrow-leftarrow-rightbackbellblockcalendarcameraccwcheckchevron-downchevron-leftchevron-rightchevron-small-downchevron-small-leftchevron-small-rightchevron-small-upchevron-upcircle-with-checkcircle-with-crosscircle-with-pluscontroller-playcrossdots-three-verticaleditemptyheartexporteye-with-lineeyefacebookfolderfullheartglobegmailgooglegroupshelp-with-circleimageimagesinstagramFill 1light-bulblinklocation-pinm-swarmSearchmailmessagesminusmoremuplabelShape 3 + Rectangle 1ShapeoutlookpersonJoin Group on CardStartprice-ribbonprintShapeShapeShapeShapeImported LayersImported LayersImported Layersshieldstartickettrashtriangle-downtriangle-uptwitteruserwarningyahoo

Data Science Perspectives on Hadoop + Machine Learning

  • Sep 6, 2012 · 6:30 PM
  • This location is shown only to members

In this session we will be discussing some of the issues, ideas, and challenges around Machine Learning and Hadoop from a data scientist perspective.

Agenda

6.30 Welcome and networking around, pizza+beer

7pm talks start

"From square to round wheels... moving from batch to real-time machine learning" by Michael Cutler CTO at Tumra.

Big Data technologies like Map/Reduce and the tools that utilise it are inherently batch in nature - they start, process, and end in jobs that last anywhere from minutes to hours at a time. By the time a batch job has finished there is already a queue of ‘stationary data’ waiting to be processed in the next batch run. This approach has its limitations, if you rely on a batch-process to train a machine learning model it could be ‘too late’ to respond to rapid changes. Recently there is a clear trend towards processing streams of ‘moving data’, such that it is never at rest (until it is archived). In this presentation Michael will walk you through some of the challenges and techniques to implement real-time online machine learning algorithms. Rather than pontificate about the merits of these approaches Michael will give you access to a live demo to interact with! Michael is CTO at Tumra. Prior to joining Tumra, he was a senior researcher in the R&D labs for British Sky Broadcasting.

Q&A

5 min break

"Machine Learning on Hadoop: Present and Future" by Josh Wills, Director Data Science @ Cloudera.

In this talk Josh will talk about industrial machine learning, machine learning and Hadoop, and things industry needs from academia, as well as some challenges and new things happening. Josh is the director of data science at Cloudera, and one of the main contributors to Cloudera’s most recent open source project, Crunch, a Java library that aims to make writing, testing, and running MapReduce pipelines easy, efficient, and even fun.
Prior to joining Cloudera, he was a software engineer at Google.

Q&A

More beer and networking

9.30pm-ish Session ends

 

 

 

Join or login to comment.

  • Bruce D.

    Very insightful talks from Michael and Josh. I learned loads. And as always a friendly and welcoming community.

    September 7, 2012

  • A former member
    A former member

    Very interesting. Perhaps a bit too technical for me but clearly I am a total newbie to the field.

    September 7, 2012

  • Phil H.

    Two very interesting talks and I met some great people. Plus free beer and pizza, cheers!

    September 7, 2012

  • Claudio L.

    Very good talks, will be back!

    September 7, 2012

  • Richard B.

    Excellent - learned a lot about the pitfalls of map/reduce...very interested in the upcoming classes as I'm a lapsed coder/engineer and would be good to re-learn a lot of this stuff.

    September 7, 2012

  • Christian P.

    Great speakers and organisation. Personally, I missed some novel or in-depth insight though.

    September 7, 2012

  • Heather S.

    Two excellent speakers with lots and lots to say. (If there is an option for replaying it at a slightly slower speed with footnotes that would be very cool. Let me know and I will do it.)

    September 6, 2012

  • Andrew M.

    Very good speakers, and very well organised.

    September 6, 2012

  • Marc

    Very knowledgeable speakers. Introduced a lot of topics for further research.

    September 6, 2012

  • A former member
    A former member

    Very good

    September 6, 2012

  • Michael N.

    Two absolutely mindblowingly good talk by two great speakers!

    September 6, 2012

  • A former member
    A former member

    Two talks telling the same plain truth about ML at scale today. Well worth attending.

    September 6, 2012

People in this
Meetup are also in:

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy