align-toparrow-leftarrow-rightbackbellblockcalendarcamerachatcheckchevron-downchevron-leftchevron-rightchevron-small-downchevron-small-leftchevron-small-rightchevron-small-upchevron-upcircle-with-crosscrosseditfacebookglobegoogleimagesinstagramlocation-pinmagnifying-glassmailmoremuplabelShape 3 + Rectangle 1outlookpersonplusImported LayersImported LayersImported Layersshieldstartwitteryahoo

Storm, Probabilistic Data Structures, & Scalable Machine Learning

  • Apr 24, 2013 · 6:30 PM
  • This location is shown only to members

Please note the address of the venue for this meetup


We are happy to announce this meetup as part of Big Data Week.


6.30pm Welcome, networking, free beer+pizza

7pm Talks Start


“Approximate Methods for Scalable Data Mining” by Andrew Clegg, Data Scientist & Tech Manager of Analytics Team @Pearson Probabilistic data structures let you trade off accuracy for scalability, by allowing a small and measurable amount of error in return for huge improvements in efficiency. Andrew’s talk provides an overview with use cases

“Storm + Trident and The Holistic Architecture: Using Hadoop for batch and Storm for real time” by Yodit Stanton, Freelance Data Scientist, Developer & Systems Architect. Computing arbitrary functions on an arbitrary dataset in real time is a daunting problem. There is no single tool that provides a complete solution. Instead, you have to use a variety of tools and techniques to build a complete Big Data system. A Holistic Architecture may solve the problem of computing arbitrary functions on arbitrary data in real time by decomposing the problem into three layers: the batch layer, the serving layer, and the speed layer


"PredictionIO - An Open Source Scalable Machine Learning Architecture" by Simon Chan Product Lead @ Prediction.IO To deal with big data in a production environment, a horizontally non-blocking and scalable system is needed. PredictionIO provides a flexible architecture for data engineers to evaluate algorithms and apply them to real applications. The whole stack is built on top of open source software while PreodictionIO itself is an open source Scala project.  Simon will introduce the system design and answer any questions you may have as developers or data scientists.


9.30pm-ish meetup ends

Join or login to comment.

  • Jonathan H.

    My friends at Pentaho challenge beliefs on what Big Data really means. It has been hijacked; it is not just hype.

    April 29, 2013

    • Carlos

      Your post is completely unrelated to the content of this meetup page. Why do you post this blatant marketing plug here? This is not an advertising board

      3 · April 29, 2013

  • Tony G.

    excellent speakers

    1 · April 26, 2013

  • Simon C.

    Full source code of PredictionIO can be found here:

    Thanks for all the feedback during and after the talk!
    If you are interested in our future development, follow us @predictionio

    1 · April 25, 2013

  • Seref A.

    Great event. Lots of hints from real life use cases. Thanks to Carlos and everybody who presented. Thanks for the beers too!
    I'd really like to take a second look at the slides. Any chance they can be put in SlideShare etc?

    1 · April 25, 2013

  • Andrew W.

    Particularly enjoyed the talk on Probabilistic Data Structures

    April 25, 2013

  • John Van P.

    First talk particularly good. I'd like to review the slides for that talk. Is it possible for the speaker to post them? Thanks.

    3 · April 25, 2013

  • Jerome

    Sorry I didn't come. I got the confirmation after the event started.
    Hope you had a good time.

    April 24, 2013

  • Franco B.

    Hi, I found an iPod on the floor of the streets near the building when I was going out, I suspect the owner is some person called "martin talbot" by the content but if anyone lost it and can describe it or what it has I shall return it.

    1 · April 24, 2013

  • Chris W.

    Fantastic, just got an email at 7pm saying that a space has opened up for tonights meetup. With a 1+ hour journey time I should be able to get there for at least the last half...

    1 · April 24, 2013

    • karlis z.

      Bit of a shame that circa 30-40 name badges are still downstairs unclaimed.

      April 24, 2013

    • Anish M.

      @michael thought u were busy with startup ;)

      April 24, 2013


    Great Place!!

    April 24, 2013

  • Tom N.

    Is there going to be any pizza?

    April 24, 2013

  • Dan K.

    not enough time in the day :(

    April 24, 2013

  • Alex

    Still on waiting list along with 195 others. I'm wondering whether to just turn up and take my chances.

    April 24, 2013

  • Chris J.

    Unable to attend now Hope someone can use this place

    April 24, 2013

  • David A.

    I am attending a different event tonight.

    April 24, 2013

  • Mark B.

    Always the way...was really looking forward to this!

    April 24, 2013

  • Seref A.

    Many thanks to those who kindly change their RSVP before the event if they won't be able to make it.

    April 24, 2013

  • Dominic S.

    Sorry notification came too late.

    April 24, 2013

  • Ben G.

    work issues - must bail, sorry :'(

    April 24, 2013

  • Jurek G.

    sorry, cannot be there tonight, hopefully someone will be able to take the my place last minute

    April 24, 2013

  • Richard S.

    I will not be able to attend due to last minute commitments.

    April 24, 2013

  • Peter

    238 waiting! Any chance of getting this session recorded?

    April 24, 2013

  • Michael C.

    Sadly I can't make it tonight, hope someone can take my place :o)

    April 24, 2013

  • Heather S.

    Whoops. Got a waitlist place (yay!) but it's husband's birthday (also yay! but preemptive). Enjoy. ;-)

    April 24, 2013

  • Jon W.

    Sorry, not going to be able to make this after all, got sick.

    April 24, 2013

  • Dave D.

    Many apologies, I've caught at work.

    April 24, 2013

  • Raja K.

    last minute work

    April 24, 2013

  • Mark B.

    Gutted not able to go now, project delivery getting in way! Enjoy this .

    April 24, 2013

  • Callum R.

    got tooth ache

    April 22, 2013

  • sahera k.

    I must attend this meeting because It is Really what I am looking for " Holistic Architecture".

    April 21, 2013

  • Christoforos A.

    Highly active in building scalable streaming analytics in storm. Very interested to attend please keep me posted.

    April 18, 2013

  • Patrick Z.

    Storm, Probabilistic Data Structures, & Scalable Machine Learning

    April 18, 2013

  • Sam B.

    I wish you could film this. Or at least provide the slides afterwards. Given the size of he waiting list, it could prove useful.

    3 · April 16, 2013

  • Leonidas T.

    166 people waiting? Come on guys, you have to pick a bigger venue. Really looking forward for this.

    1 · April 12, 2013

  • Vignesh M.


    April 11, 2013

  • Paul M.

    Interested in discovering new techniques

    April 10, 2013

  • Robin E.

    Sorry calendar mixup! Wish I could be there.

    April 10, 2013

  • A former member
    A former member

    Pow! That filled up fast

    April 9, 2013

  • Antonios C.

    Doing lots of Scala / Scalding / HBase / Cascading development. A bit of Mahout - and interested into scalable machine learning & storm

    April 9, 2013

  • Anthony S.

    looking forward to this one

    April 9, 2013

  • Seref A.

    Apparently not a good day to attempt to 'unplug' for a few hours. Can we see where we are in the waiting list?

    April 9, 2013

  • Ivan Z.


    April 9, 2013

  • Thiago G.

    Looking forward to it.

    April 9, 2013

  • Geoff H.

    So what is the temporal distribution of place bookings following the meetup announcement?
    Can we infer what proportion of bookings are automated bots listening out for emails for Data Science meetings?

    3 · April 9, 2013

    • Michael C.

      Writing a "meetup bot" is still on my backlog, for now I'm just relying on lightning-fast reflexes on seeing the emails come in :o)

      2 · April 9, 2013

  • John O.


    April 9, 2013

  • A former member
    A former member


    April 9, 2013

  • Sebastian M.


    April 9, 2013

  • Peter L.

    Looking forward to it

    April 9, 2013

  • Tim R.


    April 9, 2013

  • Stevens Yun Z.

    Must be good

    April 9, 2013

People in this
Meetup are also in:

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy