
February Hadoop Meetup: Hadoop-as-a-Service & Zookeeper

Dear HUG UK members,

I am pleased to announce our February meetup.

As mentioned in our January email, we are planning to have more lightning talks at our evening events.

Therefore, if you would like to give a short lightning talk this time, please get in touch with us.

This meetup is sponsored by ASOS.

Details below.

Best wishes,



Tuesday, February 18, 2014 (previously planned for the 11th). Doors open at 6:30pm.

Presentations from 7:00pm to 8:30pm.


Greater London House, Hampstead Road, London, NW1 7QX

(Just opposite Mornington Crescent tube stop)


Intro: ASOS is a global online fashion and beauty retailer selling over 65,000 branded and own-label products to fashion-forward twenty-somethings through our website. We ship, for free, to 237 countries and territories from our 1.1 million square foot global distribution centre in the UK. By way of introduction to ASOS, we’ll spend a few minutes talking about our business, the huge amount of data we collect and some of the exciting things we’re doing with it.

Session 1: Xplenty’s cloud-based Hadoop-as-a-service platform

Speaker: Alex Grach

Abstract: Xplenty’s cloud-based Hadoop-as-a-service platform features a simple GUI, allowing anyone in an organization to benefit from big data processing without needing to program a single line of code. The platform provides 3 key simplification features: 1) Infrastructure - single-click cluster provisioning, cluster optimization and maintenance. 2) Data Processes Development - a code-free design environment: instead of writing code to generate data flow designs, you work in an intuitive drag-and-drop GUI. Data transformation components such as select, sort, filter and join are all included, and custom components can be created. 3) Job Management - Xplenty manages scheduling, monitoring and error logging services.

* recently named Xplenty among Top 10 Big Data start-ups to watch.
** Here is their joint press release with Hortonworks.

Session 2: Is your distributed Zoo under control? 

Speaker: Flavio Junqueira

Abstract: Implementing distributed systems is hard. Servers crash, become slow, get partitioned away... These are all events that can happen in a real setting, and your distributed system needs to be ready to deal with them. Apache ZooKeeper has been developed to deal with such problems. It is a replicated in-memory system that stores small files called znodes in a hierarchical manner. The ordering guarantees of operations over znodes and the notification scheme it provides enable the implementation of a number of commonly used recipes, like master election, group membership, locks, barriers, etc. Without a component like ZooKeeper, implementing such recipes can be a significant burden because they require sophisticated algorithms with many corner cases that are easy to overlook. ZooKeeper, however, does not promise to hide all the problems of a distributed system completely; instead, it aims to simplify the task. In this presentation, we cover some basic concepts of ZooKeeper, design choices, and caveats.
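To make the master election recipe mentioned in the abstract more concrete, here is a minimal sketch in Python. It does not talk to a real ZooKeeper ensemble and does not use the ZooKeeper client API: `ToyZooKeeper`, `create_ephemeral_sequential`, `expire_session` and `current_master` are hypothetical names invented for this illustration, and the whole thing is a single-process, in-memory model. It only shows the core idea, assuming each candidate creates an ephemeral, sequentially numbered znode and the candidate with the lowest number is treated as master.

```python
from itertools import count


class ToyZooKeeper:
    """In-memory stand-in for a ZooKeeper ensemble (illustration only)."""

    def __init__(self):
        self._znodes = {}          # znode path -> id of the session that owns it
        self._sequence = count()   # monotonically increasing sequence suffix

    def create_ephemeral_sequential(self, prefix, session_id):
        # Mimics an ephemeral, sequential znode: a zero-padded counter is
        # appended to the path, and the node lives only as long as its session.
        path = f"{prefix}{next(self._sequence):010d}"
        self._znodes[path] = session_id
        return path

    def children(self, parent):
        # Sorted so that the lowest sequence number comes first.
        return sorted(p for p in self._znodes if p.startswith(parent + "/"))

    def expire_session(self, session_id):
        # A crashed or partitioned client loses its session, and with it
        # every ephemeral znode that session created.
        self._znodes = {p: s for p, s in self._znodes.items() if s != session_id}

    def owner(self, path):
        return self._znodes[path]


def current_master(zk, election_path="/election"):
    """Return the session holding the lowest-numbered candidate znode."""
    candidates = zk.children(election_path)
    return zk.owner(candidates[0]) if candidates else None


zk = ToyZooKeeper()
for session in ("server-a", "server-b", "server-c"):
    zk.create_ephemeral_sequential("/election/candidate-", session)

first_master = current_master(zk)
zk.expire_session("server-a")      # simulate the current master crashing
second_master = current_master(zk)
print(first_master, second_master)
```

In this toy model the survivors simply re-read the candidate list after a failure. With a real ensemble, the notification scheme described in the abstract would let each candidate set a watch and be told when the relevant znode disappears, rather than polling.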

Short bio: Flavio Junqueira is a member of the research staff of Microsoft Research in Cambridge, UK. He holds a PhD degree in Computer Science from the University of California, San Diego. He is interested in various aspects of distributed systems, including distributed algorithms, concurrency, and scalability. He is an active contributor to Apache projects, such as Apache ZooKeeper (PMC chair and committer) and Apache BookKeeper (committer).


  • A former member

    1 · March 17, 2014

  • Dan H.

    We've uploaded the videos from this meetup to our YouTube channel.

    You can also find the slides from the Xplenty talk here:

    March 2, 2014

  • A former member

    Are there any videos from the session?

    February 26, 2014

  • Basel Y.

    Unfortunately couldn't make it, hope there will be another meetup sometime soon!

    February 21, 2014

  • Adam R.

    If anybody needs a hand with Recruitment in the Big Data Space or needs a new job, contact me on [masked]

    February 19, 2014

  • A former member

    I don't know if it's because I'm getting older and my hearing is failing, but I'm finding it hard to hear the speakers at the events without a microphone. The events keep going from strength to strength - keep it up!

    1 · February 19, 2014

    • Dan H.

      Thanks for the feedback. We'll try to ensure we have a PA working in the future; it was definitely quiet at the back this time.

      1 · February 19, 2014

  • Alex M.

    1) Hi Chris. I agree that using a microphone and PA is best. I think that was a technical failure we couldn't sort out. 2) If anyone is interested in giving a ten minute "lightning talk", please email us at [masked]. I am thinking of doing one about doing NLP on Hadoop, and one about Nutch for web crawling. Anyone else?

    1 · February 19, 2014

  • Sanjay C.

    Jedidiah.... I would be interested to hear more.
    My email address is [masked]

    If you could drop me an email with your number so we can have a chat.


    February 19, 2014

  • jedidiah f.

    Big thanks to Alex and Flavio for the talks last night. If anyone is interested in opportunities at ASOS let me know.

    1 · February 19, 2014

  • A former member

    Interesting mix of sessions & some excellent conversation!

    3 · February 18, 2014

  • Ozgur G

    Product issues :(

    February 18, 2014

  • Graham P.

    Really disappointed that I won't be able to be at this one. Is there going to be a recording of the session?

    February 3, 2014

    • Dan H.

      Yes we will be recording the event. We'll post the links here once they are up on our YouTube channel.

      February 16, 2014

  • John A.

    Seb, did the change of location/time mean we had to re-RSVP? I could have sworn I was in and now I'm on the wait list. Hmm. Guess I'll risk it and just show up anyway.

    February 10, 2014

    • Alex M.

      John, no new RSVP is needed unless you can no longer make it. It should still have your old RSVP if you made one. I am not sure if anything happened to change yours.

      February 10, 2014

    • Dan H.

      Yes, the RSVPs didn't change.

      February 11, 2014

  • Alex M.

    Hopefully you all got an email but this event has changed its date because of the planned Tube strike... it is on the 18th, NOT the 11th. Please update your RSVP if you can no longer make it.

    February 8, 2014

  • Nick B.

    I am keen to build a hosted Hadoop capability at KPMG for our clients

    February 3, 2014

  • Alex G.

    Hi Graham, it's sad that you won't be able to make it. You can always drop me an email ([masked]) if you want to get additional information about our work at Xplenty. Cheers, Alex

    February 3, 2014

Our Sponsors

  • Cognizant

    July sponsor

  • Bigstep

    July sponsor - 30% discount code for BDW: HUG_BDW.
