addressalign-toparrow-leftarrow-rightbackbellblockcalendarcameraccwcheckchevron-downchevron-leftchevron-rightchevron-small-downchevron-small-leftchevron-small-rightchevron-small-upchevron-upcircle-with-checkcircle-with-crosscircle-with-pluscontroller-playcrossdots-three-verticaleditemptyheartexporteye-with-lineeyefacebookfolderfullheartglobegmailgooglegroupshelp-with-circleimageimagesinstagramFill 1light-bulblinklocation-pinm-swarmSearchmailmessagesminusmoremuplabelShape 3 + Rectangle 1ShapeoutlookpersonJoin Group on CardStartprice-ribbonprintShapeShapeShapeShapeImported LayersImported LayersImported Layersshieldstartickettrashtriangle-downtriangle-uptwitteruserwarningyahoo

Impala and Big Query - discussion about dremel based technologies.

As we know today, MapReduce is a main workhorse for the large analytical data processing. And we also aware of its main drawbacks is a lack of interactiveness.

A few years ago Google published academic paper about the Dremel - technology used by google for interactive analysis of big data sets http://static.googleusercontent.com/external_content/untrusted_dlcp/research.google.com/en//pubs/archive/36632.pdf).
This paper is a basis for the several developed products: Impala, BigQuery, as well as several projects in progress:
Drill and OpenDremel.

Impala is an open source implementation of the Dremel concepts from Cloudera.
Big Query is a google's service based on Dremel.

Main topics to be discussed:
a) What is Impala, its main features and limitations. How it is integrated with Hadoop.
There will be some low-level technical details of the Impala implementation available from its sources.
b) Google Big Query. We will analyze its features and try to put on the table pros and cons of its usage.

I think this discussion will be interesting for people considering technologies to implement in order to make large data analysis more interactive.

I will be glad to get related topics and will do my best to prepare answers, if you can share those with me.
Your experience in these technologies - I will be glad to summarize and share it or give a slot for presentation.

We also will take case of what MediaMind are doing with big data and analyze what part of their workload can be optimized by using Impala.

Join or login to comment.

  • A former member
    A former member

    David - thanks for a very in-depth yet amusing meetup.

    June 30, 2013

  • Sagi Z.

    Really good meetup

    June 28, 2013

  • David G.

    Presentation can be found here:
    http://www.meetup.com/HadoopIsrael/files/

    June 28, 2013

    • eli s.

      Thanks for a very informative session and a great setting. Looking forward to future discussions.

      June 28, 2013

  • Voronov D.

    Perfect meet up!

    1 · June 28, 2013

  • Ofir A.

    Very informative and helpful

    June 28, 2013

  • A former member
    A former member

    Hi,
    Is there a parking arrangement in the building?

    June 27, 2013

  • David M.

    What time does it start officially?

    June 27, 2013

    • David G.

      We start gathering at 18:30 to have some time for networking. I think actual presentation will start 19.00

      June 27, 2013

  • Dan

    Please share the presentations afterwards

    June 27, 2013

  • Nigel b.

    This looks like a fantastic event". May e I can share my cloudera experiences from the defense community

    June 25, 2013

    • David G.

      please share with me the topic

      June 25, 2013

  • A former member
    A former member

    Sounds interesting! :)

    June 24, 2013

People in this
Meetup are also in:

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy