Interactive SQL-on-Hadoop: from Impala to Hive/Tez to spark SQL to JethroData

Abstract
Since the launch of Cloudera's Impala 2 years ago, multiple tools have emerged in the quest to improve interactive query performance on Hadoop. JethroData, an Israeli startup, recently joined to field, announcing beta release of an index-based engine to fast sql-on-hadoop .

In this session we'll examine in detail Brute-Force vs Index Access architecture approaches, and which use-cases are suitable for each. We will explain how JethroData is designed and present performance benchmarks of some of the tools. The session will also include a demo and open Q&A.

About the presenter
Ofir Manor is a blogger and expert in the area of BigData. Ofir holds more than 15 years of experience developing and supporting databases. Worked for Oracle, GreenPlum and was an independent consultant for BigData. Lately joined JethroData as their Product Manager. http://www.linkedin.com/in/ofirmanor

Join or login to comment.

  • Nader G.

    X DX .

    August 1

  • Asaf B.

    Thank you Ofir and team for the excellent lecture. i really enjoted it and got alot of important eduction in one of the hottest domains of information technologies.

    July 23

  • Ofir M.

    Hi all,
    thanks for showing up last night! I hope you enjoyed it - it was great to have so many good questions.
    I'm uploading the presentation, it has a few extra slides we didn't cover.
    If you have questions or want to discuss your use case or want to try our beta, ping me or Ronen (our emails are at the end of the presentation)

    July 22

  • Alex S.

    Hi all. Thanks Ofir. As I have told to some of you there is an planned Horizon 2020 Call for Big-Data. The text of the Call includes the following sentences: " R&D projects to develop novel data structures, algorithms, methodology, software architectures, optimisation methodologies and language understanding technologies for carrying out data analytic, data quality assessment and improvement, prediction and visualization tasks at extremely large scale and with diverse structured and unstructured data. Of specific interest is the real time cross-stream analysis of very large numbers of diverse, and, where appropriate, multilingual, multimodal data streams. .... Explicit experimental protocols and analyses of statistical power are required in the description of usability validation experiments for the systems proposed.". I will be happy to have more F2F clarification. Alex[masked]

    July 22

  • EranW

    Hi,
    in spite of what might seam from the Q&A, I really enjoyed the presentation yesterday. although I personally believe that SPARK in general and SPARK-QL will have a significant role in the future of big data analysis then presented.
    With regard to JethroData, the product looks promising but not sure if presented\positioned correctly - The general architecture I like to follow is based on an Hadoop data lake with all of the enterprise data and in-front of it a use-case driven indexing such as SOLR for full text, columnar DB RDBS etc. JethroData is one of these indexing technique for ad-hoc DW interaction for BI tools.

    July 22

  • Rami C.

    Any parking tips for this venue?

    July 21

    • Tal B.

      I work around the area, no parking problems are expected at all

      July 21

  • david a.

    Dear friend, please have a look:
    Our startup doing "Mobile to sensor platform, (use case - IoT for the Golden Age)" is challenging Cisco IoT competition
    https://iotchallenge.cisco.spigit.com/Page/ViewIdea?ideaid=3808

    ….Our Intellectual Property is the first world solution of MESH over Wi-Fi Direct for 50 Billion sensors "things" @ NON IP (and IP) sensors and wearable BTLE devices.
    We are adding cyber security and QoS to this mobile middleware platform.

    Initially we are going to use it for a 3rd age safe life holistic sensoring solution (Smart Health combined with Smart Building)

    Thanks for any feedback - I value your time,

    David Alon, Israel
    [masked]
    Allinpack.com
    http://www.linkedin.com/pub/david-alon 8200/5a/280/903

    July 20

  • Ofir M.

    Hi all,
    I'll be happy to answer questions regarding the session.
    We will start by a technical discussion on the state of the rich SQL-on-Hadoop space, based on architecture and different use cases. After that, we will introduce JethroData and see its different design goals and what that can achieve (demo included).
    Session will be in Hebrew.

    1 · July 15

  • david a.

    Dear friends, I need your help - pls.

    Our startup doing " Mobile cross platform middleware APP, ( IoT FOR THE GOLDEN AGE)" is challenging Cisco IoT competition, and we need friendly votes:

    Please register on site "LeaderBoard" on the title entry, and Like your entry…
    voted entry by the crowd will enter the Semi-finals.

    https://iotchallenge.cisco.spigit.com/Page/ViewIdea?ideaid=3808

    Our Intellectual Property is the world solution for MESH over Wi-Fi Direct for 50 Billion sensors "things" @ NON IP (and IP) sensors – and wearable BTLE devices!
    And we are going to use it for a 3rd age better life solution

    Thanks - I value your time,

    Dadi Alon, Israel
    Allinpack.com

    July 1

Our Sponsors

People in this
Meetup are also in:

Create a Meetup Group and meet new people

Get started Learn more
Henry

I decided to start Reno Motorcycle Riders Group because I wanted to be part of a group of people who enjoyed my passion... I was excited and nervous. Our group has grown by leaps and bounds. I never thought it would be this big.

Henry, started Reno Motorcycle Riders

Start your Meetup today

Act now and get 50% off.
Until February 1.

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy