HBase - A Technical Introduction

  • September 17, 2013 · 7:00 PM

HBase. It's in almost every Hadoop distro. We all use it every day without knowing it. It's probably the least understood Hadoop project.

A deep dive session into HBase:
- how it works
- what you can use it for
- where it's going
- why it will blow your mind.

Additionally, if there are any HBase users in the group (and I know there are), now is the time to come out and let us know what you are doing with it.

Thank you to Shopify for volunteering to host in their space!!!


Disclaimer: I will not be introducing Hadoop in this session. The plan is to make this into a USER group. I realize this is a new technology, but the goal is to bring together a group of people who can exchange ideas and experience.


  • Arthur

    Nice presentation, thanks. I didn't have the time to get there, but is there a library similar to Phoenix for building a graph database on top of HBase (similar to what Titan does on top of Cassandra)?

    September 19, 2013

    • Arthur

      Actually, after a quick search I see that Titan seems to work on top of HBase as well, so I guess my question is answered. If anybody knows of other tools that do this, I would still be curious to know...

      September 19, 2013

    • Adam M.

      Make sure you check out Apache Giraph, Yahoo's graph analysis tool written as a YARN application. Check out the PageRank example too: http://giraph.apache....

      September 19, 2013

  • Paresh Y.

    Another awesome presentation, Adam! I missed the networking part because I had to leave early. Maybe we should have another networking/drinks night so that we can buy you a beer as a token of our appreciation :).

    September 18, 2013

    • Arthur

      I support offering a beer to Adam during a TOHUG social event.

      September 19, 2013

  • Sunil R.

    Superb overview of hbase!! Thanks Adam!

    September 18, 2013

  • Richard W.

    Thanks for the intro to HBase. The expert tips are invaluable. Sorry I had to leave a bit early. Looking forward to the next meeting.

    September 18, 2013

  • Manmohan

    Adam, thank you for the informative presentation. I would love to join the 'Writing YARN applications' meetup.

    September 18, 2013

  • Leons P.

    Very interesting technical presentation.

    September 18, 2013

  • Raghu S.

    Very good. In particular, Adam explained the HBase architecture, HFiles, Bigtable, the HMaster, and the relationship between the HRegionServers and data nodes. His coverage of high availability, timestamps, row keys, and other aspects was insightful and presented in a clear, succinct way. Due to lack of time, HOYA was covered quickly, but it definitely gave an idea of what to expect in the next meetups. He clearly explained the scenarios where HBase works best, such as when low-latency data access is needed and when referential integrity is not a factor, and so forth.
    Thank you, Adam, for an excellent introduction to HBase.

    September 18, 2013

  • Tri N.

    Very good, I wish it could last an entire day.

    September 18, 2013

  • Vanessa S.

    Thanks, Adam! Great presentation.

    For the people who asked about job opportunities that involve Hadoop, here is our open position: http://www.shopify.com/careers?posting=data-infrastructure-engineer-hadoop-big-data

    September 17, 2013

    • Dayong

      Good, and I look forward to the next one.

      September 18, 2013

  • Sean H.

    A great presentation by Adam on a complex subject. Adam gave a pretty detailed description of the architecture and practical advice on use cases. Hopefully, in the future we'll have more participants with hands-on experience who can share their insights.

    September 18, 2013

  • Otto M.

    It was a good intro, given prior Hadoop knowledge. I would have liked to go deeper on the use cases and the performance trade-offs vs. storing and managing data directly on HDFS.

    September 17, 2013

    • Adam M.

      Thanks Otto, these comments are very helpful. Hopefully we can cover those topics in a smaller, deeper session.

      September 18, 2013

  • Paul H.

    Thanks for the talk Adam. We use HBase extensively and found the talk interesting and useful.

    September 18, 2013

  • Hardik

    As usual, a great-quality presentation from Adam, with lots of in-depth knowledge and very insightful Q&A. Please post the slide deck link.

    September 18, 2013

  • Ashwin T.

    Also, it was great to connect with folks at the meeting. We do have an opening on our data team, if you or someone you know might be interested: http://hire.jobvite.com/CompanyJobs/Careers.aspx?c=qrv9VfwK&v=1&page=Job%20Description&j=oboGXfwW

    September 18, 2013

  • Ashwin T.

    Adam, great presentation. A very detailed run-through on HBase. Excited to hear more about YARN and HoYa. We'd love to host another THUG meeting at BNOTIONS.

    September 18, 2013

  • David T.

    Great to get into the technical details.

    September 18, 2013

  • Steve D.

    Great talk Adam. Had to leave a little early but what I saw was golden.

    September 18, 2013

  • A former member

    Thanks Adam for a high quality presentation.

    September 17, 2013

  • Dipankar

    It was nice hearing it from someone on the inside... great session.

    September 17, 2013

  • Rajiv A.

    Is the address Spadina Road or Spadina Avenue?

    September 17, 2013

    • Vanessa S.

      Avenue. It's between Queen and Dundas.

      September 17, 2013

    • Rajiv A.

      Thanks Vanessa

      September 17, 2013

  • Raghu S.

    Along with the HBase introduction, it would be a nice start to the talk if the speaker could briefly cover Hadoop, MapReduce, and NoSQL.

    September 11, 2013

    • Adam M.

      I appreciate that many people need it, but one of the goals of a user group is to get a group of self-motivated, Hadoop-experienced individuals in the same room so they can interact. Eventually I hope to do fewer of these presentations and have more interactive sessions. We want to keep Hadoop users up to date on the latest tech and give them practical advice. If we have to take time every session to present the basics of Hadoop and field questions (it takes 60-90 minutes in my experience), then the experienced people will cease to attend and I'll probably stop presenting too. Besides, no one in a user group for a specific technology should expect to show up and get free introductory training for that technology. If you really want to get ramped up on Hadoop, please consider one of the courses from Hortonworks, Cloudera, or Big Data University (IBM).

      4 · September 12, 2013

    • A former member

      YouTube is your friend: http://goo.gl/KceS7y

      2 · September 17, 2013

  • A former member

    I wonder if anyone has experience with Drake and, if so, how you like it?

    September 12, 2013

  • Raghavendra N.

    Please let me know if someone in the group is contributing to the open source Hadoop community.

    September 11, 2013

    • Adam M.

      You might want to move this to the general discussion board. I will actually be scheduling a YARN introduction and development example for the group's next session. We will only cover YARN briefly, just enough to explain HOYA (HBase On YARN) ... http://hortonworks.co...

      September 11, 2013

    • Adam M.

      Also, if you want to contribute to Hadoop, check out: http://wiki.apache.or...

      September 11, 2013

  • Dayong

    I'm wondering about coprocessors and secondary indexes.

    August 28, 2013

    • Dayong

      Please give some use cases for them. Has anyone created a secondary index on HBase?

      September 11, 2013

    • Adam M.

      I will try to add that to my material....

      September 11, 2013

  • Rao

    Everybody is talking about Hadoop and big data. I'm very much interested in learning more and participating in this group's meetings.

    September 7, 2013

  • Sean L.

    I would be interested in understanding the differences and pros and cons between HBase and Hive.

    August 28, 2013

    • Adam M.

      Good article. I'm not sure I agree with his conclusion about Cassandra and HBase though. The eventual consistency in Cassandra can be too much of a trade-off and would lead me to use HBase for many high-volume write scenarios. He also fails to give credit to Cassandra for its ease of replication.

      August 29, 2013

    • Edwin C.

      It should also be noted that a lot of these tend to be general rules of thumb that won't necessarily work for all use cases. It's best to do a POC before fully committing to any of these data stores.

      August 29, 2013

  • A former member

    I wonder if anyone has experience developing a Hadoop ETL solution in Cascading and Cascalog. I would love to know how it compares with Pig and other data processing tools.

    August 28, 2013

    • Edwin C.

      Cascading looks like it would make for an interesting discussion / presentation for a future meetup...

      August 29, 2013

  • Dipankar

    I am working in Big Data Analytics and trying to get more comfortable with HBase.

    August 28, 2013

Our Sponsors

  • IBM

    Meeting facilities, expert speakers, free product, books and education.

  • Big Data University

    Free on-line courses in Hadoop and big data related technologies.

  • Cloudera

    10% off training for Toronto Hadoop User Group members.

  • Hortonworks

    Food, speakers, beverages

  • T4G

    Hosting Meeting locations and providing relevant speakers
