47th Bay Area Hadoop User Group (HUG) Monthly Meetup

Address: Classrooms 4/5, Building C, Yahoo Sunnyvale campus

Detailed agenda and summaries to follow. General agenda:

6:00 - 6:30 - Socialize over food and beer(s)

6:30 - 7:00 - This ain't your Father's Search Engine

7:00 - 7:30 - Securing data in Hadoop using Apache Hive

7:30 - 8:00 - Comprehensive, Centralized Security for Hadoop

Session I (6:30 - 7:00 PM) – This ain't your Father's Search Engine

In just a few short years, search has evolved from a small text box in the nether regions of a website to being front and center in our lives. Increasingly, the combination of search engine and Hadoop technology is also being used for practical, real-time recommendations, event processing, complex spatial functionality and time series analysis, capable not only of matching users' queries in text but also of driving real-time decision making and analytics. In fact, open source Apache Lucene/Solr can do all of this and more by taking advantage of new data structures and algorithms as well as deeper integration with Hadoop and related projects. In this demo-driven talk, Lucene committer Grant Ingersoll will take a look at some of the new and exciting ways users are leveraging Lucene, Solr and big data to drive deeper insight into information needs that go beyond keywords in a text box.

Speaker: Grant Ingersoll, CTO and co-founder, LucidWorks


Grant Ingersoll is the CTO and co-founder of LucidWorks as well as an active member of the Lucene community: a Lucene and Solr committer, co-founder of the Apache Mahout machine learning project and a long-standing member of the Apache Software Foundation. Grant’s prior experience includes work in natural language processing and information retrieval at the Center for Natural Language Processing at Syracuse University. Grant earned his B.S. in Math and Computer Science from Amherst College and his M.S. in Computer Science from Syracuse University. Grant is also the co-author of “Taming Text” from Manning Publications.

Session II (7:00 - 7:30 PM) – Securing data in Hadoop using Apache Hive

Apache Hive 0.13 shipped with support for SQL standards-based authorization. It lets users manage access control using familiar SQL grant/revoke statements with users and roles. The model also facilitates the development of more complex access control patterns, such as restricting access to table data at the column or row level when used in conjunction with views.

This is the third authorization mode supported in Hive. In this talk, we will discuss how this compares with other available authorization modes. We will also discuss how this can be used in conjunction with Storage Based Authorization to address the different use cases of Hive.
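
As a rough illustration of the grant/revoke-plus-views pattern described in this abstract, the Python sketch below only assembles HiveQL statement strings; it does not connect to Hive. The function name and the table, view, role, and user names are all hypothetical, and the statements assume that SQL standards-based authorization is enabled and that the current user is allowed to create roles.

```python
def restricted_view_statements(table, view, columns, role, user, row_filter=None):
    """Build HiveQL statements that expose only some columns/rows of a table.

    Hypothetical helper for illustration: it returns strings and never
    talks to a Hive server.
    """
    # The view is what limits access: it projects a subset of columns and,
    # optionally, filters rows.
    select = f"SELECT {', '.join(columns)} FROM {table}"
    if row_filter:
        select += f" WHERE {row_filter}"
    return [
        f"CREATE ROLE {role}",                    # typically requires the admin role
        f"CREATE VIEW {view} AS {select}",
        f"GRANT SELECT ON {view} TO ROLE {role}",  # privilege on the view, not the base table
        f"GRANT {role} TO USER {user}",            # user inherits the role's privileges
    ]


# Example with made-up names: analysts see two columns of US rows only.
statements = restricted_view_statements(
    table="sales",
    view="sales_us",
    columns=["order_id", "amount"],
    role="analyst",
    user="alice",
    row_filter="region = 'US'",
)
for statement in statements:
    print(statement + ";")
```

Because no privilege is granted on the base table itself, members of the role can read only what the view exposes.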

Speaker: Thejas Nair, Software Engineer, Hortonworks


Thejas Nair is a software engineer working on Apache Hive and Apache Pig at Hortonworks. He is a committer and PMC member of these Apache projects. His most recent work has focused on improving security features in Hive. Previously, he worked at Yahoo for 9 years, developing solutions for large-scale distributed data processing.

Speaker: Chris Drome, Technical, Yahoo


Chris Drome is the tech lead for Hive/HCat at Yahoo.

Session III (7:30 - 8:00 PM) – Comprehensive, Centralized Security for Hadoop

With the advent of YARN, enterprises can adopt a true data lake architecture using Hadoop, supporting multiple use cases and applications within the same platform. With the multi-tenant environment come the challenges of protecting sensitive data, controlling access and monitoring behavior across multiple user groups and different datasets. There is an increased focus on data privacy and compliance controls. Data security is now an important pillar in the enterprise Hadoop strategy. Enterprises are looking for enhanced support across authentication, authorization, auditing and data protection, with a centralized framework for managing security in one place. The open source community, along with Hortonworks, is committed to bringing comprehensive security across the Hadoop platform.

In this technical session, we’ll talk about the current work on enabling comprehensive security across the Hadoop platform, with centralized security administration, capabilities for fine-grained authorization, detailed auditing across HDFS, Hive and HBase, and data protection.

Speaker: Bosco Durai, Enterprise Security Architect, Hortonworks


Bosco Durai is an Apache committer who currently works at Hortonworks, focused on enabling enterprise-grade security within the Hadoop platform. Bosco brings years of experience building and managing enterprise data security products. Before Hortonworks, Bosco was the co-founder and Chief Security Architect of the big data security startup XA Secure. XA Secure was built from the ground up to address the unique security challenges that big data environments bring, and was acquired by Hortonworks in May 2014. Bosco was also a co-founder of Bharosa, a fraud detection startup that was acquired by Oracle in 2007.



  • Lee

    Register and attend 23rd Big Data Bootcamp Jan 16-18 Santa Clara Convention Center. (Hadoop 2.0, Cassandra, Advanced Spark, R, Hadoop Performance, MongoDB, Hive, Machine Learning, Data Science) Workshops, Hadoop Security, VC investment in 2015 & Use cases will be covered at the event.

    Register: http://bit.ly/1CdwNX0 using discount code "MEETUP" to get a discount.

    January 13

  • Jiten G.

    Looking forward to the next meetup event. Meanwhile, I would like to give a shout-out that my team is looking for a Java developer with Big Data experience. We are located in Mountain View and would like to move fast with this.

    Please message me for more info.

    September 26


    Appreciate mentors !!!

    2 · August 20

  • Yahoo! HUG O.

    Wonderful presentations. Thank you to all the presenters and to the folks who attended.

    August 20

  • Ron C.

    Hi, I am planning on attending my 1st HUG meeting this week. Since it's at Yahoo are there any check-in details I need to know about? Thanks!

    August 18

    • Yahoo! HUG O.

      It is a secure facility, but someone should be there to open the door. There are two doors to URLs 2nd floor, one from the 1st floor and one through the garage. Try both if you run into issues.

      August 20

  • Huzefa S.

    Would there be a recording or live streaming of the event?

    August 20

    • Yahoo! HUG O.

      The recording will be posted a few days after the event.

      August 20

Our Sponsors

  • Yahoo! Inc.

    Meeting space, pizza and drinks are sponsored by the Yahoo! Hadoop team.
