Skip to content

44th Bay Area Hadoop User Group (HUG) Monthly Meetup

Photo of Yahoo! HUG Organizer
Hosted By
Yahoo! HUG O.
44th Bay Area Hadoop User Group (HUG) Monthly Meetup

Details

Agenda

6:00 - 6:30 - Socialize over food and beer(s) 6:30 - 7:00 - Integrate Hue with existing Hadoop Cluster 7:00 - 7:30 - Apache Phoenix: SQL skin over HBase 7:30 - 8:00 - Apache Sentry: Enterprise-grade Security for Hadoop Session I (6:30 - 7:00 PM) - How to integrate Hue with your existing Hadoop clusters

Hue is a Web interface for analyzing data with Apache Hadoop. It provides integration to Hive, Pig, Impala, Spark, Oozie, HBase, Solr, Sqoop2, ZooKeeper and more. Hue’s target is the Hadoop user experience and lets users focus on quick data processing. Hue is a mature Web project that integrates into a single UI the Hadoop components and their main satellite projects.

This talk will describe how Hue can be integrated with existing Hadoop deployments with minimal changes/disturbances. Romain will cover details on how Hue can leverage the existing authentication system and security model of your company. He will also cover the Hive/Shark/Pig/Oozie best practice setup for Hue.

Speaker: Romain Rigaux, Software Engineer, Cloudera

Bio:

Romain is the Lead Engineer working on Hue. Before joining Cloudera, he previously used intensively MapReduce, Oozie and Pig at Yahoo! Search since the early days of Hadoop. He has been developing websites for more than 10 years. He holds a MSCS from the Georgia Institute of Technology and the Université de Technologie de Compiègne.

Session II (7:00 - 7:30 PM) – Apache Phoenix: SQL skin over HBase

Apache Phoenix is a SQL skin over HBase delivered as a client-embedded JDBC driver targeting low latency queries over HBase data. In the talk, I'll present the motivations that we need Phoenix on top of HBase, main features in Phoenix, new features in Phoenix 4.0 which will be included in HDP(Hortonworks Data Platform)2.1, roadmap and Q&A. If time permits, will run a quick demo showing secondary index in action & table join.

Speaker: Jeffrey Zhong, Software Engineer Hortonworks

Bio:

Jeffrey Zhong is the member of HBase team at Hortonworks. He is Phoenix Committer and HBase committer.

Session III (7:30 - 8:00 PM) - Apache Sentry: Enterprise-grade Security for Hadoop

Apache Hadoop offers strong support for authentication and coarse grained authorization - but this is not necessarily enough to meet the demands of enterprise applications and compliance requirements. Providing fine-grained access to data will enable organizations to store more sensitive information in Hadoop; only those users with the appropriate privileges will ever see that sensitive data.

Apache Sentry (incubating) is a new open source authorization module that integrates with Hadoop-based SQL query engines (Apache Hive and Impala) as well as interactive search (Cloudera search). In this talk, we will provide details on its implementation, as well as a short demo on how it enables secure access to data on Hadoop through various ecosystem components like Impala, Hive, and Search.

Speakers: Srayva Tirukkovalur (Software Engineer, Cloudera ) and Xuefu Zhang (Software Engineer, Cloudera )

Bio:

Srayva Tirukkovalur is a Software Engineer at Cloudera. She is a Committer on Apache Sentry (incubating) and contributor on Apache Flume.

Xuefu Zhang has over 10 year's experience in software development. Working for Cloudera since May 2013, he spends a lot of his efforts on Apache Hive and Pig. Prior to joining Cloudera, Xuefu Zhang served for Inadco, an online ads serving company, as the chief architect.

Yahoo Campus Map:

Detail map (http://photos4.meetupstatic.com/photos/event/2/8/e/d/600_21370477.jpeg)

Location on Wikimapia:

http://www.wikimapia.org/#lat=37.4181633&lon=-122.0250607&z=18&l=0&m=b&search=yahoo

Photo of Bay Area Hadoop Meetup group
Bay Area Hadoop Meetup
See more events