Past Meetup

Big Data Science Meetup Event

This Meetup is past

125 people went


Name of this event: Hortonworks Day

This event is Co-sponsored by Hortonworks.

All attendees should pre-download the Hortonworks Sandbox from the following location:

1:30 P.M. - 2:00 P.M. Networking

2:00 P.M. - 2:45 P.M. Session 1

Title: Pig and HCatalog in the Hadoop Ecosystem

Speaker: Alan Gates, Co-founder, Hortonworks

Abstract: Hadoop has many tools for data analysis, including Pig and Hive. Pig is often used for ETL and model building workloads. It also performs well in the case where the schema needs to be decided at read time or where the schema is not consistent. HCatalog opens up Hive's metadata to tools such as Pig, enabling them to interact with data as tables rather than files and to share data easily between tools and external systems. This talk will give a brief introduction to Pig and HCatalog, discuss their place in the Hadoop ecosystem, and cover how to use them together for data analysis.

Speaker Bio: Alan Gates is a co-founder of Hortonworks. He is an original member of the engineering team that took Pig from a Yahoo! Labs research project to a successful Apache open source project. Alan also designed HCatalog and guided its adoption as an Apache Incubator project. Alan has a BS in Mathematics from Oregon State University and a MA in Theology from Fuller Theological Seminary. He is also the author of Programming Pig (, a book from O’Reilly Press. Follow Alan on Twitter: @alanfgates (!/alanfgates).

2:45 P.M. - 3:00 P.M. Q/A

3:00 P.M. - 3:45 P.M. Session 2

Title: Making Hive 100x Faster

Speaker: Grant Liu, Solutions Engineer, Hortonworks

Speaker Bio: An app developer who eventually found his way to Big Data. Now rolls with the crew that is an integral part of the community that made Apache Hadoop what it is today - and he's pretty happy about that. Also likes long walks on the beach.

3:45 P.M. - 4:00 P.M. Q/A

4:00 P.M. - 5:00 P.M. Networking

Coffee and Light Snacks will be available.

See all Meetups from Big Data Science