I am excited to announce the next Hive meetup.
Where: Forward Internet Group, Floor 2, Centro 3, 19 Mandela Street, NW1, London
When: 6:30 pm, Thursday 11th of October 2012
What: Drinks, food, networking, and two excellent speakers!
SnowPlow: How Apache Hive and other big data technologies are transforming web analytics
Yali is a cofounder of SnowPlow - an open source web analytics platform that puts customer-level and event-level data directly in the hands of web analysts and data scientists. SnowPlow uses Hive extensively, both for ETL and data analysis. In this talk, Yali will give an overview of SnowPlow and explain how big data technologies, including Hive, have the potential to shake up the web analytics industry. In the second part of his talk, he'll explain precisely how Hive is used at SnowPlow, looking particularly at the strengths and weaknesses of Hive versus other MapReduce and columnar database alternatives.
Your Hive honeymoon can be cut short if you don't take the necessary precautions. In this talk I'll share my experience with Hive over the last 3 years (in Elastic MapReduce and Cloudera CDH3), describing what I got wrong the first time around, and what eventually saved the day. I've used Hive in environments handling anywhere from a few million to a few billion events a day, so hopefully there'll be something for everyone.
The Forward Internet Group has agreed to host and sponsor the second meetup with drinks and food. They have an awesome place so this meetup is set to be fun and comfortable.
A big thank you to Forward Internet Group, and to Andrew and Paul for helping to organise the event!