Rapid Data Exploration with Hadoop (LinkedIn) 3/18 LinkedIn is the premiere professional social network with over 60 million users and a new user joining every second. One of LinkedIn's strategic advantages is their unique data. While most organizations consider data as a service function, LinkedIn considers data a cornerstone of their product portfolio. To rapidly develop these products LinkedIn leverages a number of technologies including open source, 3rd party solutions, and some we've had to invent along the way. This talk will discuss some best practices for quickly uncovering patterns, visualizing trends, and generating actionable insights from large datasets. We will also walk through a few basic examples that use Hadoop, Pig, Voldemort, & Python to perform tasks related to trend detection, spatial analytics, & collaborative filtering.
Pete Skomoroch is a Research Scientist at LinkedIn, focusing on building data driven products. For the past several years, he was founder of Data Wrangling in Washington, DC, working on projects involving search, finance, and recommendation systems. Previously, he was the Director of Advanced Analytics at Juice Analytics and a Sr. Research Engineer at AOL Search.