Hadoop has become the established tool for dealing with big data, and one of the largest public data sets available comes from Twitter. Using several tools from the Hadoop ecosystem, Twitter data can be efficiently processed and analyzed. Join us in May as two big data experts present their work on building complete systems to handle Twitter data.
6:30 PM -- Networking & Wraps from Santoni's
6:55 PM -- Greetings
7:00 PM -- Analyzing Twitter Data with Hadoop - Joey Echeverria
8:00 PM -- Working With Mahout - Sean Busbey
8:30 PM -- Time for Drinks at Little Havana
Google Maps does a good job of pinpointing the exact building location based on the address above.
Parking is not an issue. You can park on the street nearby or you can take up any Visitor spot that you can find. Also, feel free to park in the Ad.com/AOL employee parking lot but not in spaces marked Under Armour or Reserved.
Joey Echeverria is a Principal Solutions Architect at Cloudera, where he works directly with customers to deploy production Hadoop clusters and solve a diverse range of business and technical problems. Joey joined Cloudera from the NSA, where he worked on data mining, network security, and clustered data processing using Hadoop. Prior to working full time for the NSA, Joey attended Carnegie Mellon University, where he earned an M.S. and a B.S. in Electrical and Computer Engineering.
Sean Busbey is a Solutions Architect at Cloudera, where he works with customers to architect, implement, and optimize Big Data solutions for a diverse range of use cases for CPG, Interactive Entertainment, Advertising Analytics, and Federal clusters. Sean previously worked as a Software Engineer on a Big Data team at the NSA. Prior to working full time for the NSA, Sean attended the University of Illinois at Urbana-Champaign, where he earned a B.S. in Computer Science.
Analyzing Twitter Data with Hadoop - Social media has gained immense popularity with marketing teams, and Twitter is an effective tool for a company to get people excited about its products. Twitter makes it easy to engage users and communicate directly with them, and in turn, users can provide word-of-mouth marketing for companies by discussing the products. Given limited resources, and knowing that they may not be able to talk directly to everyone they want to target, marketing departments can be more efficient by being selective about whom they reach out to. In this talk, Joey will describe how you can use Apache Flume, Apache HDFS, Apache Oozie, and Apache Hive to design an end-to-end data pipeline for analyzing Twitter data.
Working With Mahout - Once the end-to-end pipeline is established, what insights can be gained? Sean will continue the Twitter analysis by describing how machine learning and data mining algorithms can be applied to the data.
Cloudera is the leader in Apache Hadoop-based software and services and offers a powerful new data platform that enables enterprises and organizations to look at all their data, structured as well as unstructured, and ask bigger questions for unprecedented insight at the speed of thought. With some of the top minds in Big Data on its team, including Doug Cutting, who invented Hadoop, Cloudera enhances the storage and processing technologies originally developed by the world’s biggest Web companies. Today, Cloudera is the market leader in Hadoop with tens of thousands of nodes under management, as well as the top contributor of code to the Hadoop ecosystem. Markets include financial services, government, telecommunications, media, web, advertising, retail, energy, bioinformatics, pharma/healthcare, university research, oil and gas, gaming and more.
All of the employee lots are up for grabs after 5 PM. This means the two huge gravel/sand-looking lots are fair game: one off of Hull Street and the other off of Key Hwy in front of Domino Sugar. There are also about 75 visitor spots all around the buildings that attendees can park in. An alley between our building and the one in front of ours can hold at least 25 cars; most people don’t realize that they can turn down there. There is also street parking on Hull. Lastly, any AOL spot is up for grabs. The UA reserved spots and Zipcar-only spots are not for public use.
1325 Key Highway
Baltimore, MD 21230