Hadoop & Big Data


Details
In this FREE session, Big Data/Hadoop/Spark Evangelist, Subash D'Souza will provide an introduction to Hadoop, the open source project that allows organizations to process, store and analyze massive application datasets.
Hadoop, serves as the data processing engine behind some of the world's largest and most popular Internet businesses including Google, Yahoo! and Facebook. Subash will cover the most frequently asked questions about Hadoop / MapReduce as well as tips to designing, deploying, and maintaining a Hadoop cluster.
See important parking info here (http://www.southbaymobileusergroup.com/documents/teamone_parking_info.pdf)
http://photos3.meetupstatic.com/photos/event/c/8/a/3/600_440871363.jpeg
SESSION AGENDA
Why the World Needs Hadoop
• What is Apache Hadoop?
• How Did Apache Hadoop Originate?
• The Economics of Hadoop
• Common Use Cases
Fundamental Concepts
• How Hadoop Differs from other Distributed Computing Architectures
• High-Level Architecture
• The Anatomy of the Cluster
HDFS: The Hadoop Distributed Filesystem
• Comparison to Standard Filesystems
• HDFS Replication and Reliability
MapReduce
• Data Processing with MapReduce
• Thinking in MapReduce
• Hadoop Streaming
• Visual Overview of Job Execution
• Hadoop’s Java API for MapReduce
Using Apache Hadoop Effectively
• Partitioning the Keyspace
• Improving Performance with a Combiner
• Tips for Running at Scale
• When Hadoop is Not the Right Choice
The Hadoop Ecosystem
• Apache Flume
• Apache Sqoop
• Apache Hive
• Apache Pig
• Apache HBase
• Apache Mahout
• Hadoop Versions and Distributions
http://photos1.meetupstatic.com/photos/event/c/8/1/d/600_440871229.jpeg
About the Speaker Subash D’Souza Recognized as a Champion of Big Data by Cloudera
Big Data/Hadoop/Spark Evangelist
Co-Organizer- Los Angeles Hadoop User Group
Organizer- Los Angeles HBase User Group
Organizer -Big Data Day LA
Speaker – Big Data Day LA 2013
Speaker- Hadoop Innovation Summit San Diego 2014
Leading a BOF Session at Hadoop Summit Europe 2014

Hadoop & Big Data