Skip to content

Hadoop & Big Data

Photo of Phillip North
Hosted By
Phillip N.
Hadoop & Big Data

Details

In this FREE session, Big Data/Hadoop/Spark Evangelist, Subash D'Souza will provide an introduction to Hadoop, the open source project that allows organizations to process, store and analyze massive application datasets.

Hadoop, serves as the data processing engine behind some of the world's largest and most popular Internet businesses including Google, Yahoo! and Facebook. Subash will cover the most frequently asked questions about Hadoop / MapReduce as well as tips to designing, deploying, and maintaining a Hadoop cluster.

See important parking info here (http://www.southbaymobileusergroup.com/documents/teamone_parking_info.pdf)

http://photos3.meetupstatic.com/photos/event/c/8/a/3/600_440871363.jpeg

SESSION AGENDA

Why the World Needs Hadoop

• What is Apache Hadoop?

• How Did Apache Hadoop Originate?

• The Economics of Hadoop

• Common Use Cases

Fundamental Concepts

• How Hadoop Differs from other Distributed Computing Architectures

• High-Level Architecture

• The Anatomy of the Cluster

HDFS: The Hadoop Distributed Filesystem

• Comparison to Standard Filesystems

• HDFS Replication and Reliability

MapReduce

• Data Processing with MapReduce

• Thinking in MapReduce

• Hadoop Streaming

• Visual Overview of Job Execution

• Hadoop’s Java API for MapReduce

Using Apache Hadoop Effectively

• Partitioning the Keyspace

• Improving Performance with a Combiner

• Tips for Running at Scale

• When Hadoop is Not the Right Choice

The Hadoop Ecosystem

• Apache Flume

• Apache Sqoop

• Apache Hive

• Apache Pig

• Apache HBase

• Apache Mahout

• Hadoop Versions and Distributions

http://photos1.meetupstatic.com/photos/event/c/8/1/d/600_440871229.jpeg

About the Speaker Subash D’Souza Recognized as a Champion of Big Data by Cloudera
Big Data/Hadoop/Spark Evangelist
Co-Organizer- Los Angeles Hadoop User Group
Organizer- Los Angeles HBase User Group
Organizer -Big Data Day LA
Speaker – Big Data Day LA 2013
Speaker- Hadoop Innovation Summit San Diego 2014
Leading a BOF Session at Hadoop Summit Europe 2014

Photo of Code District group
Code District
See more events
Team One Advertising
13031 W Jefferson Blvd, Space 800 · Los Angeles, CA