Hadoop Distribution (Hortonworks) - Scaling and Performance
Details
Agenda
-
Technical overview of Hortonworks Framework
-
Vendor use cases of large scale Hadoop deployments
-
Key factors impacting deployment
- Storage and Bandwidth vs Latency
- block size, replication factor
- map, copy/shuffle, and reduce phase tuning parameters
- compression, etc.,
- Horizontal scaling vs Data center foot print