Skip to content
Hadoop Map-Reduce with Cascading

Details

Developing useful applications on Hadoop with raw MapReduce isn’t a trivial task. Cascading framework based on Hadoop MapReduce bridge this gap and may simplify this process significantly.

Cascading( http://www.cascading.org/ ) is the proven application development platform for building data applications on Hadoop. The Cascading SDK embodies these plumbing metaphors and provides equivalent high-level Java constructs and classes to implement your sources, sinks, traps, flows, and pipes.

So We are going to talk about how Cascading can be used with a Beginner level Introduction and Popular wordcount example.

Agenda:

  1. Short Basic Overview of Hadoop and Map-Reduce (For beginners)
  2. Introduction to Cascading and its Need.
  3. Cascading Basics (Source, Sink, Tuple, Pipe, Assembly, Filter, Aggregator etc)
  4. Operations (Max, Min, Avg, Sum, GroupBy, Joins etc)
  5. Cascading Use-cases:
    • How it works with/without Hadoop
    • How Bixo (Web-Mining Tool kit) is built on Cascading ( http://openbixo.org/ )
  6. Demo with Examples: Wordcount, Groupby, Calculating TF-IDF etc.

Contact No: +91-9908599937

Photo of BigData-Blockchain-AI-ML-XR-3Dprinting-Automation-Robotics group
BigData-Blockchain-AI-ML-XR-3Dprinting-Automation-Robotics
See more events
IIIT-Hyderabad campus, Gachibowli, · Hyderabad