addressalign-toparrow-leftarrow-rightbackbellblockcalendarcameraccwchatcheckchevron-downchevron-leftchevron-rightchevron-small-downchevron-small-leftchevron-small-rightchevron-small-upchevron-upcircle-with-checkcircle-with-crosscircle-with-pluscrosseditemptyheartexportfacebookfolderfullheartglobegmailgoogleimageimagesinstagramlinklocation-pinmagnifying-glassmailminusmoremuplabelShape 3 + Rectangle 1outlookpersonplusprice-ribbonImported LayersImported LayersImported Layersshieldstartickettrashtriangle-downtriangle-uptwitteruseryahoo

MapReduce Design Patterns

Austin ACM SIGKDD is presenting a weekly series on MapReduce Design Patterns. It is based on the book, "MapReduce Design Patterns" by Donald Miner and Adam Shook.

We meet either on Wednesday nights at 7:00 or Saturday at 1:00, depending on the availability of the meeting room at Northwest recreation center.

If you want to sign up and present one of the design patterns, please let me know. Any pattern without a presenter is open. Any patterns without a presenter, I will present.

Textbooks

MapReduce Design Patterns: Building Effective Algorithms and Analytics for Hadoop and Other Systems by Donald Miner and Adam Shook

http://www.amazon.com/MapReduce-Design-Patterns-Effective-Algorithms/dp/1449327176/ref=sr_1_4?ie=UTF8&qid=1355520026&sr=8-4&keywords=hadoop

Hadoop: The Definitive Guide, 3rd ed., by Tom White

http://www.amazon.com/Hadoop-Definitive-Guide-Tom-White/dp/1449311520/ref=sr_1_1?ie=UTF8&qid=1355520684&sr=8-1&keywords=hadoop

Code and Data

Code for Map Reduce Design Patterns

https://github.com/adamjshook/mapreducepatterns

Code for Hadoop: The Definitive Guide - this includes the data

https://github.com/tomwhite/hadoop-book

Syllabus


Week 1, Wednesday, Jan 23, 2013, 7:00, Overview of Hadoop, Northwest Recreation Center,  Presenter - David Boney


Week 2, Wednesday, Jan 30, 2013, 7:00, Hadoop API - Readers and Writers, Northwest Recreation Center, Presenter - David Boney


Week 3,  Wednesday, Feb 6, 2012, 7:00, Basic Hadoop Programming - Word Count Program, Northwest Recreation Center,  Presenter - David Boney


Week 4, Saturday, Feb 16, 2013, 1:00, Map Reduce Design Patterns Overview, Northwest Recreation Center,  Presenter - David Boney

 

Week 5, Saturday, Feb 23, 2012, 1:00, Numerical Summarizations Pattern, Northwest Recreation Center, Presenter - David Boney


Week 6, Saturday, Mar 2, 2012, 1:00, Inverted Index Summarizations Pattern, Northwest Recreation Center, Presenter - David Walling


Week 7, Saturday, Mar 9, 2012, 1:00, Counting with Counters Pattern, Northwest Recreation Center

 

Week 8, Wednesday, Mar 13, 2012, 7:00, Filtering Pattern, Northwest Recreation Center, Presenter - Jacob Silva


Week 9, Saturday, Mar 23, 2013, 1:00, Top Ten & Distinct Patterns, Northwest Recreation Center, Presenter - David Boney


Week 10, Wednesday, Mar 27, 2013, 7:00, Structure to Hierarchical Pattern, Northwest Recreation Center, Presenter - David Boney

 

Week 11, Saturday, April 6, 2013, 1:00, QR Factorization, Northwest Recreation Center, Presenter - Choudur K. Lakshminarayan

 

Skip a week

 

Week 12, Saturday, April 20, 2013, 1:00, Bloom Filtering, Northwest Recreation Center, Presenter - Ram Kosurus


Week 13, Saturday, April 27, 1:00, Partitioning and Binning Patterns, Northwest Recreation Center, Presenter - Roger Nasr

 

Skip a week

 

Week 14, Wednesday, May 8, 2013, 7:00, Total Order Sorting and Shuffling Patterns, Northwest Recreation Center, Presenter - Omar Odibat

 

Week 15, Saturday, May 18, 2013, 1:00, Reduce Side Joins, Northwest Recreation Center, Presenter - Ram Kosurus

 

Week 16, Wednesday, May 22, 2013, 7:00, Replicated Join Pattern, Northwest Recreation Center, Presenter -


Week 17, Wednesday, May 29, 2013, 7:00, Composite Join Pattern, Northwest Recreation Center, Presenter - Robert Justice


Week 18, Cartesian Produce Pattern, Northwest Recreation Center


Week 19, Job Chaining Pattern, Northwest Recreation Center


Week 20, Chain Folding Pattern, Northwest Recreation Center


Week 21, Job Merging Pattern, Northwest Recreation Center


Week 22, Custom Input and Output in Hadoop Pattern, Northwest Recreation Center


Week 23, Generating Data Pattern, Northwest Recreation Center


Week 24, External Source Output Pattern, Northwest Recreation Center


Week 25, External Source Input Pattern, Northwest Recreation Center


Week 26, Partition Prunning Pattern, Northwest Recreation Center

 

 

Join or login to comment.

4 went

Our Sponsors

  • Visa

    Proud sustaining sponsor of Austin ACM KDD +sponsor of the ML courses.

  • HomeAway

    Proud sustaining sponsor of Austin ACM KDD

  • Actian

    ACTIAN ANALYTICS PLATFORM ARCHITECTURE — Open, Fast and Enterprise-Grade

  • Cloudera

    Gold Pledge Sponsor for the Large Scale Machine Learning Workshop

  • AWS

    Platinum Pledge Sponsor for the Large Scale Machine Learning Workshop

  • Association for Computing Machinery

    Parent Organziation

  • ACM SIGKDD

    We are the local Austin chapter of ACM SIGKDD

People in this
Meetup are also in:

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy