MapReduce Design Patterns

Austin ACM SIGKDD is presenting a weekly series on MapReduce Design Patterns. It is based on the book, "MapReduce Design Patterns" by Donald Miner and Adam Shook.

We meet either on Wednesday nights at 7:00 or on Saturdays at 1:00, depending on the availability of the meeting room at the Northwest Recreation Center.

If you would like to present one of the design patterns, please let me know. Any pattern without a listed presenter is open; I will present any that remain unclaimed.

Textbooks

MapReduce Design Patterns: Building Effective Algorithms and Analytics for Hadoop and Other Systems by Donald Miner and Adam Shook

http://www.amazon.com/MapReduce-Design-Patterns-Effective-Algorithms/dp/1449327176/ref=sr_1_4?ie=UTF8&qid=1355520026&sr=8-4&keywords=hadoop

Hadoop: The Definitive Guide, 3rd ed., by Tom White

http://www.amazon.com/Hadoop-Definitive-Guide-Tom-White/dp/1449311520/ref=sr_1_1?ie=UTF8&qid=1355520684&sr=8-1&keywords=hadoop

Code and Data

Code for MapReduce Design Patterns

https://github.com/adamjshook/mapreducepatterns

Data for MapReduce Design Patterns

http://www.sonicartifacts.com/mrdp/users.xml.gz

http://www.sonicartifacts.com/mrdp/posts.xml.gz

http://www.sonicartifacts.com/mrdp/comments.xml.gz

Code for Hadoop: The Definitive Guide (includes the data)

https://github.com/tomwhite/hadoop-book

 

Syllabus


Week 1, Wednesday, Jan 23, 2013, 7:00, Overview of Hadoop, Northwest Recreation Center, Presenter - David Boney


Week 2, Wednesday, Jan 30, 2013, 7:00, Hadoop API - Readers and Writers, Northwest Recreation Center, Presenter - David Boney


Week 3, Wednesday, Feb 6, 2013, 7:00, Basic Hadoop Programming - Word Count Program, Northwest Recreation Center, Presenter - David Boney


Week 4, Saturday, Feb 16, 2013, 1:00, MapReduce Design Patterns Overview, Northwest Recreation Center, Presenter - David Boney

 

Week 5, Saturday, Feb 23, 2013, 1:00, Numerical Summarizations Pattern, Northwest Recreation Center, Presenter - David Boney


Week 6, Saturday, Mar 2, 2013, 1:00, Inverted Index Summarizations Pattern, Northwest Recreation Center, Presenter - David Walling


Week 7, Saturday, Mar 9, 2013, 1:00, Counting with Counters Pattern, Northwest Recreation Center

 

Week 8, Wednesday, Mar 13, 2013, 7:00, Filtering Pattern, Northwest Recreation Center, Presenter - Jacob Silva


Week 9, Saturday, Mar 23, 2013, 1:00, Bloom Filtering Pattern, Northwest Recreation Center


Week 10, Wednesday, Mar 27, 2013, 7:00, Top Ten Pattern, Northwest Recreation Center

 

Week 11, Distinct Pattern, Northwest Recreation Center


Week 12, Structured to Hierarchical Pattern, Northwest Recreation Center


Week 13, Partitioning Pattern, Northwest Recreation Center

 

Week 14, Binning Pattern, Northwest Recreation Center

 

Week 15, Total Order Sorting Pattern, Northwest Recreation Center, Presenter - Omar Odibat


Week 16, Shuffling Pattern, Northwest Recreation Center


Week 17, Reduce Side Join Pattern, Northwest Recreation Center, Presenter - Misty Nodine


Week 18, Replicated Join Pattern, Northwest Recreation Center, Presenter - Rodger Nasr


Week 19, Composite Join Pattern, Northwest Recreation Center


Week 20, Cartesian Product Pattern, Northwest Recreation Center


Week 21, Job Chaining Pattern, Northwest Recreation Center


Week 22, Chain Folding Pattern, Northwest Recreation Center


Week 23, Job Merging Pattern, Northwest Recreation Center


Week 24, Custom Input and Output in Hadoop Pattern, Northwest Recreation Center


Week 25, Generating Data Pattern, Northwest Recreation Center


Week 26, External Source Output Pattern, Northwest Recreation Center


Week 27, External Source Input Pattern, Northwest Recreation Center


Week 28, Partition Pruning Pattern, Northwest Recreation Center

 


Our Sponsors

  • Association for Computing Machinery

    Parent Organization

  • ACM SIGKDD

    We are the local Austin chapter of ACM SIGKDD

  • Visa

    Meeting space + pizza for the "ML with Python" course.

  • HomeAway

    Hosting the lecture on 7/30

  • PayPal

    Meeting space for the "Introduction to Spark" course.
