February 21, 2012 6:30 PM - 192 attended

Pixable and 10gen at the MongoDB User Group

ZocDoc

Selected By: Francesca Krihely

We will have talks by Pixable and by 10gen engineer Tyler Brock.

Crawling, categorizing and ranking 5 billion photos

How we did it in MySQL and why we are migrating to MongoDB

We have two main challenges at Pixable. The first is how to access millions of photos per day from Facebook, Twitter, Instagram, and other services in the most efficient manner. The second is how to process, organize, index, and store all the metadata related to those photos. To accomplish this, we rely heavily on Amazon Web Services, where we have 100+ servers running. As of today, we are processing the metadata of an average of 20 million new photos per day, which we need to compare, rank, and sort against the more than 5 billion already stored in our database. On top of this, we have an API backend that serves thousands of requests per second, and a logging and analytics system that performs over 10K updates per second in our database.

To support all of our ever-growing database needs, we have built a highly customized MySQL cluster with semi-automatic shards and partitions, optimized every single request, and denormalized most of our data. For large write rates we use our own publish/subscribe frameworks with various database levels and memory buffers (almost everything in Pixable is asynchronous). But in our quest to build a system that would let us scale in an easier, more flexible, and more solid way, we chose MongoDB as our next-generation data storage solution.

In summary, this is the story of how we got to this point in MySQL, what our current challenges are, and how we are progressively migrating our millions of users and billions of photos to MongoDB.

Tyler Brock, 10gen Software Engineer, will give a demo of the new aggregation framework that will be released in v2.2.

4.50 (20 ratings)

10gen

10gen organizes the NY MongoDB User Group

SpringSource

SpringSource hosts the MongoDB User Group

Gilt Groupe

Gilt provides space for the NY MongoDB User Group.

O'Reilly

Members save 40% on print books and 50% on ebooks with discount code DSUG.

Buddy Media

Buddy Media hosts the MongoDB User Group.

ZocDoc

ZocDoc kindly provides space for the MongoDB User Group
