
Machine Learning and Solbase

**To folks who plan on joining us for networking at the Wynkoop afterwards: bring cash!**

Food, soda, beer and networking. We are grateful to TEKsystems for their continued sponsorship of the food and soda! Also, a big thanks to our new sponsor BWBacon Group for supplying the beer.

6:00-7:00: BASIC CONCEPTS: Solbase and Realtime Activity with Kyungseog Oh, Josh Hollander and Ron White
Abstract: Solbase, the real-time open-source search engine, is now available on GitHub. Solbase was developed by Photobucket and is built upon Lucene, Solr and HBase. Photobucket has also recently released a real-time community activity stream capturing the 4 million daily uploads as well as all of your friends' comments and favorite photos. The system is built on HBase and also employs Kestrel queues. This talk will cover the architecture and implementation details, and share many of the lessons learned while developing this real-time big data system.


Kyungseog Oh: Kyungseog joined Photobucket in 2008 as a developer on the search team. Since then, Kyungseog has become the search guru within Photobucket and led the migration of a stand-alone Lucene search engine to a distributed architecture using Solr. Prior to Photobucket, Kyungseog developed web analytics software at Omniture. Kyungseog holds a Bachelor's degree and a Master's degree in Computer Science from the University of Utah.

Josh Hollander: Josh joined Photobucket as a developer in 2011 and has been primarily focused on performance and scalability. Prior to joining Photobucket, Josh developed local search applications for Local Matters. Josh holds a Bachelor's degree in Computer Science from Calvin College and a Master's in Computer Science from the University of Colorado, Denver.

Ron White: Ron joined Photobucket as a senior developer on the infrastructure team in 2008. Since that time he has worked on several areas of the web site as well as Photobucket's open API, and developed the original Photobucket Mobile app for Android. Prior to working at Photobucket, Ron worked for 10 years as a developer for Raytheon and BIT Systems, sub-contracting on government projects. Ron holds a Bachelor's degree in Physics from the Colorado School of Mines and a Master's in Electrical Engineering from Princeton University.



MAIN PRESENTATION: Machine Learning with Apache Mahout with Daniel Glauser 


Have you wondered why you were being asked a certain question when the system should already know the answer? Well, if it can’t know the answer then at least it can make a good suggestion, right? As a programmer, how would you write code to handle this? Could you find patterns in the data that are not obvious even with lots of domain knowledge? Machine Learning to the rescue. We’ll explore how, with a little math and a large data set, you can quickly construct a recommendation engine. There are numerous algorithms to choose from, and we will spend time reviewing their strengths and weaknesses. Data sets can quickly grow too large for a single system, so we will also explore how things change when you distribute the work. All examples will be in Clojure using the Apache Mahout library.


Daniel Glauser has spoken on Clojure for audiences in Denver, Boulder, Colorado Springs and Trivandrum, India. Daniel organizes the where he focuses on working with strong members of the Clojure community to teach everything from beginning functional programming to advanced concurrency patterns. Daniel is a software architect with over thirteen years of experience working for companies like Comcast, NBC-Universal and BellSouth. Currently Daniel works as a software architect for VMware in Colorado Springs, where he is working on large-scale cloud management systems. Daniel's interests include functional programming, big data, distributed systems, logic systems, and enterprise architecture. In Colorado, Daniel has spoken at DJUG, DOSUG, BJUG and CSOSUG.

8:45 p.m.: Door prizes
9:00 p.m.: After-meeting networking at Wynkoop Brewing Company. Very special thanks to ReadyTalk for their sponsorship of food at this location.


  • Zettie C.

    You can view the slides from the Photobucket/Solbase and Realtime Activity presentation at

    February 10, 2012

  • Dan L.

    Someone left their pocket chopsticks/utensils set here at FullContact. Feel free to swing by to pick them up.

    February 9, 2012

  • Tom F.

    Doug McKenan, Kyungseog Oh, Josh Hollander and Ron White from Photobucket gave the best group presentation about the challenges of managing data for social networking. The topics flowed nicely from Solbase, through Activity Stream, to HBase.
    Daniel Glauser introduced us to machine learning and how Mahout ties it to Hadoop. The use of Clojure was inspired.

    February 9, 2012

  • Paul R.

    Great presentations, both from Photobucket and Daniel. Learned a lot, and really enjoyed the evening.

    February 9, 2012

42 went
