align-toparrow-leftarrow-rightbackbellblockcalendarcamerachatcheckchevron-downchevron-leftchevron-rightchevron-small-downchevron-small-leftchevron-small-rightchevron-small-upchevron-upcircle-with-crosscrosseditfacebookglobegoogleimagesinstagramlocation-pinmagnifying-glassmailmoremuplabelShape 3 + Rectangle 1outlookpersonplusImported LayersImported LayersImported Layersshieldstartwitteryahoo

Recommendation Engines & Accumulo - Sqrrl Tues. May 21 @6pm MST

University of Colorado Denver - Tuesday May 21, 2013 @ 6:00pm MST

Livestream Link for Recommendation Engines & Accumulo:  http://www.ustream.tv/channel/recommendationengines

Livestream starts at 6:15pm MST

Large auditorium (170 person capacity) with 20' screen.

Location: CU Denver - North Classroom #1539 - 1200 Larimer Street
Denver, CO[masked] - Map: http://bit.ly/Tyznzg

Agenda:

6:00 - 6:15 Schmooze - Old Chicago Pizza will be served.

6:15 - 7:30 Recommendation Engines by Tom Rampley

7:30 - 8:30 Accumulo - Sqrrl NoSQL Database by John Dougherty

8:30 - 9:30 Network at Old Chicago at 14th and Market.

See: http://www.oldchicago.com/denver-market-street


RECOMMENDATION ENGINES - ABSTRACT

Recommendation Engines (RE) are software tools and techniques providing item suggestions to a user. The massive growth and variety of information can often overwhelm, leading to poor decisions. While choice is good, more choice is not always better. REs have proved in recent years to be a valuable means for coping with the information overload problem.

In their simplest form, personalized recommendations are offered as ranked lists of items. In performing this ranking, REs try to predict what are the most suitable products or services for a user, based on their preferences and constraints. In order to complete this computational task, REs collect preferences from users, which are either explicitly expressed (e.g., as ratings for products) or are inferred by interpreting user actions. For instance, a RE may consider the navigation to a particular product page as an implicit sign of preference for the items shown on that page.

Amazon's RE for example relies on a basic formula (collaborative filtering) that suggests products to you based on your viewing history, your purchase history and which related products other customers bought.

BIO

Tom Rampley is a data scientist with a background in finance and psychology. He received his MBA from Indiana University’s Kelley School of Business in 2012, with concentrations in finance and business analytics. Since graduation, he has been working within the Viewer Measurement group at Dish Network LLC on customer segmentation models, the development of recommendation engines, and the implementation of big data IT platforms. He prefers R to SAS, Python to any other scripting language, and while trained as a frequentist currently considers himself Bayes-curious. Outside of work he is married with no kids (yet), a lifelong martial artist, and endlessly nostalgic for the days when he played lead guitar in his grad school rock band. This is his first Data Science meetup presentation.

ACCUMULO - SQRRL NOSQL DATABASE - ABSTRACT

Apache Accumulo is an open-source highly secure NoSQL database created in 2008 by the National Security Agency. It easily integrates with Hadoop, can securely handle massive amounts of structured and unstructured data - at scale cost-effectively - and enables users to move beyond traditional batch processing and conduct a wide variety of real-time analyses. Accumulo is a sorted, distributed key/value store based on Google's BigTable design. It is a system built on top of Hadoop, ZooKeeper and Thrift. Written in Java, Accumulo has cell-level access labels and a server-side programming mechanisms.

Accumulo offers "Cell-Level Security" - extending the BigTable data model, adding a new element to the key called "Column Visibility". This element stores a logical combination of security labels that must be satisfied at query time in order for the key and value to be returned as part of a user request. This allows data of varying security requirements to be stored in the same table, and allows users to see only those keys and values for which they are authorized.

Sqrrl Enterprise, developed by Sqrrl Data, is the operational data store for large amounts of structured and unstructured data. It is the only NoSQL solution that scales elastically to tens of petabytes of data and that has fine-grained security controls. Sqrrl Enterprise enables development of real-time applications on top of Big Data. Sqrrl uses HDFS for storage; Accumulo for security/speed of access; Thrift API for interactivity; and works with map/reduce, visualizations, third party software, and existing schema explored databases.

This presentation reviews Accumulo and Sqrrl Enterprise.

BIO

John Dougherty is CIO for Viriton, a consulting and systems integration organization. He is the organizer for Big Data for Business, helping to apply Big Data concepts to C-suite perspectives. He began utilizing applied strategies, using technology, in the early nineties, and has continued to incorporate blue blood technologies in forward thinking solutions.

Join or login to comment.

    • gretchen g.

      @5280BigData says thank you and awesome!

      May 22, 2013

  • Michael W.

    Livestream Link for Recommendation Engines & Accumulo: http://www.ustream.tv/channel/recommendationengines


    Starts at 6:15pm MST

    May 21, 2013

  • Jared W.

    planning to attend remotely

    May 21, 2013

  • Nancy A

    Looks like I will only be able to attend remotely.

    May 21, 2013

  • A former member
    A former member

    I'll be on the web

    May 20, 2013

  • Michael W.

    Livestream if unable to attend in person - register and we will email you a link to watch via livestream video 2 hours prior to start.

    Register @ http://bit.ly/17ZBjqC

    May 20, 2013

  • A former member
    A former member

    I would like to watch it remotely. Thanks!

    May 20, 2013

  • Emily

    Planning on attending via livestream.

    May 20, 2013

  • A former member
    A former member

    I will wait for the link to watch it remotelly. Regards from Buenos Aires

    May 16, 2013

  • Norman G.

    Machine learner, NLPer, interested in collaborative filtering and related techniques. Checking out the local scene.

    May 13, 2013

  • Michael M.

    Regretfully, I have a work conflict -- so sad to miss this one!

    May 10, 2013

  • A former member
    A former member

    I am former financial professional, data is my tools. Want to learn more.

    May 8, 2013

  • gretchen g.

    Will this be a webcast also?

    April 30, 2013

    • Michael W.

      Yes, for those who register. Yet meeting new folks and making new friends is fun.

      April 30, 2013

  • A former member
    A former member

    see you there - barring yet another unseasonal blizzard!

    April 29, 2013

  • Michael M.

    Thank you again for scheduling on a Tuesday

    April 29, 2013

Our Sponsors

People in this
Meetup are also in:

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy