SFPUG July: MADlib

Who: Hitoshi Harada, Greenplum

What: MADlib

Where: SwitchFly offices, San Francisco

Why: cool machine learning and analytics

When: Tuesday, July 10th, 7pm

Hosted by SwitchFly (formerly EZREZ).  Refreshments sponsored by Greenplum.

MADlib is an open-source library for scalable in-database analytics. It provides data-parallel implementations of mathematical, statistical and machine learning methods for structured and unstructured data.

The MADlib mission is to foster widespread development of scalable analytic skills, by harnessing efforts from commercial practice, academic research, and open-source development. The library consists of various analytics methods including linear regression, logistic regression, k-means clustering, decision tree, support vector machine and more. That's not all; there is also super-efficient user-defined data type for sparse vector with a number of arithmetic methods. It can be loaded and run in PostgreSQL 8.4 to 9.1 as well as Greenplum 4.0 to 4.2. This talk covers its concept overall with some introductions to the problems we are tackling and the solutions for them. It will also contain some topics around parallel data processing which is very hot in both of research and commercial area these days.

Join or login to comment.

  • Steve

    Interesting and informative presentation. Thanks!

    July 11, 2012

  • Don L.

    Well organized.

    July 11, 2012

  • Fazal M.

    The presenter spent far more time shilling EnterpriseDB than actually presenting MADlib.

    July 11, 2012

People in this
Meetup are also in:

Create a Meetup Group and meet new people

Get started Learn more
Allison

Meetup has allowed me to meet people I wouldn't have met naturally - they're totally different than me.

Allison, started Women's Adventure Travel

Start your Meetup today

Act now and get 50% off.
Until February 1.

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy