Anomaly Detection presented by Ted Dunning

  • March 17, 2014 · 6:30 PM

The basic ideas of anomaly detection are simple. You build a model and you look for data points that don’t match that model. Building a practical anomaly detection system requires deal with practical details starting with algorithm selection, data flow architecture, anomaly alerting, user interfaces and visualizations. We will describe the major classes of anomaly detection systems and show how to build anomaly detection systems for:

a) rate shifts to determine when events such as web traffic, purchases or process progress beacons shift rate

b) topic spotting to determine when new topics appear in a content stream such as Twitter

c) network flow anomalies to determine when systems with defined inputs and outputs act strangely.

While describing how to solve these problems, we will describe how clustering, dimensionality reduction, and density estimation can be used in systems that adapt and learn about their environment and how these systems can tell you when something has changed.

This talk will reprise the content of my Strata presentation, but will include extra material that shows how compression equals truth and how anomaly detection can make databases faster among other sundry philosophical truths.

Ted Dunning - Chief Application Architect, MapR

Ted Dunning has been involved with a number of startups with the latest being MapR Technologies where he is Chief Application Architect working on advanced Hadoop-related technologies. He is also a PMC member for the Apache Zookeeper and Mahout projects. Opinionated about software and data-mining and passionate about open source, he is an active participant of Hadoop and related communities and loves helping projects get going with new technologies.


Join or login to comment.

  • Kenji M.

    Okay, here's a link to the ebook that doesn't show my contact info: http://info.mapr.com/resources_ebook_anewlook_anomalydetection.html?cid=blog

    June 7, 2014

  • Kenji M.

    Anyone know the github repository he mentioned in the talk?

    1 · March 19, 2014

  • Aman A.

    The EKG data used for Ted's example project is available on PhysioBank -- http://physionet.org/physiobank/database/#cv

    March 20, 2014

  • Jiunjiun M.

    Will this event be recorded or streamed? Is it possible to have the slide shared somehow? Thanks.

    March 17, 2014

  • Andy Z.

    Great!

    March 18, 2014

  • Jim S.

    Excellent talk.

    1 · March 18, 2014

  • Sree C

    I am going to join "Online Meeting"

    March 17, 2014

  • Sree C

    Is this going be live?

    1 · March 16, 2014

    • Pavel M.

      yes, see the info below

      1 · March 17, 2014

  • henry c.

    I'll have to pass now; something's come up. Will this be recorded for posterity?

    1 · March 16, 2014

  • Ziad H.

    Is the talk going to be recorded?

    2 · March 11, 2014

    • Pavel M.

      yes, I'll try my best to remember to hit the record button

      5 · March 12, 2014

    • Pavel M.

      Will attempt to stream (subject to 25 max connections). Otherwise should be able to record and post the link: Anomaly detection presented by Ted Dunning
      ------------------------­------------------------­------------------
      Meeting Number:[masked]

      Join Online Meeting:
      http://fuze.me/237696...­

      To join from a phone:
      Toll: [masked]

      If prompted, enter the meeting number[masked] followed
      by your Host PIN 4747, then press #.

      For the best integrated HD Audio and Video experience Download Fuze from https://www.fuzebox.co...­

      Need help? Contact Support at https://www.fuzebox.co...­

      March 17, 2014

  • auro

    This meetup coincides with the Bay Area useR (R Lang) March Meetup. I chose this one over the other one and have a good feeling I'll be proven right :-)

    1 · March 12, 2014

    • Paolo m.

      thank you

      March 16, 2014

    • Shayne H.

      Hacker Dojo has a machine learning group that meets - seems like every other Friday or so; the Data Science Journal meetup is there every two weeks; the ACM has a regular Big Data / ML meeting, the IEEE sometimes has stuff. Then there are a host of application-specific meetups (Hadoop at Yahoo, etc.), and of course there'll be stuff going on at Stanford as well.

      March 16, 2014

  • Paolo m.

    Hi just moved to the valey. Looking to meet data scientists interested in entrepreneurial activities ? Also if you d have suggestions for networking meetings that would me great

    March 15, 2014

  • Gary F.

    I seem to be late to the table this time. Looks like it's going to be a good presentation.

    1 · March 13, 2014

  • Frank C.

    Wow, right up my alley. Looking forward to attending the talk. Thanks!

    2 · February 28, 2014

Our Sponsors

People in this
Meetup are also in:

Create your own Meetup Group

Get started Learn more
Bill

I started the group because there wasn't any other type of group like this. I've met some great folks in the group who have become close friends and have also met some amazing business owners.

Bill, started New York City Gay Craft Beer Lovers

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy