Skip to content

Noel Welsh - Making Big Data Small: Streaming Algorithms in Scala

Photo of Andy Hicks
Hosted By
Andy H.
Noel Welsh - Making Big Data Small: Streaming Algorithms in Scala

Details

IMPORTANT Sign up at skillsmatter:

http://skillsmatter.com/podcast/scala/making-big-data-small-streaming-algorithms-in-scala

Making Big Data Small: Streaming Algorithms in Scala

http://photos1.meetupstatic.com/photos/event/1/4/c/event_148560332.jpeg

In this talk I'll discuss a class of algorithms,
streaming algorithms, that allow real-time processing of data, scale
extraordinarily well, and are simple to implement. Streaming
algorithms were developed to handle data capacities that exceed our
ability to store them (e.g. the Large Hadron Collider) but turn out to
be a great fit for small teams that want to move quickly on data
analysis projects.

In this talk I'll describe the Bloom filter and Count-Min sketch for
counting item occurrence and heavy hitter and quantile algorithms for
estimating frequency information. If there is time and interest I
might discuss lock-free stochastic gradient descent for learning
classifiers and recommendation systems

Noel has over fifteen years experience in software architecture
and development, and over a decade in machine learning and data
mining. Examples of the projects he’s been involved with include one
of the first commercial products to apply machine learning to the
Internet (eventually acquired by Omniture), a BAFTA award winning
website, and a custom CMS used daily by thousands of students. His
latest ventures are Scala consultancy at Underscore
(underscoreconsulting.com (http://underscoreconsulting.com/)) and data analysis and machine learning at
Myna (mynaweb.com (http://mynaweb.com/))

Noel is an active writer, presenter, and open source contributor. Noel
has a PhD in machine learning from the University of Birmingham.

We will, as always, also be heading to the Slaughtered Lamb (http://www.theslaughteredlambpub.com/) pub afterwards.

**IMPORTANT READ ME TO REGISTER **

Skills Matter are hosting this event and are handling the attendance it is essential that you confirm your place at this link:

http://skillsmatter.com/podcast/scala/making-big-data-small-streaming-algorithms-in-scala

failure to do so may result in not obtaining a seat. Please register on the Meetup.com "I'm going" to only let the others in the group know your going.

If this is your first time to SkillsMatter, directions are: http://skillsmatter.com/go/find-us

Photo of London Scala User Group group
London Scala User Group
See more events
The Skills Matter eXchange
116-120 Goswell Road, EC1V 7DP · London