Practical On-line Approximation Algorithms in Storm with Ted Dunning


Details
Dear friends,
This time we will have Ted Dunning, Chief Application Architect at MapR, speaking about "Practical On-line Approximation Algorithms in Storm."
There are lots of things that are easy to compute if you have all the data in hand, but hard to compute exactly in an on-line fashion with limited memory. Examples include count(distinct), percentiles and heavy hitters (top-40). On the other hand, Storm really prefers that we compute in an on-line fashion with limited memory. Happily, there are approximate algorithms for these quantities that are relatively easy to use, have tunable accuracy, and are fast. Unfortunately, these algorithms are not widely known. I will describe several of these algorithms and how they can be used in Storm.
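To give a flavor of the heavy-hitters problem mentioned above, here is a minimal sketch of the Misra-Gries summary in Python. It is an illustration chosen for this announcement and not necessarily one of the algorithms covered in the talk; the function name misra_gries and the parameter k are made up for the example. The summary keeps at most k - 1 counters, so memory stays bounded however long the stream is. Any item that occurs more than n/k times in a stream of n items is guaranteed to be among the survivors, and each surviving count underestimates the true count by at most n/k.

```python
def misra_gries(stream, k):
    """Return candidate heavy hitters using at most k - 1 counters.

    Illustrative sketch only: `misra_gries` and `k` are hypothetical names,
    not part of Storm or any other library.
    """
    counters = {}
    for item in stream:
        if item in counters:
            # Item is already tracked: count it.
            counters[item] += 1
        elif len(counters) < k - 1:
            # There is a free counter: start tracking the item.
            counters[item] = 1
        else:
            # No free counter: decrement every counter and drop the zeros.
            for key in list(counters):
                counters[key] -= 1
                if counters[key] == 0:
                    del counters[key]
    return counters


# Small usage example: "a" makes up half of the stream.
stream = ["a", "b", "a", "c", "a", "b", "a", "d", "a", "b"]
candidates = misra_gries(stream, k=3)
print(candidates)
```

Raising k tightens the error bound at the cost of more counters, which is the kind of tunable accuracy the abstract refers to. In a streaming setting the loop body would run once per incoming item, with the counters kept as operator state.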
We will also have a welcome note from Geir Magnusson, AppNexus CTO.
Agenda:
6pm: Guest arrival & networking
6.30pm: Onstage welcome from Geir Magnusson, CTO, AppNexus (approx. 5 mins)
6.35pm: Ted Dunning - Chief Application Architect, MapR
7.45pm: Networking reception
8.30pm: Event ends
(Please note start time: 6:00 PM, not usual 6:30 PM.)
http://photos2.meetupstatic.com/photos/event/9/8/8/2/600_404739042.jpeg
From our sponsor, O'Reilly Publishing: you will get 20% off Strata 2014 registration with code UGNYCSUG20 and this link: http://www.anrdoezrs.net/click-3868817-11828964
About the Speakers:
Ted Dunning - Chief Application Architect, MapR
http://photos1.meetupstatic.com/photos/event/d/6/4/4/600_410454852.jpeg
Ted is Chief Application Architect at MapR and has held Chief Scientist positions at Veoh Networks, ID Analytics and MusicMatch (now Yahoo Music). Ted is responsible for building the world's most advanced identity theft detection system, as well as one of the largest peer-assisted video distribution systems and ground-breaking music and video recommendation systems. Ted has 24 issued and numerous pending patents and contributes to Apache Mahout, ZooKeeper and Drill™. He is also a mentor for Apache Spark, Storm, DataFu and Stratosphere. Ted has spoken at numerous conferences throughout the world.
Geir Magnusson, CTO and SVP, Engineering, AppNexus
http://photos3.meetupstatic.com/photos/event/d/9/a/a/600_410455722.jpeg
Geir Magnusson, Jr. is Chief Technology Officer and SVP, Engineering, AppNexus, responsible for technology strategy and product delivery, as well as driving the evolution of the company’s product architecture.
Geir was previously Vice President of Engineering at AppNexus, leading the development of real-time auction, decisioning and data systems as well as the company’s mobile technologies. Geir has served as a technical executive and leader for companies including Function(x) – now Viggle – Gilt Groupe, 10gen, Joost, Adeptra, Bloomberg, and Intel, and has built systems and solutions for industries ranging from financial markets to fraud contact to digital audio to mobile consumer.
He also has broad experience in open source, having founded several significant open source projects, such as Apache Geronimo, Apache Harmony and Apache Velocity. Geir is a member of the Apache Software Foundation, has represented the Foundation as a member of the Executive Committee of the Java Community Process, and is a past member of the Board of Directors.
He is also an international speaker on open source and software technology. Geir holds degrees in Physics and Electrical and Computer Engineering from Johns Hopkins University.
Please share this meetup on Twitter, LinkedIn, and Facebook.
This event is sponsored by MapR, AppNexus and O'Reilly:
http://photos4.meetupstatic.com/photos/event/c/8/8/e/600_411111342.jpeg
http://photos3.meetupstatic.com/photos/event/c/a/2/8/600_411111752.jpeg