Skip to content

Storm - Real-time Big Data Stream Processing - WebMD

Photo of Prasad Sripathi
Hosted By
Prasad S.
Storm - Real-time Big Data Stream Processing - WebMD

Details

Storm is a distributed and high-performance real-time computation system used by Twitter, Yahoo, Spotify, WebMD. Storm is a top level Apache Project which brings brand, governance and large community of the Apache Software Foundation. Storm scales linearly, fault-tolerant, provides a reliable processing semantics, and is language agnostic (e.g. Java, Ruby, Python, Javascript, Perl)

AGENDA

In this talk, Eugene Dvorkin (@edvorkin (https://twitter.com/edvorkin)) , Architect @WebMD and NYC Storm User Group organizer will present Apache Storm framework:

  1. Why use Apache Storm?

  2. Common use cases

  3. Storm Architecture - Components, concepts, topology

  4. Building simple Storm topology with Java and Groovy

  5. Operation- fault tolerance, guaranteed msg delivery

  6. Running and monitoring Storm in production

  7. Trident - high-level abstraction on top of Storm

  8. QA

Speaker: Eugene Dvorkin

Eugene has more than 20 years of industry experience. He is Architect at WebMD, where he is leading efforts in creating scalable, Big Data solutions.He introduced Storm to WebMD technology stack and developed WebMD’s Storm topology to power Medscape Medpulse mobile application which allow medical professionals to follow important medical trends with Medscape's curated Today on Twitter feed and selection of blogs. He also contribute to many other Storm related projects. Outside of Storm his interests include distributed computing, machine learning, Hadoop ecosystem.

Photo of NJ Generative AI group
NJ Generative AI
See more events
NJ Big Data and Hadoop Meetup
3525 Quakerbridge Rd #1400 IBIS Office Plaza, Suite 1400 · Hamilton Township, NJ