
Bay Area Hadoop User Group (HUG) March Meetup

Hello Hadoopers, RSVPs are now open for the March Bay Area Hadoop User Group at Yahoo!'s Sunnyvale campus. Please note that the location has changed:
Building C, Second Floor, Classroom 5. It's on the same campus; just cross the street and walk past Building D to Building C.


  • 6:00 - 6:20 - Socializing and Beers

  • 6:20 - 6:50 - Preview to the Hadoop Security Release (Owen O'Malley, Yahoo!)

  • 6:50 - 7:20 - MapReduce Online (Tyson Condie, University of California, Berkeley)

  • 7:20 - 7:50 - High level distributed programming with Clojure, Cascading, and Hadoop (Bradford Cross, Flightcaster)

  • Q&A and Open Discussion

Session details are available below. Looking forward to seeing you there!
Dekel
MapReduce Online

MapReduce is a popular framework for data-intensive distributed computing of batch jobs. To simplify fault tolerance, the output of each MapReduce task and job is materialized to disk before it is consumed. In this talk, I will describe a modified MapReduce architecture that allows data to be pipelined between operators. This extends the MapReduce programming model beyond batch processing, and can reduce completion times and improve system utilization for batch jobs as well. The Hadoop Online Prototype (HOP) is our modified version of the Hadoop MapReduce framework with pipelining support. It enables online aggregation, which allows users to see "early returns" from a job as it is being computed. HOP also supports continuous queries, which enable MapReduce programs to be written for applications such as event monitoring and stream processing. HOP retains the fault tolerance properties of Hadoop, and can run unmodified user-defined MapReduce programs in both pipelined and traditional blocking modes.

Bio: Tyson Condie is a Ph.D. student at the University of California, Berkeley, whose research focuses on data management and distributed systems. He has been advised by Prof. Joseph M. Hellerstein since entering the Berkeley Ph.D. program in 2004. His thesis at Berkeley focuses on designing and developing distributed system software in a high-level declarative language. Prior to Berkeley graduate school he was at Stanford University, where he earned a Master's degree in Computer Science under Prof. Hector Garcia-Molina. His industry experience includes research internship positions at Intel and Yahoo! as well as full-time development positions at Sybase and Oracle.
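
For readers new to the idea, the online aggregation described in the MapReduce Online abstract can be illustrated with a toy, single-process sketch. This is not HOP code and does not use any Hadoop API: the function names `map_words` and `reduce_with_snapshots` and the `snapshot_every` parameter are hypothetical, chosen only for this illustration. The map step hands records to the reduce step as they are produced (here via a Python generator) rather than materializing them first, and the reduce step periodically reports its running totals as "early returns".

```python
from collections import Counter
from typing import Dict, Iterable, Iterator, Tuple


def map_words(lines: Iterable[str]) -> Iterator[Tuple[str, int]]:
    """Map step: emit (word, 1) pairs one at a time, without buffering them all."""
    for line in lines:
        for word in line.split():
            yield word.lower(), 1


def reduce_with_snapshots(
    pairs: Iterable[Tuple[str, int]], snapshot_every: int
) -> Iterator[Tuple[int, Dict[str, int], bool]]:
    """Reduce step: consume pairs as they arrive and yield running totals.

    Yields (pairs_seen, counts_so_far, is_final). Intermediate snapshots are
    the "early returns"; the last item yielded is the final answer.
    """
    counts: Counter = Counter()
    seen = 0
    for key, value in pairs:
        counts[key] += value
        seen += 1
        if seen % snapshot_every == 0:
            # Early return: a copy of the partial aggregate computed so far.
            yield seen, dict(counts), False
    # Final result once the input is exhausted.
    yield seen, dict(counts), True


if __name__ == "__main__":
    lines = ["hadoop online prototype", "hadoop pipelines map output", "online aggregation"]
    for seen, counts, is_final in reduce_with_snapshots(map_words(lines), snapshot_every=4):
        label = "final" if is_final else "early"
        print(f"[{label}] after {seen} pairs: {counts}")
```

A real pipelined system also has to handle partitioning across machines and fault tolerance, which this sketch deliberately leaves out.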
High level distributed programming with Clojure, Cascading, and Hadoop

Presenter: Bradford Cross

Flightcaster built a scalable machine learning system in Clojure wrapping Cascading and Hadoop. The infrastructure that wraps Cascading/Hadoop and its configuration/deployment to EC2 clusters is all written in Clojure. Come and see how much simpler and more fun your life can be.

  • A former member

    Smart Speakers. Interesting topics.

    March 26, 2010

  • Kumar S.

    Presentation and demo were good. Food and drinks were also good.

    March 25, 2010

  • Arul

    Today's meetup was really informative, with a very good presentation on HOP. Keep it coming!!

    March 25, 2010

  • KC L.

    The meetings are larger, food is getting better, lots of good beer, sodas, smart dedicated hardworking people at the meetings, and the presentations are truly informative and addictive.

    March 24, 2010

  • KC L.

    Awesome meeting and presentations.

    March 24, 2010

  • A former member

    Very detailed and to the point.

    March 24, 2010

  • Dennis S.

    Very informative, useful practice, and nice setting.

    March 24, 2010

  • A former member

    The security and performance presentations were excellent. I enjoyed the climate and the chance to meet and talk with other Hadoop enthusiasts. Great job organizing and putting these events together. I sincerely appreciate it. Dave J.

    March 24, 2010

  • Sambit M.

    Well organized and interesting enough to keep the audience engaged after work!!

    March 24, 2010

  • A former member

    It was good to see security enhancements. Streaming and flight delay predictor were very cool and interesting.

    March 24, 2010

  • A former member

    It really is a Hadoop USER group, not a beginner group.

    March 24, 2010

  • KC L.

    Over 264 attending. Wow, Hadoop is getting hotter each month!

    March 23, 2010

Our Sponsors

  • Yahoo

    Free admission, Space, Pizza and Beer
