
Oct 22: Hands On MapReduce and Spark Programming by Roger Ding

Hosted By
Gray H.

Details

Title: Hands On MapReduce and Spark Programming by Roger Ding

When: Oct 22, 6 - 9 PM (Presentation begins at 7 PM)

Getting In: The meetup is in the community room in the basement. You will be directed to it from the guard desk. Please bring an ID. If you have trouble, please call Gray at 703 727-1307.

Food: We'll have pizza and sodas.

Sponsor: Thank you Sotera Defense Solutions for hosting this meetup for us and providing the food and drinks.

Hadoop/Spark Conference: DevIgnition - Elephant Talk (https://www.eventbrite.com/e/dec-5-devignition-2014-elephant-talk-tickets-13451262087) is coming up on Dec 5th. This year, we will focus on Hadoop and Spark talks. Registration is open and tickets are going fast, and our Call for Speakers (https://docs.google.com/forms/d/1IFM3gcFohpM0Vxi7XjAtBfDwRHDt5LXj1Dbbnhsbdn0/viewform) is still open for a couple more weeks.

Meetup Description: An overview of some simple jobs solved with MapReduce (in Java), and then solved again using Spark (in Scala). The second half of the meetup will be hands-on, devoted to walking people through the MapReduce/Spark examples on their own laptops. People can work in groups.

If time permits, we will also demonstrate solving the same problem with Hive (SQL) and Impala (SQL), giving people an overview of how to solve problems with several different tools in the Hadoop ecosystem (MapReduce, Spark, Hive, Impala).
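For anyone who wants a feel for the map-then-reduce pattern before the session, here is a minimal sketch in plain Java. It makes several assumptions. The announcement does not say which jobs will be covered, so word count is used only as a familiar stand-in. It uses Java 8 streams rather than the Hadoop MapReduce API, so it runs without a cluster. The class name WordCountSketch and its count method are made up for illustration and are not taken from the speaker's projects.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.stream.Collectors;

// Hypothetical illustration only: not the code used in the meetup demos.
public class WordCountSketch {

    // "Map" step: split every line into lowercase words.
    // "Reduce" step: group identical words and count how often each occurs.
    static Map<String, Long> count(List<String> lines) {
        return lines.stream()
                .flatMap(line -> Arrays.stream(line.toLowerCase().split("\\s+")))
                .filter(word -> !word.isEmpty())
                // TreeMap keeps the words sorted, similar to sorted reducer keys.
                .collect(Collectors.groupingBy(word -> word, TreeMap::new, Collectors.counting()));
    }

    public static void main(String[] args) {
        List<String> lines = Arrays.asList("spark and hadoop", "hadoop and hive");
        // Print one "word<TAB>count" pair per line.
        count(lines).forEach((word, n) -> System.out.println(word + "\t" + n));
    }
}
```

In an actual MapReduce job the two steps would be separate Mapper and Reducer classes with a shuffle in between, and in Spark they would be transformations on a distributed dataset. The sketch only shows the shape of the computation on a single machine.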

Prepare the development environment:
(1) Download the Cloudera QuickStart VM for CDH 5.2.x, which has everything we need for our demos: http://www.cloudera.com/content/cloudera/en/downloads/quickstart_vms/cdh-5-2-x.html
(2) (Optional) Install your preferred IDE. IntelliJ will be used in the demo.
(3) Get a copy of the source code from https://github.com/rogerding/ . There are 2 projects there: one for MapReduce and one for Spark.

If you want to participate in the exercises, please have this set up before the meetup.

Speaker: Roger Ding is a Solutions Consultant at Cloudera, where he loves working with distributed computing technologies. He worked as a software engineer for 15 years before joining Cloudera. In his spare time, he enjoys hiking. Roger is also a long-time active member of our user group!

DCJUG/Frontrunners
Sotera Defense Solutions
1501 Farm Credit Dr Ste 2300 · McLean, VA