Details

The purpose of these classes is to train Java programmers to contribute first to Apache Bigtop (incubating) and then to other Hadoop ecosystem components. Bigtop is a software framework, open sourced by Cloudera, that is used to build, deploy and validate Hadoop distributions: it packages a big data stack, currently consisting of Hadoop, Hive, Flume, Sqoop, HBase and Mahout, into RPM and DEB packages. This is a good starter project if you are interested in getting hands-on programming experience with Hadoop without having to become a MapReduce or distributed computing expert first. So far we have covered installation, the Apache Jira ticket workflow, the Jenkins build systems for Hadoop/Cloudera, and creating and running system/integration tests against a pseudo-distributed cluster.

Lab 1: Installing Bigtop (complete).

Lab 2: Building Bigtop on VirtualBox or a Linux instance (complete).

Lab 3: Create a Hadoop integration test based on a simple MapReduce job and execute it via the Bigtop test execution framework.

See what progress you can make after installing Bigtop. Follow the directions in the README and debug any problems you run into.
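To give a feel for what the Lab 3 test checks, here is a minimal sketch in plain Java of the map and reduce logic of a word count job, plus the kind of assertion an integration test makes on the job output. It deliberately uses no Hadoop or Bigtop APIs: the class name `WordCountSketch` and its methods are hypothetical, and the real lab test would submit an actual MapReduce job to the pseudo-distributed cluster through the Bigtop test framework rather than run in memory like this.

```java
import java.util.AbstractMap;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical, in-memory illustration only; not Hadoop or Bigtop code.
class WordCountSketch {

    // Map phase: emit one (word, 1) pair per whitespace-separated token.
    static List<Map.Entry<String, Integer>> map(String line) {
        List<Map.Entry<String, Integer>> pairs = new ArrayList<>();
        for (String token : line.trim().toLowerCase().split("\\s+")) {
            if (!token.isEmpty()) {
                pairs.add(new AbstractMap.SimpleEntry<>(token, 1));
            }
        }
        return pairs;
    }

    // Reduce phase: sum the counts emitted for each word.
    static Map<String, Integer> reduce(List<Map.Entry<String, Integer>> pairs) {
        Map<String, Integer> totals = new TreeMap<>();
        for (Map.Entry<String, Integer> pair : pairs) {
            totals.merge(pair.getKey(), pair.getValue(), Integer::sum);
        }
        return totals;
    }

    // Stand-in for "run the job": map every input line, then reduce all pairs.
    static Map<String, Integer> run(List<String> lines) {
        List<Map.Entry<String, Integer>> all = new ArrayList<>();
        for (String line : lines) {
            all.addAll(map(line));
        }
        return reduce(all);
    }

    public static void main(String[] args) {
        // Known input with a known expected result, as an integration test would use.
        Map<String, Integer> counts = run(Arrays.asList("hadoop hive hadoop", "hive hbase"));

        Map<String, Integer> expected = new TreeMap<>();
        expected.put("hadoop", 2);
        expected.put("hive", 2);
        expected.put("hbase", 1);

        // The test passes only if the job output matches the expected counts.
        if (!counts.equals(expected)) {
            throw new AssertionError("unexpected counts: " + counts);
        }
        System.out.println(counts);
    }
}
```

The shape is the useful part: feed the job a small known input, collect its output, and compare against expected values. In the lab, the input and output would live in HDFS on the cluster rather than in Java collections.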

Biocurious membership is required for attendees on their second visit; the first visit is free, per the Biocurious policy for using the space. This is not a charge collected, in full or in part, by this meetup group or by any individual, contributor or participant in this group. Please join on the Biocurious website.

Member Links:

http://apachebigtop.pbworks.com/w/page/48434924/FrontPage

https://docs.google.com/document/d/1hl8GubUoWZvA-BdP_8veUlPglKp45zb-q0hlCZup4Ig/edit
