Skip to content

Apache Spark for BI & Data Science

Photo of Troy Wuttke
Hosted By
Troy W.
Apache Spark for BI & Data Science

Details

Apache Spark is a fast and general engine for large scale data processing. With up to 100x faster processing speeds than Hadoop MapReduce (in memory) and 10x faster from disk, it is fast becoming one of the hottest topics in the Big Data/Data Science world.

Please join us on an Apache Spark session to hear about the latest happenings with Apache Spark, how BI and Data Scientists are using it in real life and how it can benefit a lot of workloads that requires such a platform. We will also briefly cover Spark Streaming and show some use cases where people have used it successfully including Web Trends that were able to push 10 million messages/sec using Spark Streaming.

In this session we will be covering the following:

• Apache Spark Overview

• Whats new in Spark 1.6

• Apache Zeppelin

• Examples for BI and Data Scientists.

Our session will be led by Ned Shawa from Hortonworks (http://hortonworks.com/). Ned is a diversified data engineer with over 10 years of experience in software and hardware, working currently at Hortonworks as a Solution Engineer for Australia and New Zealand. Prior to Hortonworks Ned worked at EMC where he was the Isilon & Big Data Specialist for Australia and NewZealand. On top of his day duties, after hours Ned leads 2 major Apache Spark meetups in Melbourne & Sydney.

Thanks to sponsorship from Hortonworks, we will be meeting at 6pm for pizza and drinks, following by the talk from 6.30 - 7.30pm.

http://photos4.meetupstatic.com/photos/event/8/d/b/1/600_446016273.jpeg

Photo of Big Data Adelaide group
Big Data Adelaide
See more events