Skip to content

Bright Spark (Machine Learning) + Oracle Big Data Discovery

C
Hosted By
Carlo P. and Andrew R.
Bright Spark (Machine Learning) + Oracle Big Data Discovery

Details

Hello!

You are invited to attend the second Spark User’s Group meetup (sponsored by Intel and Oracle), we will be hosting the event this month with two presentations confirmed:

Session 1: “Bright Spark - Using Apache Spark for Data Science”

Prensenter: Ian Hansel - Data@Ogilvy

Ian is a data scientist working at Data@Ogilvy. He has previously worked in the areas of fraud detection, insurance and web analytics. He is focused on applying analytics to business problems in a way that is simple, easy to understand and generates real value.

Abstract: Most of the discussion about Spark for data science has been around its ability to keep data resident in memory, which can speed up iterative machine learning workloads compared to MapReduce. However there are other advantages such as; the increasing capabilities of the built-in machine learning library MLlib, it's interactive shell for exploratory analysis, and the newly introduced pipelines for defining data workflows. This talk will be a practical based tutorial of using Spark Mllib. This will cover:

  • Setting up a spark cluster on AWS
  • Reading in data and processing it
  • Running machine learning algorithms with MLlib through the Python API
  • Using pipelines to streamline the process

Session 2: Oracle Big Data Discovery - The Visual Face of Hadoop

Presenter: Craig Han - Oracle

Craig is the APAC Solutions Consulting Leader for Business Analytics and Data Discovery at Oracle. His role involves working with customers to maximise the value of data by deriving new and creative ways of using the data, regardless of it being structured or unstructured, originating from internal or external to their organisation.

Abstract: Today's Big Data challenge is not how to store it, but how to make sense of it. Oracle Big Data Discovery (a new Oracle product that leverage Apache Spark), is a fundamentally new approach to making sense of Big Data, empowering organisations to quickly see and understand the potential of raw data in Hadoop, easily transform the data to make it better, and intuitively discover and share new value - all within a single visual product. Oracle Big Data Discovery offers tremendous speed at massive scale, streamlining Big Data analytics to unlock new value for everyone.

PLEASE NOTE: Limited availability.. please RSVP only if you are sure you can attend.

Photo of Sydney Apache Spark User Group group
Sydney Apache Spark User Group
See more events
York Conference and Function Centre
Level 2 99 York St, Sydney NSW 2000 · Sydney