Scalable Data Science in R and Spark Streaming


Details
Please also register at Azure Meetup Seattle Registration: https://www.microsoftevents.com/profile/form/index.cfm?PKformID=0x1021688d95f
---
We have a fun two-part session on Apache Spark 2.0 today: the first session focuses on Scalable Data Science in R, and the second covers the current state of Spark Streaming.
We will be at the new Seattle Microsoft Technology Center (MTC) in Lincoln Square. More information can be found at: https://www.microsoft.com/en-us/mtc/locations/seattle.aspx.
Agenda
6:00pm-6:30pm: Networking and some pizza
6:30pm-7:15pm: Scalable Data Science in R and Apache Spark 2.0
7:15pm-8:00pm: Introducing Structured Streaming
8:00pm-8:30pm: Finale
Scalable Data Science in R and Apache Spark 2.0
R is a very popular platform for Data Science. Apache Spark is a highly scalable data platform. How could we have the best of both worlds? In this talk we will walk through many examples of how several new features in Apache Spark 2.0.0 enable this. We will also look at exciting changes coming next in Apache Spark 2.0.1 and 2.1.0.
Speaker: Felix Cheung, Apache Spark Committer, Microsoft
Introducing Spark Structured Streaming
We have a fun session on Structured Streaming, part of Apache Spark 2.0. Whether you are a novice or have worked with Spark Streaming for quite some time, this demo-heavy session showcases Spark Streaming scenarios and lets you watch Spark Streaming running live.
Speaker: Jason Pohl, Data Solution Architect, Databricks