Create powerful parallel processing solutions with Databricks and AWS Kinesis


Details
Session Details:
In this talk, the Amazon Kinesis team will demonstrate how to set up a Kinesis stream from scratch, followed by Databricks provisioning a live Spark analytics platform on top of the streaming data. The combined team will discuss best practices around live data ingestion and a variety of industry-specific solutions where a combination of Kinesis and Spark can create powerful parallel processing solutions.
After this, we will have a real world customer case study putting these components together in the session
Powering Work Collaboration Analytics at Smartsheet with Kinesis, Spark, Lambda, and Redshift
- Overview of our Use Cases and Supporting Architecture
- Demo
- Key Learnings
----
Fill this Seattle Spark Meetup survey (https://docs.google.com/a/databricks.com/forms/d/15bnMRhdEbzkiunJyv5e9Jk3-A7aZkVWChc7EsHClrdE/viewform) and have a chance to win a copy of "Learning Spark"
----
Background:
Amazon Kinesis is a fully-managed service designed for real-time data ingestion and processing. The service can scale elastically, capturing terabytes of data each hour that can be analyzed in real-time. This solution pairs well with cloud-based big data technologies like Databricks, which provides an integrated workspace for creating and managing Apache Spark analytics clusters. Databricks' vision is to dramatically simplify big data processing. It was founded by the team that created and continues to drive Apache Spark, a powerful open source data processing engine built for sophisticated analytics, ease of use, and speed. Databricks offers a cloud platform that makes it easy to turn data into value, from ingest to production, without the hassle of managing complex infrastructure, systems and tools. Logistics:
Doors open at 5:30 PM for Happy Hour drinks and food. Tech talk starts at 6:00 PM.
Speakers:
• Aditya Krishnan, Principal Product Manager, Amazon Kinesis
• Denny Lee, Technology Evangelist, Databricks
• Francis Lau, Senior Director of Product Intelligence, Smartsheet
Note, this is a joint AWS Seattle (https://www.meetup.com/AWS-Seattle-OfficialEvents/events/225017732/?a=ea1_grp&rv=ea1) and Seattle Spark Meetup event

Create powerful parallel processing solutions with Databricks and AWS Kinesis