Solving Analytics Problems in the Cloud w/ Spark, Presto, Hive


Details
Bio:
Sam is a software technology and digital marketing professional with experience in a variety of verticals. He has been with Qubole since March 2016 and previously worked on several analytics and big data projects for DEKA, Isobar, Viviaki (Digitas), and other companies.
Content:
The presentation will review real customer use cases detailing how using the cloud for big data analytics addresses a number of challenges inherent in traditional solutions. There will be a demonstration of an array of big data technologies, such as Spark, Presto, and Hive, all available via a single platform. We will work with publicly available data sets, and attendees can create accounts in real time for additional exploration.
There will also be a detailed review of the technical elements of using big data in the cloud, such as:
- Spark Tuning
- Design and Performance of Cloud Infrastructure
- Auto-Scaling and Spot Instances
- Bootstrapping and Unified Metadata
- Notebooks
