Skip to content

Learn about Apache Spark and Big Data on Amazon Web Services (AWS)

public group
Learn about Apache Spark and Big Data on Amazon Web Services (AWS)

Details

Join us for a webinar on learning more about Apache Spark and analytics using Zeppelin on Amazon Web Services. Some of the topics we will cover are described below.

PLEASE NOTE: This is a webinar and you must register on the GotoWebinar Link

https://attendee.gotowebinar.com/register/2705812734257737729

Topics:

– Apache Spark and Big Data Ecosystem Overview
– Role of Spark with respect to Hadoop, AWS, EMR, and popular big data technologies
– Analytics and ETL with SparkSQL and DataFrame/Dataset APIs
– Basics of Spark Execution and Memory
– Visualizing Data with Zeppelin (and possibly Tableau, time permitting)
– Intro to Machine Learning with SparkML
– Intro to Spark Streaming
– Spark on YARN: Clustering and Operations within EMR
– Business Cases and Architecture Patterns with Spark

Technologies:

Some of the technologies we will talk about and demonstrate include:
– Amazon EMR clusters supporting Apache Spark 2.0, HDFS and/or EMRFS, Apache Zeppelin with support for at least Scala (Spark), PySpark, (Spark)SQL, sh, hdfs interpreters

Presenter:
Adam Breindel is a stackArmor Big Data Consultant focused on consulting and teaching Apache Spark. Adam’s experience includes work with banks on neural-net fraud detection, streaming analytics, cluster management code, and web apps, as well as development at a variety of startup and established companies in the travel, productivity, and entertainment industries. He is excited by the way that Spark and other modern big-data tech remove so many old obstacles to system design and make it possible to explore new categories of interesting, fun, hard problems.

Please register on the GotoWebinar Link

https://attendee.gotowebinar.com/register/2705812734257737729

Photo of Data DC group
Data DC
See more events
Online Webinar GotoWebinar
Online · Online, VA