Apache Spark Introduction and Resilient Distributed Dataset basics and deep dive


Details
We will give a detailed introduction to Apache Spark and why and how Spark can change the analytics world.
Apache Spark's memory abstraction is RDD (Resilient Distributed DataSet). One of the key reason why Apache Spark is so different is because of the introduction of RDD.
You cannot do anything in Apache Spark without knowing about RDDs.
We will give a high level introduction to RDD and in the second half we will have a deep dive into RDDs.
Speakers -
Paranth (Architect (STSM), Analytics Platform at IBM Labs)
LinkedIn - https://in.linkedin.com/pub/paranth-thiruvengadam/9/771/256
Satya (Architect at IBM Analytics)
LinkedIn - https://in.linkedin.com/in/patelsatya
Looking forward to have a great interactive session.

Sponsors
Apache Spark Introduction and Resilient Distributed Dataset basics and deep dive