Skip to content

Apache Spark Introduction and Resilient Distributed Dataset basics and deep dive

Photo of IBM Big Data
Hosted By
IBM Big D. and 2 others
Apache Spark Introduction and Resilient Distributed Dataset basics and deep dive

Details

We will give a detailed introduction to Apache Spark and why and how Spark can change the analytics world.

Apache Spark's memory abstraction is RDD (Resilient Distributed DataSet). One of the key reason why Apache Spark is so different is because of the introduction of RDD.

You cannot do anything in Apache Spark without knowing about RDDs.

We will give a high level introduction to RDD and in the second half we will have a deep dive into RDDs.

Speakers -

Paranth (Architect (STSM), Analytics Platform at IBM Labs)

LinkedIn - https://in.linkedin.com/pub/paranth-thiruvengadam/9/771/256

Satya (Architect at IBM Analytics)

LinkedIn - https://in.linkedin.com/in/patelsatya

Looking forward to have a great interactive session.

Photo of IBM AI Developer Accelerator - Bengaluru Chapter group
IBM AI Developer Accelerator - Bengaluru Chapter
See more events
IBM A Block,
Ground Floor cafeteria, EGL,Indiranagar-Kormanagala, Intermediate Ring Road, EGL,Indiranagar-Kormanagala, Intermediate Ring Road · Bangalore