Skip to content

Details

Dear Data Science Aspirants,

Python Programmers should upskill to PySpark & Big Data Analytics. Getting started with PySpark hardly takes 5 hours. To give you the much need start we are organizing a 5 Days Workshop starting from Mon, 25th Jan at 9:00 PM.

If you know basic python and have to get better opportunities, this is the course you have to do. 2016-20 was the wave of data analytics. From 2019 data boom is happening and everywhere huge data is generating. Managing that data is a difficult task without a big data tool. PySpark is a way to get into big data analytics for python programmers.

This workshop is a stepping stone to big data analytics.
What you will get from the workshop?

  1. Understand the PySpark framework
  2. Main differences of working in PySpark compared to Python
  3. Will have the confidence to do basic data manipulation steps in PySpark

Agenda:

Day 1: – Getting Started with PySpark
Install & Configure Spark, Run our first PySpark code!

Day 2: – Understand RDD in PySpark
RDD – Resilient Distributed Dataset,

Day 3: – RDD vs PySpark Dataframe vs Pandas Dataframe
PySpark SQL, Create PySpark Dataframe, Define Schema

Day 4: – Lazy Evaluation, Transformation and Action
Understanding how PySpark is different from Python in Data Processing

Day 5: – Persisting Data in PySpark
Understanding the need of data persistence in PySpark and different Storage Levels

Prerequisites:

  1. This is a beginner level workshop. Prior knowledge of python is good.

Note: For attending the workshop you have to register on the given link.

----- ----- ----- ----- ----- ----- ----- ----- ----- ----- -----
Register Here for Workshop: https://www.k2analytics.co.in/5-days-workshop-on-pyspark-big-data-analytics/
----- ----- ----- ----- ----- ----- ----- ----- ----- ----- -----

You may also like