5-DAY WORKSHOP ON PySPARK – BIG DATA ANALYTICS
Details
Dear Data Science Aspirants,
Python programmers should upskill to PySpark and Big Data Analytics. Getting started with PySpark takes barely 5 hours. To give you that much-needed start, we are organizing a 5-day workshop starting Mon, 25th Jan at 9:00 PM.
If you know basic Python and want better opportunities, this is the course for you. 2016-20 was the wave of data analytics. Since 2019 a data boom has been under way, with huge volumes of data being generated everywhere. Managing that data is difficult without a big data tool. PySpark is a way into big data analytics for Python programmers.
This workshop is a stepping stone to big data analytics.
What will you get from the workshop?
- Understand the PySpark framework
- Learn the main differences between working in PySpark and working in Python
- Gain the confidence to do basic data manipulation steps in PySpark
Agenda:
Day 1: Getting Started with PySpark
Install and configure Spark; run our first PySpark code!
Day 2: Understand RDD in PySpark
RDD – Resilient Distributed Dataset
Day 3: RDD vs PySpark DataFrame vs Pandas DataFrame
PySpark SQL, create a PySpark DataFrame, define a schema
Day 4: Lazy Evaluation, Transformation and Action
Understanding how PySpark differs from Python in data processing
Day 5: Persisting Data in PySpark
Understanding the need for data persistence in PySpark and the different storage levels
Prerequisites:
- This is a beginner-level workshop. Prior knowledge of Python is helpful.
Note: To attend the workshop, you must register at the link below.
----- ----- ----- ----- ----- ----- ----- ----- ----- ----- -----
Register Here for Workshop: https://www.k2analytics.co.in/5-days-workshop-on-pyspark-big-data-analytics/
----- ----- ----- ----- ----- ----- ----- ----- ----- ----- -----
