
Introduction to Spark for Data Engineers, Data Scientists and Developers

Hosted by Nancy B.

Details

SEATS ARE LIMITED: YOU MUST RESERVE YOUR SEAT HERE: http://bit.ly/2rYXBYc

IBM is offering a free, all-day, hands-on lab on Apache Spark for clients and practitioners. This is a full day of education on Spark, with hands-on exercises taught in person by Spark experts. The POT will provide a detailed overview of Apache Spark. The exercises will be performed in Jupyter notebooks with publicly available datasets. Participants will use IBM’s free, fully managed cloud platform, available for educational purposes.

Who should go:
"Anyone interested in learning more about Apache Spark."

Prerequisite:
"A working knowledge of coding (preferably Python and/or Scala) and an understanding of distributed computing, Spark, and SQL."

Please sign up for free accounts on Bluemix (www.bluemix.net) and DSX (http://datascience.ibm.com).

***** You must bring your own laptop *****

What to expect:
"Expect a full day of lectures and hands-on exercises tackling real-world data challenges using Apache Spark. In 8 hours you will learn the essentials of Apache Spark and why it is important to your organization. This workshop will focus on data wrangling and machine learning."

Full Day Agenda:

8:30 am – 9:00 am Breakfast, Socializing

9:00 am – 10:00 am Kickoff, Apache Spark Overview

10:00 am – 11:00 am Lab 1, Hello Spark – Hands-on exercise

11:00 am – 12:00 pm Apache Spark SQL Overview

12:00 pm – 1:00 pm Lunch

1:00 pm – 2:00 pm Lab 2, Spark SQL – Hands-on exercises

2:00 pm – 3:00 pm Overview of Data Science & Machine Learning w/ Apache Spark

3:00 pm – 4:00 pm Lab 3, Machine Learning w/ Spark – Hands-on exercises

4:00 pm – 4:30 pm Wrap up – Feedback from attendees


Data, Cloud and AI in Raleigh