
Details

Hello,

This will be the second in a series of study sessions for both edX MOOCs (Massive Open Online Courses) on Apache Spark. This session is led by Dan Serban (https://www.meetup.com/Bucharest-Big-Data-Meetup/members/37021982/) (thank you, Dan).

Dan's description:

You can find the edX MOOCs here:

Intro to big data: https://www.edx.org/course/introduction-big-data-apache-spark-uc-berkeleyx-cs100-1x
Scalable Machine Learning: https://www.edx.org/course/scalable-machine-learning-uc-berkeleyx-cs190-1x

Our focus points for this study session will be lab 4 of CS100.1x ("Applying Machine Learning to Movie Recommendations using Apache Spark") as well as lab 1 of CS190.1x (the NumPy-related one).

To make it easier for us to help each other, please make sure that any issues you have with the labs can be reproduced in the PySpark shell, and that we can visually inspect samples from all intermediate RDDs in the lineage. This is because the IPython notebook environment is somewhat inadequate for debugging purposes.
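
As a rough illustration of what "inspecting samples from all intermediate RDDs" could look like, here is a minimal sketch. The helper names (sample_stages, print_stages) and the sample data in the comments are made up for this example and are not part of the labs; the only Spark call relied on is the RDD method take(n).

```python
def sample_stages(stages, n=3):
    """Return {name: first n elements} for each (name, rdd) pair.

    `stages` is a list of (name, rdd) tuples, listed in lineage order.
    Hypothetical helper, not part of the course labs.
    """
    samples = {}
    for name, rdd in stages:
        # take(n) is an RDD action: it brings at most n elements back to the driver
        samples[name] = rdd.take(n)
    return samples


def print_stages(stages, n=3):
    """Print a small sample of every stage so each step can be checked by eye."""
    for name, rows in sample_stages(stages, n).items():
        print(name, "->", rows)


# Example use inside the PySpark shell, where `sc` is already defined
# (the input lines below are invented sample data):
#   raw = sc.parallelize(["1::Toy Story", "2::Jumanji"])
#   parsed = raw.map(lambda line: line.split("::"))
#   print_stages([("raw", raw), ("parsed", parsed)])
```

Keeping every intermediate RDD in its own named variable, rather than chaining all transformations in one expression, is what makes this kind of step-by-step inspection possible.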

Looking forward to seeing you at this study session,

Valentina

Sponsors

eSolutions
Unlock the power of your data with customized digital solutions.
