Introduction to Big Data with Apache Spark, Week 4
Details
edX has an awesome class on Spark:
https://courses.edx.org/courses/BerkeleyX/CS100.1x/1T2015/info
Week 4 lab exercise is to "perform text analysis and entity resolution on Google and Amazon product listings".
If you work on the lab exercise in advance, you'll benefit from the discussion:
