Workshop plan:

• A small example of working with real data

• Usage of DoubleRDDFunctions

• Usage of PairRDDFunctions

• Understand joins in depth and how to use the broadcast mechanism to reduce the amount of data shuffled

• Understand the function/lambda serialization mechanism between the driver and the workers

• How to configure partitions

• More advanced usage of accumulators

Language: Polish