Past Meetup

Apache Spark - RDD advanced concepts

This Meetup is past

20 people went

Location image of event venue


Workshops plan :

• Small example of real data usage

• Usage of DoubleRDDFunctions

• Usage of PairRDDFunctions

• Understand joins in depth and how to use broadcast mechanism to reduce amount of data shuffled.

• Understand Function/Lambda serialization mechanism between driver and workers

• How to configure partitions

• More advanced usage of accumulators

---------------->>>> Exercises material ( <<<<------------------

Language : Polish