Details

What's better than random forests? Random forests at scale. We'll focus on the data engineering side of running decision trees on a cluster. We'll walk through setting up a cluster on AWS, show you how to submit Spark jobs to it, and experiment with your local install of Spark (with an emphasis on random forests).

Workshop led by Julio Barros.

Bring your laptop. We'll bring the pizza.
