Spark 2: Random Forests at Scale

Hosted By
Grimm

Details
What's better than random forests? Random forests at scale. We'll focus on the data engineering side of applying decision trees to a cluster. We'll walk through setting up a cluster on AWS, show you how to submit Spark jobs to it, and play with your local install of spark (with an emphasis on random forests).
Workshop led by Julio Barros.
Bring your laptop. We'll bring the pizza.

Portland Data User Group
See more events
New Relic
111 SW 5th Avenue, Suite 2900 · Portland, OR
Spark 2: Random Forests at Scale