Self-Hosted fun with the Kubernetes Operator for Apache Spark
Details
Managed services such as Databricks are a great way for organisations to
get started with Spark. However, in some situations they are not
appropriate, or not even available.
Self-hosting Spark can be a challenging undertaking, but the Kubernetes
Operator for Apache Spark can help to simplify it.
This talk will describe how we approached deploying the Spark Operator
on a real-life project, and discuss some of the trade-offs and benefits
we discovered.
Gavin Campbell Bio:
Gavin Campbell is a DevOps consultant with a background in Data and
Analytics. He occasionally writes things down at https://gavincampbell.dev.
Microsoft Azure
Apache Spark
Microsoft
