Kubernetes Day: Running Apache Spark (Google), Apache Pulsar & Heron (Streamlio)

Hosted By
Arivoli T.

Details

TECH TALK #1: Running Apache Spark on Kubernetes
Speaker Bio: Yinan Li, Software Engineer on the Kubernetes team at Google
LinkedIn: https://www.linkedin.com/in/yinan-li-91a3b214/

Abstract:
Kubernetes becomes a native scheduler backend in Spark 2.3, allowing Spark applications to run natively on Kubernetes clusters and share those clusters with other types of workloads. This talk gives an overview of Kubernetes and a deep dive into the technical details of the Kubernetes scheduler backend, followed by a demo.

Yinan works on Spark on Kubernetes and is Google's largest contributor to upstream Apache Spark.

TECH TALK #2: Building Data Pipelines on Kubernetes using Apache Pulsar and Heron
Speaker Bio: Karthik Ramasamy, Co-Founder, Streamlio
LinkedIn: https://www.linkedin.com/in/kramasamy/

Abstract:
Enterprises are increasingly building applications that take advantage of real-time workloads, and Kubernetes is fast becoming the de facto scheduler and orchestration system for distributed systems. In this session, Karthik Ramasamy will show how easy it is to deploy Apache Pulsar for queuing and streaming and Twitter Heron for stream processing on Kubernetes, and how end-to-end data pipelines are built with them. Pulsar and Heron are at the core of Streamlio's end-to-end real-time platform, which is ideally suited to enterprises building next-generation real-time applications.

Data Riders
Hacker Dojo
3350 Thomas Road · Santa Clara, CA