
What we’re about
This group is for anyone interested in Ray, a general-purpose framework for parallel and distributed Python for scaling machine learning (ML) and AI workloads and applications!
Out of the box Ray offers native-libraries as Ray AI Libraries for scaling and writing an end-to-end machine learning applications, supporting common ML libraries and frameworks.
If you want to scale your ML workloads, build an ML infrastructure, or deploy an ensemble of ML/AI models, you'll want to join this group to learn from the original creators of Ray and Ray community of global users, across many industries.
Share your experience using Ray or learn more about how it works!
If you want to give a talk about your journey with Ray, let us know. Ping me at jules@anyscale.com
Code of conduct
Our meetup is a harassment-free place for everyone, regardless of gender, gender identity and expression, age, sexual orientation, disability, physical appearance, body size, race, ethnicity, or religion (or lack thereof).
We do not tolerate harassment of meetup participants in any form. Sexual language and imagery are not appropriate for any meetup venue (both online or in-person), including talks, workshops, parties, happy hour, etc.
Harassment includes offensive verbal comments related to gender, gender identity and expression, age, sexual orientation, disability, physical appearance, body size, race, ethnicity, religion, technology choices, sexual images in public spaces, deliberate intimidation, stalking, following, harassing photography or recording, sustained disruption of online or in-person talks or other events, inappropriate physical contact, and unwelcome sexual attention.
Ray meetup participants asked to stop any harassing behavior are expected to comply immediately. Hence, any member violating these harassment rules may be expelled from the meetup at the discretion of the Ray meetup organizers.
(Partially based on meetup and conference conduct.)
The Ray Team
Upcoming events (1)
See all- Bay Area Ray Community MeetupNeeds location
Please join us for an evening of community technical talks from the users of Data and Ray community. We want to thank PingCAP for being our gracious host and facilitating the meetup!
👉 PLEASE RSVP/REGISTER FOR THIS MEETUP AT THIS LINK 👉 https://lu.ma/eb8tpu2f
Agenda
(The times are not strict; they may vary slightly.)- 5:30-6:00 pm: Networking, Snacks & Drinks
- 6:00 pm: Talk 1 (30-35 mins) : - How to build serverless database cloud service
- Q & A (10 mins)
- 6:45 pm: Talk 2 (30-35 mins) : Multi-Region/Cloud Ray Pipeline with Distributed Caching
- Q & A (10 mins)
- 7:20 pm: Talk 3 (30-35 mins): - Introduction to Ray for Distributed and ML/AI Applications in Python
Talk 1: How to build a serverless database cloud service
Abstract:Relational databases have long been the core component of application systems, and their reliability and performance are critical to the stability and availability of applications. Distributed SQL, as the evolution direction of the next-generation database, offers built-in features such as horizontal scaling and high availability.
In this talk, Li Shen will introduce the architecture and key technologies of the open-source distributed SQL database - TiDB, and how we utilize the capabilities provided by the Public Cloud to build a cloud-native Serverless database service.
Bio:
Li Shen is SVP and founding engineer of PingCAP, the company behind TiDB. He is a maintainer of several popular open-source projects including TiDB and TiKV, a distributed transactional key-value store and CNCF graduated project. Li has extensive experience in data infrastructure, software architecture design, and cloud computing.
Talk 2: Multi-Region/Cloud Ray Pipeline with Distributed Caching
Abstract: In some cases, the machine learning pipeline stages may be distributed across regions or clouds. Data preprocessing, model training, and inferencing are in different regions/clouds to leverage special resource types or services that exist in a particular cloud, and to reduce latency by placing inference near user-facing applications. Additionally, as GPUs remain scarce resources, it is getting more common to set up remote training clusters from where data resides. This multi-region/cloud scenario introduces challenges of losing data locality, resulting in latency and expensive data egress costs.
In this talk, Beinan Wang, Senior Staff Software Engineer from Alluxio, will discuss how Alluxio’s open-source distributed caching system integrates with Ray in the multi-region/cloud scenario:- The data locality challenges in the multi-region/cloud ML pipeline
- The stack of Ray+PyTorch+Alluxio to overcome these challenges, optimize model training performance, save on costs, and improve reliability
- The architecture and integration of Ray+PyTorch+Alluxio using POSIX or RESTful APIs
- ResNet and BERT benchmark results showing performance gains and cost savings analysis
- Real-world examples of how Zhihu, a top Q&A platform, leveraged Alluxio’s distributed caching and data management with Ray’s scalable distributed computing to optimize their multi-cloud model training performance
Bio: Beinan Wang
Dr. Beinan Wang is a Senior Staff Software Engineer at Alluxio and a TSC of PrestoDB. Prior to Alluxio, he was the Tech Lead of the Presto team at Twitter and he built large-scale distributed SQL systems for Twitter’s data platform. He has twelve-year of experience working on performance optimization, distributed caching, and volume data processing. He received his Ph.D. in computer engineering from Syracuse University on the symbolic model checking and runtime verification of distributed systems.
Talk 3: Introduction to Ray for ML/AI Applications in Python
Abstract: An introduction to Ray (https://www.ray.io/), the system for scaling your Python and machine learning applications from a laptop to a cluster. We'll start with a hands-on exploration of the core Ray API for distributed workloads, covering basic distributed Ray Core API patterns for scaling ML workloads:- Remote Python functions as tasks
- Remote objects as futures
- Remote Python classes as stateful actors
- Multi-model training with Ray Core APIs patterns
👉 PLEASE RSVP/REGISTER FOR THIS MEETUP AT THIS LINK 👉 https://lu.ma/eb8tpu2f