Skip to content

πŸ’šπŸ’™πŸ’­πŸš€ Building Data Orchestration for Big Data Analytics in the Cloud

Photo of Eloisa Elias
Hosted By
Eloisa E. and 2 others
πŸ’šπŸ’™πŸ’­πŸš€ Building Data Orchestration for Big Data Analytics in the Cloud

Details

# πŸ’šRSVP: Co-host: Seattle Spark AI

🍬 🍭🍿 Happy 2023 and we're going to kick start it with our first meetup back in Seattle downtown at the Common Room offices in Pioneer Square! Come for great technical content, discussions, and food!

🌷 🌸 🌼 PyData Seattle meetup is an accessible, community-driven meetup, with novice to advanced level presentations in Data Science/ML/AI/DL

πŸ’ž πŸ’Ÿ Raffle! πŸŽ‰**:**

Agenda

  • 6pm: Doors Open, Eat, & networking πŸ” πŸ•
  • 6:30pm-7:10pm: Building Data Orchestration for Big Data Analytics in the Cloud by Jasmine Wang and Shouwei Chen from Alluxio
  • 7:15pm-7:55pm: Koushik Krishnan - Talk: Notebooks as Functions
  • 8:15pm Close up

Session 1: Building Data Orchestration for Big Data Analytics in the Cloud
Abstract:
Originally developed from UC Berkeley AMPLab as research project "Tachyon", Alluxio (www.alluxio.io) implements the world’s first open-source data orchestration system in the cloud. Alluxio creates a unified access layer for data-driven applications in bigdata and ML, enabling Spark, Presto or TensorFlow and etc to transparently access different external storage systems while actively leveraging in-memory cache to accelerate data access.

In this talk, the speaker will present
- New trends and challenges in the data ecosystem in cloud era
- Effective Data engineering in the cloud world with data orchestration
- Production use cases of using popular stacks like Presto/Alluxio/S3

πŸ’– Speakers

🌷🌸 Jasmine Wang is the Head of Community and DevRel at Alluxio. She is a former national debate champion who turned into a traveling yoga teacher with a strong passion in building teams and being the bridge at early startups in Silicon Valley. Previously, she worked as the Head of Global Talent Acquisition and Operations. Currently she is building the Alluxio open source community, responsible for community, developer relations, developer experience, and cross-community collaborations at Alluxio.

🌸 🌼 Dr. Shouwei Chen is a core maintainer and product manager of open-source Alluxio. Before joining Alluxio, Shouwei received a Ph.D. degree from Rutgers University. Shouwei’s research focuses on the codesign of the memory-centric computing frameworks with in-memory distributed file systems in large-scale environments.

πŸš€πŸ€Ÿ Koushik Krishnan is a Site Reliability Engineer at Yugabyte. Talk: Notebooks as Functions
Jupyter notebooks are a wonderful environment to write code for both beginners and experienced individuals. The hard part comes when you want to take your notebook and productionize it. That's where Jupyrest comes to the rescue. Jupyrest is a tool that can turn Jupyter notebooks into HTTP functions. It's a serverless platform for Jupyter notebooks. I created Jupyrest at Microsoft and open sourced it earlier this year. In this talk I'll demonstrate how to use Jupyrest to productionize your Jupyter notebooks.

PyData Seattle is looking for speakers. Many of our members are doing amazing data science with Python tools. We want to hear what you are up to! If you have a presentation of between 10 minutes and 1 hour that you would like to share with our group, please submit a short proposal.

You can propose a talk, workshop or lightning talk for our monthly meetups and TalkNights hosted in Seattle and Bellevue.

Fill in the form and let everyone know about the cool work you are doing: Here πŸ’Ÿ

πŸ’™ πŸ’š Sponsor PyData Seattle
Host an event or provide some delicious food and snacks for our attendees 🍬 🍭🍿
Email us at [pydataseattle@gmail.com](https://forms.gle/FYrxFdCQcM3SrQ9V9) or fill out the form here

🌸 Thank you for your support to @NumFOCUS, your participation help us to bring awareness to NumFOCUS a 501(c)(3) nonprofit that supports and promotes world-class, innovative, open source scientific computing projects for Data Science, including: Pandas, Numpy, Sympy, IPython, Jupyter, Matplotlib, Julia and many other cool open source data science projects.

πŸ’› πŸ’œ Become a NumFOCUS Member!
Help sustain the open source data stack by becoming a NumFOCUS member

NumFOCUS envisions an inclusive scientific and research community that utilizes actively supported open source software to make impactful discoveries for a better world.

Photo of PyData Seattle group
PyData Seattle
See more events