This is a new meetup group for data science and data engineering practitioners from companies in the SF Bay Area. We’ve seen that data teams across many companies are solving very similar problems with similar tools. In this meetup we aim to share best practices that our teams have learned from working with large-scale distributed systems in production environments.
Talks will focus on a holistic systems view of theory & practice for production data use cases, including:
• Data engineering, and building ETL pipelines across batch and streaming
• Experiments / A/B testing theory and practice, including infrastructure, randomization/assignment, and analysis
• Machine learning, bridging from theory to production systems, including deep learning, offline training, online inference, production model serving, and metrics/monitoring