What we're about

80-90% of time spent on data projects is gathering data and putting it into a format you can analyze. Data Wranglers DC is a professional group that meets monthly to discuss topics including open data, data gathering, data munging, and the creation, storage and maintenance of datasets. We combine presentations with hands-on workshops, always seeking to make our data munging lives easier. No experience necessary - just bring your interest.

Upcoming events (4+)

Data Engineer's Lunch #86: Building Real-Time Applications at Scale

Link visible for attendees

As the demand for real-time data processing continues to grow, so too do the challenges associated with building production-ready applications that can handle large volumes of data and handle it quickly. In this talk, we will explore common problems faced when building real-time applications at scale, with a focus on a specific use case: detecting and responding to cyclist crashes. Using telemetry data collected from a fitness app, we’ll demonstrate how we used a combination of Apache Kafka and Python-based microservices running on Kubernetes to build a pipeline for processing and analyzing this data in real-time. We'll also discuss how we used machine learning techniques to build a model for detecting collisions and how we implemented notifications to alert family members of a crash. Our ultimate goal is to help you navigate the challenges that come with building data-intensive, real-time applications that use ML models. By showcasing a real-world example, we aim to provide practical solutions and insights that you can apply to your own projects.

Key Takeaways:

  • An understanding of the common challenges faced when building real-time applications at scale
  • Strategies for using Apache Kafka and Python-based microservices to process and analyze data in real-time
  • Tips for implementing machine learning models in a real-time application
  • Best practices for responding to and handling critical events in a real-time application

Bring your lunch and join in. Don't have to leave your desk. Can come as early as 11:45 AM ET/10:45 AM CT to network & catchup.

5-10m Wait for people to get in.
10-15m Volunteer presents/ talks about something they are working on/cool stuff
10-15m Q/A Commentary

Data Engineer's Lunch

Link visible for attendees

Bring your lunch and join in. Don't have to leave your desk. Can come as early as 11:45 AM ET/10:45 AM CT to network & catchup.

5-10m Wait for people to get in.
10-15m Volunteer presents/ talks about something they are working on/cool stuff
10-15m Q/A Commentary

Data Engineer's Lunch

Link visible for attendees

Bring your lunch and join in. Don't have to leave your desk. Can come as early as 11:45 AM ET/10:45 AM CT to network & catchup.

5-10m Wait for people to get in.
10-15m Volunteer presents/ talks about something they are working on/cool stuff
10-15m Q/A Commentary

ScyllaDB Summit 2023

Link visible for attendees

ScyllaDB Summit 2023 focuses on how market disruptors are powering the data-intensive applications for this next tech cycle using ScyllaDB — along with the latest hardware advancements, cloud infrastructure, event streaming, and other data ecosystem elements.

Whether you want to discover the latest ScyllaDB advancements, hear how your peers are solving their toughest data-intensive application challenges, or explore the latest trends across the broader data ecosystem, we’ve got you covered.

Discover the latest trends and best practices impacting data-intensive applications. Join us at ScyllaDB Summit – free + virtual – for 30+ technical sessions.

*ALL Attendees must register via the ScyllaDB Summit Link: https://www.scylladb.com/scylladb-summit-2023/

Explore
Explore what’s next for distributed data technologies and strategies from leading experts.

Connect
Connect with the global ScyllaDB community and see what they’re achieving with fast NoSQL.

Learn
Level up your NoSQL expertise with actionable tips that give your team and project an edge.

Past events (153)

Data Engineer's Lunch #85: Designing a Modern Data Stack

This event has passed