Data Engineering
Meet other local people interested in Data Engineering: share experiences, inspire and encourage each other! Join a Data Engineering group.
Data Engineering Events Near You
Connect with your local Data Engineering community
Building Scalable Customer Identity Resolution Pipelines on AWS Using AI
Customer identity resolution becomes increasingly complex as organizations scale across multiple systems, regions, and data formats. Traditional rule-based approaches often fail to keep up with data variability, require constant manual tuning, and struggle with real-time processing needs.
This session presents a practical approach to building a scalable identity resolution pipeline using AWS services and modern AI techniques. The architecture combines data ingestion through Amazon S3 and AWS Glue, transformation pipelines using Spark on EMR, and machine learning models deployed via SageMaker for entity matching and standardization. Graph-based relationship modeling is implemented using Amazon Neptune to improve resolution accuracy by incorporating household and shared attribute context.
We will walk through how machine learning models can be used for name and address normalization, how intelligent blocking strategies improve matching efficiency, and how feedback loops can be introduced to continuously improve accuracy. The session also highlights how serverless components such as AWS Lambda can be used for orchestration and real-time processing.
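As a rough illustration of the blocking idea mentioned above, the sketch below groups records by a cheap key so that only records sharing a key are compared, rather than every record against every other. The key (first letter of the last name plus ZIP code), the field names, and the sample records are assumptions made for this example and are not taken from the session.

```python
from collections import defaultdict
from itertools import combinations


def blocking_key(record):
    """Hypothetical blocking key: first letter of last name plus ZIP code."""
    return (record["last_name"][:1].upper(), record["zip"])


def candidate_pairs(records):
    """Return ID pairs that share a blocking key and so deserve a full comparison."""
    blocks = defaultdict(list)
    for rec in records:
        blocks[blocking_key(rec)].append(rec["id"])

    pairs = set()
    for ids in blocks.values():
        # Only records inside the same block are paired with each other.
        pairs.update(combinations(sorted(ids), 2))
    return pairs


# Made-up sample records, for illustration only.
records = [
    {"id": 1, "last_name": "Smith", "zip": "43215"},
    {"id": 2, "last_name": "smyth", "zip": "43215"},
    {"id": 3, "last_name": "Jones", "zip": "43215"},
    {"id": 4, "last_name": "Smith", "zip": "60607"},
]

pairs = candidate_pairs(records)
print(pairs)
```

With four records there are six possible pairs, but this key leaves only one candidate pair for the more expensive matching step. A coarse key like this one can also miss true matches, which is presumably where smarter, learned blocking comes in.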
**SPEAKER BIO**
Mosaic Syed is a Senior Data Engineering and Cloud Solutions Architect with over 20 years of experience designing and delivering scalable, secure, and high-performance data solutions across global enterprise environments.
https://www.linkedin.com/in/mosaic-basha-syed-92300856
**CALL FOR SPEAKERS**
Learn more: [https://www.awscolumbus.com/get-involved/](https://www.awscolumbus.com/get-involved/)
**THANK YOU** *VEEAM* for hosting our meetup! To learn more about *Veeam*, please visit their website: [https://www.veeam.com/](https://www.veeam.com/)
**DIRECTIONS**
8800 Lyra Dr #450 · Columbus, OH
Go to the 4th floor.
**Want to sponsor the pizza and/or bar tab?**
Please contact me if you would like to sponsor this meetup's pizza and/or bar tab: angelo@mandato.com
Quarterly Community Gathering
Join the Columbus AI community for our quarterly gathering — a casual, community-focused evening where everyone has a chance to share, learn, and connect. These open mic–style events give anyone in the community up to **5 minutes** to present a project, share a tool, pose a question, or offer a perspective on the evolving AI space.
No slides required — just a welcoming space to exchange ideas and keep the local AI conversation moving.
If you’d like to take the stage, message **Chris (the organizer)** with a **title and short description** of what you’d like to share.
Whether you’re deep in the field or just getting curious, come connect with others building and exploring AI in Columbus.
Sponsored by [Transform Labs](https://www.linkedin.com/company/transformlabs/)
Sign-up is also available via [Transform Labs Luma](https://luma.com/transformlabshq)
Best Practices for Building a Reliable Lakehouse
**Abstract:** This is a practical playbook for building a production-grade data lakehouse. It walks through foundational principles (naming conventions, least-privilege access, automated CI/CD testing) before diving into medallion architecture. Metadata-driven design patterns then show how configuration tables and dynamic notebook orchestration eliminate hard-coded pipelines. The deck covers star schema modeling, guidance on choosing between Spark, Pandas, and SQL, and data quality enforcement using DQX with YAML data contracts. Finally, we dive into security best practices and performance optimizations.
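To make the metadata-driven idea concrete, here is a minimal sketch in plain Python. It assumes a configuration table held as a list of rows and a small dispatcher that picks a loader per row, so that adding a source means adding a row rather than writing a new pipeline. The table contents, column names, and loader functions are hypothetical and are not taken from the talk. They do not use any Spark, notebook, or DQX API.

```python
# Hypothetical configuration table: one row per source to ingest.
PIPELINE_CONFIG = [
    {"source": "crm.customers", "target": "bronze.customers",
     "load_type": "full", "enabled": True},
    {"source": "erp.orders", "target": "bronze.orders",
     "load_type": "incremental", "enabled": True},
    {"source": "legacy.returns", "target": "bronze.returns",
     "load_type": "full", "enabled": False},
]


def load_full(source, target):
    """Stand-in for a full reload; returns a description instead of moving data."""
    return f"overwrite {target} from {source}"


def load_incremental(source, target):
    """Stand-in for an incremental load; returns a description instead of moving data."""
    return f"append new rows from {source} into {target}"


# Dispatch table mapping a config value to the code that handles it.
LOADERS = {"full": load_full, "incremental": load_incremental}


def run_pipeline(config):
    """Run every enabled row through the loader named in its load_type column."""
    results = []
    for row in config:
        if not row["enabled"]:
            continue
        loader = LOADERS[row["load_type"]]
        results.append(loader(row["source"], row["target"]))
    return results


actions = run_pipeline(PIPELINE_CONFIG)
for action in actions:
    print(action)
```

In a real lakehouse the rows would more likely live in a table and the loaders would call parameterized notebooks or jobs, but the control flow stays the same.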
**Hosts:** Justin Shea, Mehdi Jeddi, Erik Pak, and Sou-Cheng Choi
**Talk Format:** This is a hybrid event. To attend online, join us on Zoom here at 6pm:
https://iit-edu.zoom.us/j/89379230295?pwd=NdETyE5sdYuSrvsrBZXSBFkUESBVkg.1
Meeting ID: 893 7923 0295
Passcode: 5t5WYn
**Sponsor:** Adyen, UIC College of Business, and PyData Chicago co-host this event. UIC will provide the meeting site. Adyen will sponsor pizza and soft drinks for the onsite participants.
**Address:** University of Illinois - Chicago, Douglass Hall, Room 220, 705 S Morgan St, Chicago, IL 60607
**Logistics:** “UIC Douglass Hall” is recognized on Google Maps, which can guide you through campus. Once you arrive, proceed to Room 220 on the second floor.
TBD
**Important time note:** Please plan to arrive between 5:30 and 6:00. The elevators lock after 6:00, so if you arrive later you'll need to message us and we'll have to come get you.
The building address is 4450 Bridge Park
The entrance is 6620 Mooney St, Suite 400
You will need to scan your ID at the door to get a visitor badge.
**Abstract**
TBD
**YouTube Link**
TBD
Level One Tuesdays (Bachata & Salsa Dance Lessons)
Click Here For More Videos:
https://www.facebook.com/share/r/17bwkttXhV/
***********************************
Salsamante Dance Academy will be at Swerve Every Tuesday Night to share the Rhythm & Energy of Bachata & Salsa.
These are Beginner Level lessons to get you comfortable with the two dances and help you understand them. Spread the Good News to all.
Swerve Dance & Fitness Complex
640 Lakeview Plaza Blvd A, Worthington, OH 43085
Bachata 7pm-8pm
Salsa 8pm-9pm
$15 - One Lesson
$20 - Both Lessons
COhPy Monthly Meeting
**Improving Office in Franklinton**
Physical location:
Improving Office
330 Rush Alley Suite #150
Columbus, OH 43215
Schedule:
6:00 p.m.: Socialize, eat, and drink. Improving will be providing pizza and beverages.
6:30 to 8:00 p.m.: Main meeting and presentation(s).
Topic: This month John Lairson will share a notebook describing the Alpaca (Paper) Trading API and discuss different algorithms for evaluating stock trades.
We meet on the last Monday of each month. Presentations are given by members and friends of this group. If you would like to do a presentation (small or large) on a Python topic, please contact Central OH Python at centralohpython@gmail.com.
Building Momentum: From Ambiguity to Execution
**Building a great product is one thing—building momentum behind it is another.**
Join **Senior Product Manager Adam Solaiman** and **User Experience Manager Tyson Smith** for a behind-the-scenes look at what it takes to turn complex ideas into scalable products inside large organizations.
In this session, they’ll share how teams move from ambiguity to execution—navigating organizational complexity, aligning stakeholders, and continuously evolving products after launch.
You’ll walk away with insights on how to:
* Build and sustain momentum across teams
* Adapt to changing priorities without losing direction
* Scale products thoughtfully in complex environments
Whether you're driving a new initiative or growing an existing product, this conversation will give you practical strategies to keep things moving forward.
Come connect, learn, and swap stories with fellow product professionals.
\-\-\-
Food and drinks will be provided by Switchbox, our generous host.
Free parking will be available at the front and back sides of the Switchbox Office.






