Skip to content

Details

Customer identity resolution becomes increasingly complex as organizations scale across multiple systems, regions, and data formats. Traditional rule-based approaches often fail to keep up with data variability, require constant manual tuning, and struggle with real-time processing needs.

This session presents a practical approach to building a scalable identity resolution pipeline using AWS services and modern AI techniques. The architecture combines data ingestion through Amazon S3 and AWS Glue, transformation pipelines using Spark on EMR, and machine learning models deployed via SageMaker for entity matching and standardization. Graph-based relationship modeling is implemented using Amazon Neptune to improve resolution accuracy by incorporating household and shared attribute context.

We will walk through how machine learning models can be used for name and address normalization, how intelligent blocking strategies improve matching efficiency, and how feedback loops can be introduced to continuously improve accuracy. The session also highlights how serverless components such as AWS Lambda can be used for orchestration and real-time processing.

SPEAKER BIO
Mosaic Syed is a Senior Data Engineering and Cloud Solutions Architect with over 20 years of experience designing and delivering scalable, secure, and high-performance data solutions across global enterprise environments.
https://www.linkedin.com/in/mosaic-basha-syed-92300856

CALL FOR SPEAKERS
Learn more: https://www.awscolumbus.com/get-involved/

THANK YOU VEEAM for hosting our meetup! To learn more about Veeam, please visit their website: https://www.veeam.com/

DIRECTIONS
8800 Lyra Dr #450 · Columbus, OH
go to 4th floor.

Want to sponsor the pizza and/or bar tab?
Please contact me if you would like to sponsor this meetup's pizza and/or bar tab: angelo@mandato.com

Related topics

Events in Columbus, OH
Artificial Intelligence
Amazon Web Services
Cloud Computing
Serverless Architecture
Software Development

You may also like