Silent Failures in AI Data Systems: Risk Drift in Pipelines
Details
As AI becomes embedded in cloud data platforms, organizations are encountering failure modes that differ fundamentally from those of traditional deterministic systems. Rather than failing in a hard, binary way, AI-augmented pipelines often degrade silently, with quality erosion that emerges gradually and compounds over time.
This talk explores key failure patterns in production AI data systems. Drawing from machine learning systems research, we examine how data and concept drift can persist undetected as models continue producing outputs despite shifting feature distributions. The risk intensifies in chained pipelines, where probabilistic errors compound across stages—for example, when upstream model inaccuracies propagate and amplify downstream.
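To make the drift pattern concrete, here is a minimal sketch of the kind of distribution-level check the talk motivates: comparing a live feature sample against a training-time baseline with a two-sample Kolmogorov-Smirnov test. The feature data, sample sizes, and alert threshold are illustrative assumptions, not part of the talk.

```python
# Minimal drift check: compare live feature values against a training-time baseline.
# The sample data and the significance threshold are illustrative assumptions.
import numpy as np
from scipy.stats import ks_2samp

DRIFT_P_VALUE = 0.01  # assumed significance threshold for flagging drift


def detect_feature_drift(baseline: np.ndarray, live: np.ndarray) -> bool:
    """Return True if the live distribution differs significantly from the baseline."""
    statistic, p_value = ks_2samp(baseline, live)
    return p_value < DRIFT_P_VALUE


# Example: the model keeps producing "valid" outputs for every row,
# but the distribution-level check surfaces the silent shift in the feature.
baseline = np.random.normal(loc=0.0, scale=1.0, size=10_000)
live = np.random.normal(loc=0.6, scale=1.0, size=10_000)
print("drift detected:", detect_feature_drift(baseline, live))
```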
We also analyze non-deterministic inference behavior, which complicates reproducibility, auditability, and root cause analysis in cloud environments. The session highlights risks of AI-generated data contamination, where synthetic outputs are mistakenly treated as ground truth, accelerating feedback loops and long-term model degradation.
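One hedged sketch of a guard against that contamination: tag records with their provenance at write time and filter model-generated rows out of anything treated as ground truth. The record schema and provenance labels below are assumptions made for illustration.

```python
# Sketch: provenance tagging so synthetic outputs are not mistaken for ground truth.
# The record schema and provenance labels are illustrative assumptions.
from dataclasses import dataclass
from typing import Iterable, List


@dataclass
class Record:
    payload: dict
    provenance: str  # e.g. "human_labelled", "sensor", "model_generated"


def training_safe(records: Iterable[Record]) -> List[Record]:
    """Keep only records whose provenance is acceptable as ground truth."""
    trusted = {"human_labelled", "sensor"}
    return [r for r in records if r.provenance in trusted]


batch = [
    Record({"text": "verified label"}, provenance="human_labelled"),
    Record({"text": "LLM summary"}, provenance="model_generated"),
]
print(len(training_safe(batch)))  # 1 -- the synthetic row is excluded from training
```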
At the infrastructure level, we discuss challenges such as non-linear inference cost scaling, observability gaps that mask semantic failures, and automation complacency that reduces human oversight.
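The observability gap can be illustrated with a small sketch: a stage that looks healthy on infrastructure metrics (row counts match, no exceptions raised) while a basic semantic check on missing values catches the degradation. The column, threshold, and sample data are assumptions chosen only to show the contrast.

```python
# Sketch: infrastructure metrics can look healthy while content quality degrades.
# The null-rate threshold and sample values are illustrative assumptions.
from typing import List, Optional

NULL_RATE_THRESHOLD = 0.05  # assumed acceptable fraction of missing values


def semantic_check(values: List[Optional[float]]) -> bool:
    """Return True if the column passes a basic semantic quality gate."""
    null_rate = sum(v is None for v in values) / len(values)
    return null_rate <= NULL_RATE_THRESHOLD


rows_in = [1.0, 2.0, None, None, None, 3.0, None, 4.0, None, 5.0]
rows_out = rows_in  # the stage "succeeded": same row count, no exceptions raised
print("row counts match:", len(rows_in) == len(rows_out))   # infra view: healthy
print("semantic gate passes:", semantic_check(rows_out))    # semantic view: failing
```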
The talk concludes with practical design principles for cloud data systems, including metadata-first architectures, explicit trust boundaries, and human-in-the-loop checkpoints to build resilient, auditable, and trustworthy AI-driven pipelines.
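As a minimal sketch of one such checkpoint, assuming a per-record confidence score and a review queue that the talk itself does not specify: outputs below a confidence cut-off are held at the trust boundary for human sign-off instead of flowing into downstream stages automatically.

```python
# Sketch: a human-in-the-loop checkpoint at a trust boundary.
# The confidence field, threshold, and review queue are illustrative assumptions.
from typing import Callable, Dict, List

CONFIDENCE_THRESHOLD = 0.9  # assumed cut-off for automatic promotion


def checkpoint(outputs: List[Dict], enqueue_for_review: Callable[[Dict], None]) -> List[Dict]:
    """Promote high-confidence outputs; route the rest to human review."""
    promoted = []
    for item in outputs:
        if item.get("confidence", 0.0) >= CONFIDENCE_THRESHOLD:
            promoted.append(item)
        else:
            enqueue_for_review(item)  # held at the trust boundary until a human signs off
    return promoted


review_queue: List[Dict] = []
batch = [{"id": 1, "confidence": 0.97}, {"id": 2, "confidence": 0.41}]
downstream = checkpoint(batch, review_queue.append)
print(len(downstream), len(review_queue))  # 1 promoted, 1 held for review
```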
