#57 Large Scale Production Engineering India (LSPE-IN) meet up
Details
(Youtube link to be updated later)
Event Agenda and Speaker info
10:45 - Intro over Tea
11:15 - Keynote : Vivek Agarwal, SVP Engineering, Razorpay
11:30 - Observability at Scale: When Logs Become the Problem - Saijal Shrivastava and Nehal , Razorpay
In this session, Saijal and Nehal explores the evolution of observability at scale, moving beyond the noise and high costs of traditional logging. Drawing from real-world production trade-offs at Razorpay, they analyze common pitfalls in metric design—such as high cardinality and excessive emission—and demonstrate how infra-level tracing with eBPF can eliminate entire classes of troubleshooting logs without requiring application instrumentation. The talk concludes by illustrating how reducing observability noise is a prerequisite for effective AI-driven incident summarization and signal triage, offering practical lessons for managing complex systems efficiently.
Saijal leads the Observability charter at Razorpay, where she designs and operates large-scale observability platforms spanning metrics, logs, and traces across highly distributed, high-throughput systems. Her work focuses on standardising telemetry, taming high-cardinality data, and building signal-driven alerting that scales operationally and economically. She works closely with production and platform teams to ensure observability directly improves MTTD, MTTR, and reliability outcomes, rather than becoming a passive reporting layer.
Nehal leads AIOps at Razorpay, building AI-assisted systems for incident detection, correlation, and root cause analysis in complex production environments. His work sits at the intersection of observability data, operational workflows, and applied machine learning, with a strong emphasis on reducing alert noise and accelerating decision-making during incidents. Nehal focuses on deploying pragmatic, production-ready AI systems that augment on-call engineers and scale reliability practices as the organisation and system complexity grow.
12:15 - Exploring Quantum Computing : Debansu Saha, Cluster Leader, Tech Strategy, MHP India
The world is embracing the impact of 2nd Quantum revolution. Govt of India is spearheading it's National Quantum Mission (NQM) with Govt. of Karnataka announcing to build India's first Q-City with a $20 bn Quantum Economy by 2035. In the talk, Debansu will scratch the surface of this emerging field and it's possible need for knowledge exploration and re-skilling.
Debansu is a seasoned techie with three decades plus industry experience, nearly half of which is in Indian public sector. He is one of the founders of #lspe-in meetup forum in 2012. He is a licensed HAM radio operator, serious amateur photographer and a failed guitarist.
1:00 - Lunch
2:00 - Using boring CLI tools curl, ls, ps etc. to break and harden GNU/Linux systems : Krishnendu Paul, Lead of Red Team at Ericsson and Krishnendu Das, Engineer , Advanced Managed Services and AI delivery at Lenovo
For over 20 years, Krishnendu Paul has operated on the digital frontlines, specializing in proactive security, penetration testing, and digital forensics. Currently, he leads the Red Team at Ericsson, where their focus is on emulating advanced adversaries to proactively identify and neutralize critical security vulnerabilities. When not actively breaking systems (legally), Krishnendu is dedicated to researching and sharing knowledge on emerging topics such as Threat Actor Behavior, Digital Forensics, and Reverse Engineering at various industry events.
For over 20 years, Krishnendu Das has focused on the operational and defensive aspects of complex infrastructure, specializing in securing and maintaining large-scale digital environments. As an Engineer - Advanced Managed Services, SSG Hybrid Cloud & AI Delivery at Lenovo. Krishnendu currently focus on ensuring the reliability, resilience, and security of advanced cloud and AI solutions delivered to clients. He brings deep expertise in systems hardening, incident response, and performance engineering, and is committed to sharing practical insights on managing high-stakes production environments.
2:50 - Lightening Talk - Safety Net for critical services beyond primary platforms : Arkadip Basu, Principal Engineer
From Hi to Square off in Trading. Running a parallel infrastructure that protects customer in case of Global infra outage.
Arkadip is Principal Engineer at Fin Tech Enterprise, applying AI & Resiliency for business critical services. He has expertise in developing infrastructure that is capable of handling large volume of high frequency transaction across Data Centre and cloud over multiple geographies. He is engaged in Pro-Active algorithm building for efficient Op-Ex planning & analysis.
3:30 - Networking tea and closing
