About us
We’re delighted to have you join the ClickHouse and friends global community!
This is a meetup for everybody who is interested in ClickHouse, from a technology and use case perspective. ClickHouse® is an open-source, high performance columnar OLAP database management system for real-time analytics using SQL. The sessions and discussions in this group will relate to architecture considerations, software design, coding and much more.
Our user group is free and open so we welcome you all to learn, collaborate, and share experiences!
Upcoming events
1

ClickHouse Delhi/Gurgaon Meetup
Awfis Space Solutions @ Ambience Mall, 07th Floor, Gate No. 03 & Gate No. 04, Ambience Island, NH 48, Ambience Island, DLF Phase 3, Sector 24, Gurugram, Haryana 122010, Gurugram, INJoin us in Delhi for yet another exciting meetup. This event brings together engineers, architects, and data leaders to explore how modern teams are building scalable, real-time analytics platforms in the cloud. Speakers will share practical architectures, production lessons, and real-world insights from high-volume, performance-intensive workloads powered by ClickHouse. Connect with the local data community and discover how teams are turning streaming data into fast, actionable intelligence.
Don’t miss out! RSVP and secure your spot!
🗓️ Agenda:
- 10:00 AM: Registration & networking
- 11:00 AM: Welcome & opening
- 11:10 AM: Talk 1 - What Do Agents Need from Databases? Speed, Scale, and Semantics by Nishant Bangarwa, Cofounder and Head of Engineering, Rill Data
- 11:40 PM: Talk 2 - Building Scalable Real-Time Analytics Pipelines with ClickHouse by Samyak Jain, Senior Data Engineer, Info Edge
- 12:10 PM: Break
- 12:20 PM: Talk 3 - Agentic AI Systems: Performance Engineering for Conversational Workloads by Siddhant Agarwal, Senior Developer Relations Advocate, ClickHouse
- 12:40 PM: Talk 4 - How Atlys Rebuilt Its Data Stack Around ClickHouse. Twice. by Abhishek Banerjee, Senior Data Engineer, Atlys
- 12:55 PM: Closing Remarks
- 01:00 PM: Lunch & networking
If anyone from the community is interested in sharing a talk at future events, complete this CFP form and we’ll be in touch.
🎤 Session Details: What Do Agents Need from Databases? Speed, Scale, and Semantics
Description: AI agents are starting to answer business questions—but letting them query raw tables directly leads to broken analytics: hallucinated KPIs, ambiguous joins, invalid SQL, and expensive scans. The real issue isn’t the model—it’s the lack of a fast, governed semantic interface.
In this talk, we’ll examine why low-latency analytics is a prerequisite for agent-driven workflows, and how modern OLAP engines—particularly ClickHouse—enable this shift. With columnar storage, vectorized execution, and real-time aggregation, ClickHouse supports sub-second analytical queries at scale, allowing agents to iteratively explore, validate, and explain results.
The session will cover why SQL remains the most effective language for defining and governing metrics, and outline best practices for metric modeling, access control, and AI-assisted authoring—while preventing ungoverned KPI creation. The talk concludes with an overview of Rill’s SQL-based semantic architecture and a live demonstration showing how AI agents can safely discover, query, and explain governed metrics on top of ClickHouse delivering fast, deterministic, and explainable business insights.
Speaker: Nishant Bangarwa, Cofounder and Head of Engineering, Rill Data
Nishant Bangarwa is co-founder and Head of Engineering at Rill Data, where he builds open-source tooling for operational BI and fast analytics on top of engines like DuckDB and ClickHouse. Today he's focused on metrics-first architectures: how to model, control, and serve business metrics in a way that works for both humans and AI agents that need reliable answers without raw table access.🎤 Session Details: Building Scalable Real-Time Analytics Pipelines with ClickHouse
Description: In this talk, I'll walk through how to design and build scalable real-time analytics pipelines using ClickHouse. We'll cover practical architecture patterns, ingestion strategies, and how to leverage MergeTree engines for performance and cost efficiency. I'll also share lessons learned from real-world use cases, including handling high-throughput data and optimizing query performance.
Speaker: Samyak Jain, Senior Data Engineer, Info Edge
Samyak Jain is a Data Engineer with experience in building scalable data platforms and real-time analytics systems. He has worked extensively with modern data stack technologies and is particularly interested in high-performance OLAP systems like ClickHouse. He actively explores efficient data modeling, pipeline optimization, and distributed systems.🎤 Session Details: Agentic AI Systems: Performance Engineering for Conversational Workloads
Description: Generative AI got the buzz, but agentic AI is changing how we actually work. Instead of waiting for prompts, agents explore data, run iterative queries, and reason step by step. That shift creates real infrastructure challenges, including unpredictable query patterns, sub-second response requirements, mixed read and write workloads, and the need for visibility and trust in AI-driven results.
In this session, we will share practical lessons from running agentic workloads and what they mean for modern data platforms. We will also give updates on our AI roadmap, including how recent acquisitions like LibreChat and Langfuse strengthen agent interfaces, observability, and evaluation, and how we are building toward AI systems that do not just answer questions, but actively help you find better ones.
Speaker: Siddhant Agarwal, Senior Developer Relations Advocate, APAC, ClickHouse
Siddhant Agarwal leads Developer Communities for APAC at ClickHouse. Prior to ClickHouse, Sid was with Neo4j and Google. He is passionate about tech communities and has built India’s first fintech developer community at Open Financial Technologies and worked with Google Developer Relations programmes including GDSC, TFUG, GDG, and GDE. He has nearly a decade of experience building developer and startup communities globally and is among ACM’s Distinguished Speakers (one of 200+ worldwide, 20+ in India).🎤 Session Details: How Atlys Rebuilt Its Data Stack Around ClickHouse. Twice.
Description: At Atlys, we've run ClickHouse twice — first self-hosted, then on Cloud. This talk covers the full arc: why we moved from a fragmented Snowflake + Segment stack to self-hosted ClickHouse, how we built Pulse (our in-house Go-based ETL on top of NATS) to get real-time ingestion, and what it took to migrate 600M records across 2,259 tables to ClickHouse Cloud with zero downtime and zero data loss. We'll go deep on sharding strategy, the ReplicatingReplacingMergeTree setup, and the 3-phase migration plan — with real query benchmarks throughout.
Speaker: Abhishek Banerjee, Senior Data Engineer, Atlys
Abhishek Banerjee is a Senior Data Engineer at Atlys, where he owns the end-to-end data infrastructure powering analytics across the company's visa processing platform. Previously at Hindustan Times and OTTPlay (HT Media), where he led data pipeline integrations for one of India's largest OTT platforms. He is also a GenAItechLab Fellow.68 attendees
Past events
6


