Skip to content

Details

About the speaker
https://www.linkedin.com/in/manish-sahani-001/

# 📢 Meetup Announcement: Deep Dive into Vector Databases, Vector Stores & Indexing Techniques for RAG

As part of our ongoing Hands-On RAG Series, we now move to the next crucial building block in building real-world RAG applications: Vectors, VectorDBs, Vector Stores, and Indexing Techniques.
Over the past sessions, we have taken a systematic, layered approach to demystify and master Retrieval-Augmented Generation:

### 🔹 Session 1Foundational RAG Building Blocks

We explored the complete lifecycle of RAG: chunking, embedding generation, vectorization, indexing, and retrieval workflows, building strong conceptual clarity.

### 🔹 Session 2Advanced Chunking Techniques

We went deeper into chunking strategies for text, images, audio, video, and multimodal data, understanding how chunk quality directly impacts RAG accuracy and retrieval performance.

### 🔹 Session 3All About Embeddings

We dedicated an entire session to embeddings, covering types, models, quality, dimensionality, similarity metrics, optimization techniques, and best practices.

***

# ⭐ Now: Session 4 — Vectors, Vector Databases & Indexing for High-Performance RAG

This upcoming meetup focuses on one of the most critical components of any serious RAG system:
How do we store, search, and retrieve vectors efficiently at scale?

### What we will cover

#### 1️⃣ Why Vector Databases?

  • Why traditional RDBMS systems fail for RAG workloads
  • What makes vector search unique
  • Real-world performance challenges in RAG projects

#### 2️⃣ What Exactly Is a Vector?

  • Understanding vector representation
  • How embeddings turn unstructured data into searchable geometry
  • Similarity search & distance metrics

#### 3️⃣ Vector Databases vs. Vector Stores

  • Key differences
  • Architectures, storage formats, indexing strategies
  • When to choose which

#### 4️⃣ Popular Vector Databases & Tools

A comparative, practical guide across the ecosystem:
FAISS, Milvus, Pinecone, Chroma, Weaviate, Qdrant, Redis Stack, Vespa, etc.
For each, we'll cover:

  • Strengths & limitations
  • Licensing considerations
  • Scaling behaviours
  • Performance benchmarks
  • Ecosystem & integrations (LangChain, LlamaIndex, Python APIs, etc.)

#### 5️⃣ Indexing Techniques (Core of This Session)

We will deep-dive into indexing algorithms used for vector search:

  • IVF / IVF-PQ / IVF-OPQ
  • Flat indexing
  • HNSW (Hierarchical Navigable Small Worlds)
  • Annoy, ScaNN, LSH, PQ, OPQ
  • Tradeoffs: accuracy vs. speed vs. memory
  • How indexing impacts RAG output quality
  • Which indexing to choose for which use-case

#### 6️⃣ Real-World Experiences & What Actually Works

This session is hands-on and practitioner-driven.
The experts presenting have real-world implementation experience, have “dirtied their hands,” and will share:

  • What worked for them
  • What failed
  • Practical, project-tested tips
  • Pitfalls they discovered the hard way
  • Optimization techniques for enterprise-grade RAG systems

This is not theory or a high-level overview.
These are practical, actionable learnings from people who build real systems every day.

***

# 🎯 Who Should Attend

  • Engineers building RAG systems
  • Data scientists & ML practitioners
  • Architects & solution designers
  • Students exploring AI systems engineering
  • Anyone who wants to understand the backbone of modern intelligent search

***

# 🎉 Outcome

This session will equip you with practical, working clarity on:

  • How vectors are stored
  • How similarity search works under the hood
  • Why indexing matters for RAG
  • How to pick the right vector database for your use-case
  • How to avoid common mistakes while building vector-powered AI apps

Members are also interested in