Let’s Talk Vectors: Databases, Stores & Indexing Explained
Details
About the speaker
https://www.linkedin.com/in/manish-sahani-001/
# 📢 Meetup Announcement: Deep Dive into Vector Databases, Vector Stores & Indexing Techniques for RAG
As part of our ongoing Hands-On RAG Series, we now move to the next crucial building block in building real-world RAG applications: Vectors, VectorDBs, Vector Stores, and Indexing Techniques.
Over the past sessions, we have taken a systematic, layered approach to demystify and master Retrieval-Augmented Generation:
### 🔹 Session 1 — Foundational RAG Building Blocks
We explored the complete lifecycle of RAG: chunking, embedding generation, vectorization, indexing, and retrieval workflows, building strong conceptual clarity.
### 🔹 Session 2 — Advanced Chunking Techniques
We went deeper into chunking strategies for text, images, audio, video, and multimodal data, understanding how chunk quality directly impacts RAG accuracy and retrieval performance.
### 🔹 Session 3 — All About Embeddings
We dedicated an entire session to embeddings, covering types, models, quality, dimensionality, similarity metrics, optimization techniques, and best practices.
***
# ⭐ Now: Session 4 — Vectors, Vector Databases & Indexing for High-Performance RAG
This upcoming meetup focuses on one of the most critical components of any serious RAG system:
How do we store, search, and retrieve vectors efficiently at scale?
### What we will cover
#### 1️⃣ Why Vector Databases?
- Why traditional RDBMS systems fail for RAG workloads
- What makes vector search unique
- Real-world performance challenges in RAG projects
#### 2️⃣ What Exactly Is a Vector?
- Understanding vector representation
- How embeddings turn unstructured data into searchable geometry
- Similarity search & distance metrics
#### 3️⃣ Vector Databases vs. Vector Stores
- Key differences
- Architectures, storage formats, indexing strategies
- When to choose which
#### 4️⃣ Popular Vector Databases & Tools
A comparative, practical guide across the ecosystem:
FAISS, Milvus, Pinecone, Chroma, Weaviate, Qdrant, Redis Stack, Vespa, etc.
For each, we'll cover:
- Strengths & limitations
- Licensing considerations
- Scaling behaviours
- Performance benchmarks
- Ecosystem & integrations (LangChain, LlamaIndex, Python APIs, etc.)
#### 5️⃣ Indexing Techniques (Core of This Session)
We will deep-dive into indexing algorithms used for vector search:
- IVF / IVF-PQ / IVF-OPQ
- Flat indexing
- HNSW (Hierarchical Navigable Small Worlds)
- Annoy, ScaNN, LSH, PQ, OPQ
- Tradeoffs: accuracy vs. speed vs. memory
- How indexing impacts RAG output quality
- Which indexing to choose for which use-case
#### 6️⃣ Real-World Experiences & What Actually Works
This session is hands-on and practitioner-driven.
The experts presenting have real-world implementation experience, have “dirtied their hands,” and will share:
- What worked for them
- What failed
- Practical, project-tested tips
- Pitfalls they discovered the hard way
- Optimization techniques for enterprise-grade RAG systems
This is not theory or a high-level overview.
These are practical, actionable learnings from people who build real systems every day.
***
# 🎯 Who Should Attend
- Engineers building RAG systems
- Data scientists & ML practitioners
- Architects & solution designers
- Students exploring AI systems engineering
- Anyone who wants to understand the backbone of modern intelligent search
***
# 🎉 Outcome
This session will equip you with practical, working clarity on:
- How vectors are stored
- How similarity search works under the hood
- Why indexing matters for RAG
- How to pick the right vector database for your use-case
- How to avoid common mistakes while building vector-powered AI apps
