Let’s Talk Vectors: Databases, Stores & Indexing Explained

Name: Let’s Talk Vectors: Databases, Stores & Indexing Explained
Start: 2025-11-29T16:00:00+05:30
End: 2025-11-29T18:00:00+05:30

Hosted by SUCHETA G D.

Pune Women in Machine Learning & Data Science

Details

About the speaker
https://www.linkedin.com/in/manish-sahani-001/

# 📢 Meetup Announcement: Deep Dive into Vector Databases, Vector Stores & Indexing Techniques for RAG

As part of our ongoing Hands-On RAG Series, we now move to the next crucial building block in building real-world RAG applications: Vectors, VectorDBs, Vector Stores, and Indexing Techniques.
Over the past sessions, we have taken a systematic, layered approach to demystify and master Retrieval-Augmented Generation:

### 🔹 Session 1 — Foundational RAG Building Blocks

We explored the complete lifecycle of RAG: chunking, embedding generation, vectorization, indexing, and retrieval workflows, building strong conceptual clarity.

### 🔹 Session 2 — Advanced Chunking Techniques

We went deeper into chunking strategies for text, images, audio, video, and multimodal data, understanding how chunk quality directly impacts RAG accuracy and retrieval performance.

### 🔹 Session 3 — All About Embeddings

We dedicated an entire session to embeddings, covering types, models, quality, dimensionality, similarity metrics, optimization techniques, and best practices.

***

# ⭐ Now: Session 4 — Vectors, Vector Databases & Indexing for High-Performance RAG

This upcoming meetup focuses on one of the most critical components of any serious RAG system:
How do we store, search, and retrieve vectors efficiently at scale?

### What we will cover

#### 1️⃣ Why Vector Databases?

Why traditional RDBMS systems fail for RAG workloads
What makes vector search unique
Real-world performance challenges in RAG projects

#### 2️⃣ What Exactly Is a Vector?

Understanding vector representation
How embeddings turn unstructured data into searchable geometry
Similarity search & distance metrics

#### 3️⃣ Vector Databases vs. Vector Stores

Key differences
Architectures, storage formats, indexing strategies
When to choose which

#### 4️⃣ Popular Vector Databases & Tools

A comparative, practical guide across the ecosystem:
FAISS, Milvus, Pinecone, Chroma, Weaviate, Qdrant, Redis Stack, Vespa, etc.
For each, we'll cover:

Strengths & limitations
Licensing considerations
Scaling behaviours
Performance benchmarks
Ecosystem & integrations (LangChain, LlamaIndex, Python APIs, etc.)

#### 5️⃣ Indexing Techniques (Core of This Session)

We will deep-dive into indexing algorithms used for vector search:

IVF / IVF-PQ / IVF-OPQ
Flat indexing
HNSW (Hierarchical Navigable Small Worlds)
Annoy, ScaNN, LSH, PQ, OPQ
Tradeoffs: accuracy vs. speed vs. memory
How indexing impacts RAG output quality
Which indexing to choose for which use-case

#### 6️⃣ Real-World Experiences & What Actually Works

This session is hands-on and practitioner-driven.
The experts presenting have real-world implementation experience, have “dirtied their hands,” and will share:

What worked for them
What failed
Practical, project-tested tips
Pitfalls they discovered the hard way
Optimization techniques for enterprise-grade RAG systems

This is not theory or a high-level overview.
These are practical, actionable learnings from people who build real systems every day.

***

# 🎯 Who Should Attend

Engineers building RAG systems
Data scientists & ML practitioners
Architects & solution designers
Students exploring AI systems engineering
Anyone who wants to understand the backbone of modern intelligent search

***

# 🎉 Outcome

This session will equip you with practical, working clarity on:

How vectors are stored
How similarity search works under the hood
Why indexing matters for RAG
How to pick the right vector database for your use-case
How to avoid common mistakes while building vector-powered AI apps

Let’s Talk Vectors: Databases, Stores & Indexing Explained

Pune Women in Machine Learning & Data Science

Details

You may also like