Long-Term Memory

Master how AI agents store and retrieve knowledge across sessions using persistent memory systems

Vector Databases: Semantic Search at Scale

Traditional keyword search looks for exact matches: ask for "password reset" and you get documents containing those exact words. AI agents need semantic search, which recognizes that "forgot credentials" means roughly the same thing as "password reset."

Enter vector databases—systems designed to store and search high-dimensional vectors (embeddings) that capture meaning.

📐 How Embeddings Work

Traditional Keyword Search

🔍 Query: "password reset"

Looks for exact words: ["password", "reset"]

❌ Misses: "forgot credentials", "login help", "account recovery"

Vector Search (Semantic)

🧠 Query Embedding: [0.75, 0.55, 0.35, ...]

Finds semantically similar vectors

✅ Matches: "forgot credentials" (0.92), "login help" (0.85), "account recovery" (0.88)
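The keyword-search half of this comparison can be sketched in a few lines of Python. This is a minimal illustration, assuming a naive "every query word must appear verbatim" rule; the `keyword_match` helper and the "password reset instructions" document are hypothetical and added only so the search has one hit.

```python
def keyword_match(query: str, document: str) -> bool:
    """Return True only if every query word appears verbatim in the document."""
    doc_words = set(document.lower().split())
    return all(word in doc_words for word in query.lower().split())


query = "password reset"
documents = [
    "password reset instructions",  # hypothetical document, added for contrast
    "forgot credentials",
    "login help",
    "account recovery",
]

hits = [doc for doc in documents if keyword_match(query, doc)]
misses = [doc for doc in documents if not keyword_match(query, doc)]

print("hits:", hits)
print("misses:", misses)
```

The three related phrases land in `misses` because they share no words with the query, which is the gap that embedding-based search is meant to close.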

🔢 What is an Embedding?

An embedding is a list of numbers (a vector) that represents the meaning of a piece of text. Texts with similar meanings produce vectors that point in similar directions. For example, OpenAI's text-embedding-ada-002 model maps each input text to a 1536-dimensional vector.

Example: Similarity Search

🔍 Query:

"How do I reset my password?"

Embedding: [0.75, 0.55, 0.35]

Similarity Threshold: 0.70 (scale: 0.0 = all documents, 0.5 = medium, 1.0 = exact match)

📄 Search Results (3 / 3 documents)

Password Reset Guide

How to reset your password: Click forgot password, enter email...

Similarity: 100%

Embedding: [0.8, 0.6, 0.3]

Account Security Tips

Best practices for securing your account: use 2FA, strong passwords...

Similarity: 100%

Embedding: [0.7, 0.5, 0.4]

Recipe: Chocolate Cake

Ingredients: flour, sugar, cocoa powder. Bake at 350°F for 30 minutes...

Similarity: 90%

Embedding: [0.1, 0.2, 0.1]

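💡 Notice:

The "Recipe: Chocolate Cake" document scores lowest (90%) because its meaning is unrelated to the password reset query. The gap looks small here only because these are 3-dimensional toy vectors with all-positive components; real embedding models use hundreds or thousands of dimensions and generally separate unrelated texts more clearly. Vector search ranks documents by meaning rather than by shared words.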
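The scores in this example can be reproduced with a short sketch. The page does not name its similarity measure, so cosine similarity is an assumption here; it is a common choice and happens to match the displayed percentages for these toy vectors. The `cosine_similarity` helper is illustrative, not any particular database's API.

```python
import math


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two vectors: dot product over the product of norms."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Toy 3-dimensional vectors taken from the example above.
query_embedding = [0.75, 0.55, 0.35]
documents = {
    "Password Reset Guide": [0.8, 0.6, 0.3],
    "Account Security Tips": [0.7, 0.5, 0.4],
    "Recipe: Chocolate Cake": [0.1, 0.2, 0.1],
}
threshold = 0.70

scores = {
    title: cosine_similarity(query_embedding, vector)
    for title, vector in documents.items()
}

# Keep documents at or above the threshold, best match first.
results = sorted(
    ((title, score) for title, score in scores.items() if score >= threshold),
    key=lambda item: item[1],
    reverse=True,
)

for title, score in results:
    print(f"{title}: {score:.0%}")
```

With the threshold at 0.70 all three documents pass; raising it to 0.95 would drop the cake recipe, which scores about 0.90.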

🗄️ Popular Vector Databases

⚡ Pinecone: Fully managed, serverless

• Easy to set up
• Auto-scales
• Pay-as-you-go

🐘 pgvector: PostgreSQL extension

• Use existing Postgres
• No new infrastructure
• Good for small/medium scale

🔷 Weaviate: Open-source, GraphQL API

• Built-in vectorization
• Hybrid search
• Self-hosted or cloud

🌟 Chroma: Lightweight, Python-first

• Easy local development
• LangChain integration
• Great for prototyping

🔄 RAG: Retrieval-Augmented Generation

Vector databases power RAG systems—the most common pattern for giving AI agents long-term memory.

1. User asks a question: "What's our refund policy?"

2. Embed the question: embed("What's our refund policy?") → [0.12, 0.45, ...]

3. Search vector DB: find the top 3 most similar documents

4. Inject into LLM prompt: "Answer using these docs: [retrieved context]"

5. Generate grounded answer: "According to our policy, refunds are available within 30 days..."
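The retrieval and prompt-building steps above can be sketched end to end in plain Python. Everything here is an assumption made for illustration: `embed` is a toy word-count stand-in for a real embedding model (so it matches shared words, not synonyms), the in-memory `index` list stands in for a vector database, the sample documents are hypothetical, and the final LLM call is left out.

```python
import math
import re
from collections import Counter


def embed(text: str) -> Counter:
    """Toy stand-in for an embedding model: lowercase word counts."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine_similarity(a: Counter, b: Counter) -> float:
    dot = sum(a[word] * b[word] for word in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(question: str, index: list[tuple[str, Counter]], top_k: int = 3) -> list[str]:
    """Embed the question and return the top_k most similar documents."""
    q = embed(question)
    ranked = sorted(index, key=lambda item: cosine_similarity(q, item[1]), reverse=True)
    return [doc for doc, _ in ranked[:top_k]]


def build_prompt(question: str, context: list[str]) -> str:
    """Inject the retrieved documents into the prompt sent to the LLM."""
    docs = "\n".join(f"- {doc}" for doc in context)
    return f"Answer using these docs:\n{docs}\n\nQuestion: {question}"


# Hypothetical knowledge base; a real system would store these in a vector DB.
documents = [
    "Refund policy: refunds are available within 30 days.",
    "Shipping policy: orders ship within 2 business days.",
    "Password reset: click forgot password, then enter your email.",
]
index = [(doc, embed(doc)) for doc in documents]

question = "What's our refund policy?"
context = retrieve(question, index, top_k=2)
prompt = build_prompt(question, context)
print(prompt)  # this string would be passed to the LLM to generate the answer
```

Swapping the toy `embed` for a real embedding model and the list for a vector database changes the quality and scale of retrieval, but the shape of the pipeline stays the same.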

💡 Key Insight

Vector databases enable agents to search by meaning, not just keywords. This is how ChatGPT with plugins, Notion AI, and customer support bots can answer questions about your specific documents—they retrieve relevant context, then generate answers.
