TECHNOLOGIES

Vector Database

A database that stores text as numeric vectors so you can search by meaning instead of exact keywords, which is the retrieval engine most RAG systems run on.

Last reviewed: 2026-06-02 byKevin Riedl wiki β†—

A vector database stores embeddings: numeric representations of text (or images, or audio) where similar meaning maps to nearby points in space. Instead of matching exact keywords, you embed the user’s query the same way and ask the database for the closest vectors. That is similarity search, and it is what lets a system find “how do I cancel my plan” when the document actually says “subscription termination procedure.”

In a RAG pipeline this is the retrieval layer. The quality of your answers depends heavily on it: good embeddings and good search return the right chunks to feed the model, bad ones feed garbage and the model confidently summarizes garbage. This is why retrieval, not the model, is usually where RAG projects succeed or fail.

Here is the part vendors skip: you often do not need a dedicated vector database. If your corpus is small (thousands, not millions of chunks), a vector extension on the Postgres you already run (pgvector) is simpler, cheaper, and one less system to operate. If your search is mostly keyword-driven, plain full-text search may beat vector search outright. Reach for a specialized vector DB when scale, latency, or hybrid search at high volume actually justify it, not because it is on the architecture diagram.

We pick the boring-enough option deliberately under Artificial Intelligence, because every extra system is an extra thing to keep alive at 3am.

// FAQ

FAQs

FAQs

Searching by meaning rather than exact keywords. Text is stored as embeddings (numeric vectors), and queries return the closest matches. It is the retrieval layer most RAG systems rely on.
Often no. For small corpora, a Postgres vector extension like pgvector is simpler and cheaper. For keyword-heavy search, full-text search may win. Reach for a specialized vector DB only when scale, latency, or hybrid search justify it.
Numeric representations of text where similar meaning lands at nearby points in vector space. They let a system match a query to relevant content even when the wording is completely different.