TECHNOLOGIES

Vector Database

A database that stores text as numeric vectors so you can search by meaning instead of exact keywords, which is the retrieval engine most RAG systems run on.

Last reviewed: 2026-06-02 byKevin Riedl wiki ↗

A vector database stores embeddings: numeric representations of text (or images, or audio) where similar meaning maps to nearby points in space. Instead of matching exact keywords, you embed the user’s query the same way and ask the database for the closest vectors. That is similarity search, and it is what lets a system find “how do I cancel my plan” when the document actually says “subscription termination procedure.”

In a RAG pipeline this is the retrieval layer. The quality of your answers depends heavily on it: good embeddings and good search return the right chunks to feed the LLM , bad ones feed garbage and the model confidently summarizes garbage. This is why retrieval, not the model, is usually where RAG projects succeed or fail.

Here is the part vendors skip: you often do not need a dedicated vector database. If your corpus is small (thousands, not millions of chunks), a vector extension on the Postgres you already run (pgvector) is simpler, cheaper, and one less system to operate. If your search is mostly keyword-driven, plain full-text search may beat vector search outright. Reach for a specialized vector DB when scale, latency, or hybrid search at high volume actually justify it, not because it is on the architecture diagram.

Worked example of over-engineering this: a team building an internal doc assistant over a few thousand pages reaches for a managed vector database, a separate embedding pipeline, and a reranking service before they have a single user. They now operate four systems to answer questions that pgvector on their existing Postgres would have handled, and every one is a new thing to monitor, secure, and pay for. The boring version ships in a week and scales fine until the corpus is genuinely large. Reach for the specialised database when the numbers force it (millions of vectors, strict latency budgets, high-volume hybrid search), not because the architecture diagram looks more serious with it.

We pick the boring-enough option deliberately under Artificial Intelligence , because every extra system is an extra thing to keep alive at 3am.

What is a vector database used for? +

Searching by meaning rather than exact keywords. Text is stored as embeddings (numeric vectors), and queries return the closest matches. It is the retrieval layer most RAG systems rely on.

Do I need a dedicated vector database? +

Often no. For small corpora, a Postgres vector extension like pgvector is simpler and cheaper. For keyword-heavy search, full-text search may win. Reach for a specialized vector DB only when scale, latency, or hybrid search justify it.

What are embeddings? +

Numeric representations of text where similar meaning lands at nearby points in vector space. They let a system match a query to relevant content even when the wording is completely different.

FAQs