TECHNOLOGIES

Hallucination

When an LLM produces fluent, confident output that is simply false, which is a direct consequence of how the model works and cannot be fully eliminated, only reduced.

Last reviewed: 2026-06-02 byKevin Riedl wiki ↗

A hallucination is when a model generates something that sounds right and is wrong: an invented citation, a made-up API, a plausible but false fact. It is not a glitch you can patch out. An LLM predicts likely text, and a fluent, confident-sounding answer is statistically likely whether or not it happens to be true. The model has no internal sense of “I do not know,” so it fills the gap with something that fits the pattern.

Because it is structural, the honest framing is risk reduction, not elimination. The biggest lever is grounding: give the model the actual facts at runtime via retrieval (RAG ) so it answers from real source text instead of its training-time guess. Note that fine-tuning is the wrong lever here, since it shapes behaviour rather than supplying facts. Constrain output formats so there is less room to improvise. Ask the model to cite sources you can check. And critically, evaluate, build a test set of real questions, measure how often the system is wrong, and treat that number as a quality metric you track like any other.

This is where AI meets QA, and most teams skip it. Shipping an LLM feature without an evaluation harness is shipping untested code and calling it done. You need to know your failure rate before your users find it for you. We treat that as non-negotiable under Software Quality Assurance .

Worked example of why the evaluation harness is non-negotiable: a team ships a legal-document assistant after testing it on a dozen questions by hand, all of which looked great. In production it confidently cites a clause that does not exist in the uploaded contract, a user acts on it, and now there is a real liability. The harness that would have caught this is unglamorous: a few hundred real questions with known-correct answers, run on every change, producing a single number, the percentage the system got wrong. Without it you do not know your failure rate, which means your users discover it for you, one bad answer at a time. With it, you can decide whether the rate is acceptable for the stakes before you ship.

The trust angle is the whole game in regulated or high-stakes domains. A hallucination in a chatbot that recommends a movie is a shrug. The same hallucination in financial, legal, or medical output is a liability. Match the guardrails to the cost of being wrong, and never let a fluent answer substitute for a verified one.

Why do LLMs hallucinate? +

Because they predict likely text, not verified truth. A fluent, confident answer is statistically probable whether or not it is correct, and the model has no built-in signal for ‘I do not know,’ so it fills the gap.

Can hallucinations be fixed completely? +

No. They are structural to how LLMs work. You reduce them with grounding (retrieval), constrained outputs, source citations, and evaluation, but a residual rate always remains. Manage it, do not assume it is gone.

How do you reduce hallucinations in production? +

Ground answers in real data via RAG, constrain the output format, require checkable citations, and run an evaluation harness that measures your failure rate on real questions. Match guardrail strength to the cost of being wrong.

FAQs