AI hallucination

An AI hallucination is output that a language model states with confidence but that is actually false: a fabricated citation, a wrong fact, or an invented API.

Hallucination is the most-cited failure mode of language models. The text sounds authoritative but is factually wrong: a made-up book title, a fake court citation, a confident answer about an API that doesn't exist.

Why it happens: LLMs are trained to produce plausible next tokens, not true next tokens. The objective rewards fluency. When the model doesn't know something, it tends to interpolate from patterns in its training data rather than say "I don't know."

The risk is uneven. Hallucinations show up most often when:

  • The model is asked about specific facts (names, dates, numbers, citations).
  • The model is asked about recent events or anything past its knowledge cut-off.
  • The model is pushed beyond its training distribution.
  • The prompt is leading or contains incorrect premises.

The risk is lowest when:

  • The model is reasoning over text provided in the prompt itself.
  • The model is asked to summarize, translate, or transform structured content.
  • The model has a "say I don't know" instruction in the system prompt.
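
That last instruction is cheap to add. Below is a minimal sketch using the Anthropic Python SDK; the model name is a placeholder and the wording of the instruction is one reasonable option, not a canonical recipe.

    import anthropic

    # An explicit "say I don't know" instruction in the system prompt.
    SYSTEM_PROMPT = (
        "Answer only when you are confident in the facts. "
        "If you are unsure, reply with \"I don't know\" and say what "
        "information you would need, instead of guessing."
    )

    client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment

    response = client.messages.create(
        model="claude-sonnet-4-20250514",  # placeholder: use whatever model you deploy
        max_tokens=512,
        system=SYSTEM_PROMPT,
        messages=[{"role": "user", "content": "Who won the 2031 World Cup?"}],
    )
    print(response.content[0].text)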

Practical mitigations: ground every factual answer in retrieved source documents (RAG), require the model to cite the specific sources it used, lower the sampling temperature for factual tasks, and use stronger frontier models, since hallucination rates have been falling substantially with each model generation. Anthropic's Claude and OpenAI's o-series reasoning models have notably lower hallucination rates than earlier-generation models.
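
A sketch of how the first three mitigations fit together. The retrieve() function and the passage format are hypothetical stand-ins for whatever search index you already have, and the model name is again a placeholder.

    import anthropic

    client = anthropic.Anthropic()

    def retrieve(query: str) -> list[dict]:
        """Hypothetical retriever: return passages from your own index,
        e.g. [{"id": "doc-12", "text": "..."}]."""
        raise NotImplementedError

    def grounded_answer(question: str) -> str:
        passages = retrieve(question)
        # Ground the answer: put the retrieved text in the prompt itself.
        context = "\n\n".join(f"[{p['id']}] {p['text']}" for p in passages)
        prompt = (
            f"Context passages:\n{context}\n\n"
            f"Question: {question}\n\n"
            "Answer using only the passages above, and cite the passage id "
            "(e.g. [doc-12]) after every claim. If the passages do not "
            "contain the answer, say so instead of guessing."
        )
        response = client.messages.create(
            model="claude-sonnet-4-20250514",  # placeholder model name
            max_tokens=512,
            temperature=0,  # lower temperature for factual tasks
            messages=[{"role": "user", "content": prompt}],
        )
        return response.content[0].text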

The single biggest UX lesson: surface uncertainty. A product that says "I'm not sure, but here are three relevant docs" always beats one that confidently fabricates an answer.
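
One way to surface that uncertainty in code, continuing the sketch above. The citation-marker check and the passage fields are assumptions about your own data, not a standard.

    def render_response(answer: str, passages: list[dict]) -> str:
        """Hypothetical UI layer: show the answer only when it cites a
        retrieved passage; otherwise admit uncertainty and link the sources."""
        cited = any(f"[{p['id']}]" in answer for p in passages)
        if cited:
            return answer
        top = "\n".join(f"  • {p['id']}: {p['text'][:80]}" for p in passages[:3])
        return "I'm not sure about this one, but these documents look relevant:\n" + top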

FAQ

Can hallucinations be eliminated entirely?

Not currently. They can be reduced 10-100x with grounding, RAG, and better models, but a non-zero rate remains. Production systems handle this with citation requirements, confidence thresholds, and human review.

Which model hallucinates the least?

On public hallucination benchmarks, Claude (Anthropic) and OpenAI's o-series reasoning models consistently rank among the lowest. But the right answer is always "it depends on the task": measure on your own workload.
