Naive RAG

aka Retrieval-Augmented Generation, Top-K Retrieve-and-Stuff

category: retrieval · status: mature

Intent

Condition the generator on top-k chunks retrieved from an external dense index so knowledge lives outside parameters.

Context

The agent needs information that lives in a corpus too large to fit in context, and that may change without retraining.

Problem

Parametric LMs hallucinate, cannot cite, and cannot be updated without retraining; query-time external knowledge is needed.

Forces

Solution

Chunk the corpus. Embed each chunk with a dense encoder. At query time, embed the query, retrieve top-k by similarity, prepend chunks to the prompt, generate. The simplest production RAG pipeline.

Constrains

The generator may use only retrieved chunks plus its parametric memory; the retrieval set is the boundary.

Consequences

Benefits

Liabilities

Known Uses

Related Patterns

References