Retrieval-Augmented Generation
Also known as: RAG
A technique that enhances AI model outputs by retrieving relevant information from external knowledge sources and incorporating it into the model's context before generating a response.
Overview
Retrieval-Augmented Generation (RAG) is one of the most important patterns in modern AI context management. It addresses a fundamental limitation of language models: their knowledge is frozen at training time. RAG solves this by dynamically retrieving relevant documents from external sources and injecting them into the model's context at inference time.
How RAG Works
1. Query Processing: The user's query is analyzed to understand intent and extract key concepts
2. Retrieval: Relevant documents or passages are retrieved from a knowledge base, typically using vector similarity search
3. Augmentation: Retrieved content is formatted and added to the model's input prompt as additional context
4. Generation: The language model generates a response informed by both its training knowledge and the retrieved documents
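The four steps above can be sketched end to end. This is a minimal illustration, not a production pipeline: the "embedding" here is just a bag-of-words term count, whereas real systems use dense vectors from a neural embedding model, and the final generation step is left as a comment because it requires an actual language model.

```python
import math
import re
from collections import Counter

def embed(text):
    # Toy "embedding": a bag-of-words term-frequency vector.
    # Real RAG systems use dense vectors from a neural embedding model.
    return Counter(re.findall(r"\w+", text.lower()))

def cosine(a, b):
    dot = sum(a[t] * b[t] for t in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def retrieve(query, docs, k=2):
    # Step 2: rank documents by similarity to the query, keep the top k.
    q = embed(query)
    return sorted(docs, key=lambda d: cosine(q, embed(d)), reverse=True)[:k]

def augment(query, passages):
    # Step 3: format retrieved passages into the model's input prompt.
    context = "\n".join(f"- {p}" for p in passages)
    return f"Context:\n{context}\n\nQuestion: {query}\nAnswer:"

docs = [
    "RAG retrieves relevant documents before generation.",
    "The Eiffel Tower is in Paris.",
    "Fine-tuning updates a model's weights on new data.",
]
query = "How does RAG use retrieved documents?"
prompt = augment(query, retrieve(query, docs))
# Step 4 would send `prompt` to the language model for generation.
```

Step 1 (query processing) is implicit here in the tokenization inside `embed`; more sophisticated pipelines rewrite or expand the query before retrieval.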
Benefits of RAG
- Current Information: Access to up-to-date information beyond training data
- Domain Specificity: Grounding responses in organization-specific knowledge
- Verifiability: Sources can be cited, enabling fact-checking
- Cost Efficiency: More economical than fine-tuning for many use cases
- Reduced Hallucination: Grounding responses in retrieved facts reduces fabrication
RAG Architecture Patterns
Naive RAG
The simplest implementation: embed documents, store in a vector database, retrieve top-k results, and prepend to the prompt. This works for many use cases but can struggle with complex queries.
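A naive RAG index can be as small as a dictionary mapping each document to its embedding vector. The three-dimensional vectors and example documents below are made up for illustration; in practice the vectors come from an embedding model and live in a vector database.

```python
import math

# Toy 3-d "embeddings"; a real system would get these from an embedding model
# and store them in a vector database rather than a dict.
index = {
    "Invoices are due within 30 days.": [0.9, 0.1, 0.0],
    "Our API rate limit is 100 requests per minute.": [0.1, 0.9, 0.2],
    "Refunds require a support ticket.": [0.8, 0.2, 0.1],
}

def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb)

def top_k(query_vec, k=2):
    # Retrieve the k documents whose embeddings are nearest the query.
    ranked = sorted(index.items(), key=lambda kv: cosine(query_vec, kv[1]), reverse=True)
    return [doc for doc, _ in ranked[:k]]

# A query vector landing near the "billing" region of the toy space:
hits = top_k([0.85, 0.15, 0.05])
prompt = "\n".join(hits) + "\n\nQ: When are invoices due?"
```

The two billing-related documents win; the rate-limit document is filtered out purely by vector distance, with no keyword matching involved.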
Advanced RAG
Incorporates query rewriting, hybrid search (combining vector and keyword search), re-ranking of retrieved results, and iterative retrieval for multi-step reasoning tasks.
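One common way to combine vector and keyword search is reciprocal rank fusion (RRF), which merges two ranked lists using only ranks, not raw scores. The document IDs and the two input rankings below are placeholders; `k=60` is a conventional smoothing constant.

```python
def rrf(rankings, k=60):
    # Reciprocal Rank Fusion: each list contributes 1/(k + rank) per document,
    # so documents ranked well by both retrievers rise to the top.
    scores = {}
    for ranking in rankings:
        for rank, doc in enumerate(ranking, start=1):
            scores[doc] = scores.get(doc, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=scores.get, reverse=True)

keyword_ranking = ["doc_a", "doc_c", "doc_b"]  # e.g. from BM25
vector_ranking = ["doc_b", "doc_a", "doc_c"]   # e.g. from ANN search
fused = rrf([keyword_ranking, vector_ranking])
```

Here `doc_a` wins because it places well in both lists, even though neither retriever ranked it first. A re-ranking stage (for example, a cross-encoder) would then rescore just the fused top candidates.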
Modular RAG
A flexible architecture where retrieval, ranking, and generation components can be independently configured, swapped, and optimized based on the specific use case.
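The modular idea can be expressed by treating each stage as a plain function that the pipeline composes, so any component can be swapped without touching the others. The component names and trivial keyword-overlap logic below are illustrative, not a standard API.

```python
from typing import Callable, List

def make_pipeline(retrieve: Callable[[str], List[str]],
                  rerank: Callable[[str, List[str]], List[str]],
                  build_prompt: Callable[[str, List[str]], str]) -> Callable[[str], str]:
    # Compose independent retrieval, ranking, and prompt-building stages.
    def run(query: str) -> str:
        return build_prompt(query, rerank(query, retrieve(query)))
    return run

# Trivial stand-in components; each could be replaced independently.
docs = ["alpha fact", "beta fact about cats", "gamma fact about cats and dogs"]
keyword_retriever = lambda q: [d for d in docs if any(w in d for w in q.split())]
overlap_reranker = lambda q, ds: sorted(ds, key=lambda d: sum(w in d for w in q.split()), reverse=True)
simple_prompt = lambda q, ds: "Context: " + " | ".join(ds) + f"\nQ: {q}"

pipeline = make_pipeline(keyword_retriever, overlap_reranker, simple_prompt)
```

Swapping `keyword_retriever` for a vector retriever, or `overlap_reranker` for a cross-encoder, changes one argument and leaves the rest of the pipeline untouched.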
Context Management Considerations
RAG is fundamentally a context management technique. Key challenges include determining how much context to retrieve, how to rank and filter retrieved content, and how to format retrieved context for optimal model comprehension. Poor context management in RAG leads to context pollution, where irrelevant retrieved documents degrade response quality.
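One simple guard against context pollution is to admit retrieved passages only if they clear a relevance threshold and fit a token budget. The threshold value and the rough four-characters-per-token estimate below are illustrative assumptions, not fixed constants; real systems would use the model's actual tokenizer.

```python
def select_context(scored_docs, min_score=0.5, token_budget=50):
    # Keep passages that (a) clear a relevance threshold and
    # (b) fit within the remaining context-token budget.
    used = 0
    kept = []
    for doc, score in sorted(scored_docs, key=lambda p: p[1], reverse=True):
        if score < min_score:
            break  # everything after this is even less relevant
        tokens = max(1, len(doc) // 4)  # rough chars-to-tokens estimate
        if used + tokens > token_budget:
            continue  # skip passages that would overflow the budget
        kept.append(doc)
        used += tokens
    return kept

scored = [
    ("Highly relevant passage about the query topic.", 0.92),
    ("Somewhat related background paragraph.", 0.61),
    ("Off-topic document pulled in by a noisy match.", 0.31),
]
context = select_context(scored)
```

The off-topic passage is dropped by the score cutoff before it can dilute the prompt, which is exactly the pollution failure mode described above.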
Related Terms
Context Window
The maximum amount of text (measured in tokens) that a language model can process in a single interaction, determining how much information the model can consider when generating a response.
Embeddings
Dense numerical vector representations of data (text, images, audio) that capture semantic meaning, enabling similarity comparisons and machine learning operations in a continuous vector space.
Knowledge Base
A structured repository of information, facts, and relationships used by AI systems as a source of context and ground truth for answering queries and making decisions.
Large Language Model
A type of AI model trained on vast amounts of text data that can understand, generate, and manipulate human language, typically based on the transformer architecture with billions of parameters.
Vector Database
A specialized database designed to store, index, and query high-dimensional vector embeddings, enabling efficient similarity search used in RAG systems and AI applications.