Retrieval-Augmented Generation

What is it?

A practice used when building AI applications to help address two fundamental problems with LLMs:

Spouting off answers with no source material to back them up
Giving out-of-date answers to queries

It’s a system architecture pattern that introduces a separate, well-maintained data store alongside your application and injects relevant information from that store into the prompt built from a user’s question. It’s usually entirely transparent to the end user, and consists of two main parts:

The well-structured, up-to-date data store
The retriever that will pull relevant information from the data store and inject it into the prompt going to the LLM
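
The two parts above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `Document` class, the `DATA_STORE` contents, and the `retrieve` and `build_prompt` helpers are all hypothetical names, and simple word overlap stands in for whatever search or embedding similarity a real retriever would use. The call to the LLM itself is left out; the sketch stops at the prompt that would be sent.

```python
from dataclasses import dataclass


@dataclass
class Document:
    source: str  # where the text came from, so answers can cite it
    text: str


# Part 1: the data store. A real system would likely use a search index or
# vector database; a plain list with made-up entries stands in for it here.
DATA_STORE = [
    Document("handbook.md", "Refunds are processed within 5 business days."),
    Document("release-notes.md", "Version 2.0 adds single sign-on support."),
    Document("faq.md", "Support is available by email on weekdays."),
]


def tokenize(text: str) -> set[str]:
    """Lowercase the text and split it into a set of words, trimming punctuation."""
    return {word.strip(".,?!").lower() for word in text.split()} - {""}


# Part 2: the retriever. Scoring by shared words is an assumption made for
# brevity; it is a stand-in for embedding similarity or keyword search.
def retrieve(query: str, store: list[Document], top_k: int = 1) -> list[Document]:
    query_words = tokenize(query)
    scored = [(len(query_words & tokenize(doc.text)), doc) for doc in store]
    # Keep only documents that share at least one word, best match first.
    matches = [pair for pair in scored if pair[0] > 0]
    matches.sort(key=lambda pair: pair[0], reverse=True)
    return [doc for _, doc in matches[:top_k]]


def build_prompt(query: str, store: list[Document]) -> str:
    """Inject the retrieved passages into the prompt going to the LLM."""
    context = "\n".join(f"[{doc.source}] {doc.text}" for doc in retrieve(query, store))
    return (
        "Answer using only the context below and cite the source.\n"
        f"Context:\n{context}\n"
        f"Question: {query}"
    )


prompt = build_prompt("How long do refunds take?", DATA_STORE)
print(prompt)
```

Because each passage carries its source label into the prompt, the model has something to cite, and keeping the store current is what keeps the answers current. The user only ever sees the question and the answer.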

Additional References

What is Retrieval-Augmented Generation (RAG) - IBM YouTube

Other Related Topics

System Architecture
Development
Technology

References

Data Cloud
Daily Learnings: Mon, Sep 09, 2024
Agentforce
TDX 2025 - Day 2 (Part 2)
