Retrieval-Augmented Generation

What is it?

A practice used when building AI applications to help address two fundamental problems with LLMs:

It’s a system architecture pattern that introduces a separate, well-maintained data store alongside your application and injects relevant information from that store into the prompts built from a user’s questions. It’s usually entirely transparent to the end user, and consists of two main parts:
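
As a rough illustration of the pattern described above, here is a minimal Python sketch, not a definitive implementation. Everything in it is an assumption made for the example: the in-memory DOCUMENTS list stands in for the data store, word-overlap cosine similarity stands in for whatever search the real store provides, and the names tokenize, retrieve, and build_prompt are hypothetical. The call to an actual LLM is left out; the sketch stops at the prompt that would be sent.

```python
import math
import re
from collections import Counter

# Hypothetical in-memory "data store" (an assumption for this sketch).
# A real system would more likely use a search index or vector database
# that is maintained separately from the application.
DOCUMENTS = [
    "The support desk is open Monday to Friday, 9am to 5pm.",
    "Refunds are processed within 14 days of receiving the returned item.",
    "Premium accounts include priority support and extended storage.",
]


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine_similarity(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(question, documents, k=1):
    """Return the k documents whose wording best matches the question."""
    query = Counter(tokenize(question))
    ranked = sorted(
        documents,
        key=lambda doc: cosine_similarity(query, Counter(tokenize(doc))),
        reverse=True,
    )
    return ranked[:k]


def build_prompt(question, documents, k=1):
    """Inject the retrieved documents into the prompt ahead of the question."""
    context = "\n".join(f"- {doc}" for doc in retrieve(question, documents, k))
    return (
        "Answer the question using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}"
    )


if __name__ == "__main__":
    # The user only types the question; the retrieval step is invisible to them.
    print(build_prompt("How long do refunds take?", DOCUMENTS))
```

The user only ever supplies the question. The lookup against the data store and the extra context in the prompt happen behind the scenes, which is why the pattern is usually transparent to the end user.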
