From cd5e2708476e2d09cd7bf2daf2f714e7baa27eee Mon Sep 17 00:00:00 2001 From: Timothy Jaeryang Baek Date: Wed, 26 Mar 2025 21:33:28 -0700 Subject: [PATCH] Update rag.md --- docs/features/rag.md | 10 ++++++++++ 1 file changed, 10 insertions(+) diff --git a/docs/features/rag.md b/docs/features/rag.md index a435c96..c891dca 100644 --- a/docs/features/rag.md +++ b/docs/features/rag.md @@ -3,10 +3,20 @@ sidebar_position: 11 title: "🔎 Retrieval Augmented Generation (RAG)" --- +:::warning + +If you're using **Ollama**, note that it **defaults to a 2048-token context length**. This means that retrieved data may **not be used at all** because it doesn’t fit within the available context window. To improve **Retrieval-Augmented Generation (RAG) performance**, you should **increase the context length** to **8192+ tokens** in your Ollama model settings. + +::: + + Retrieval Augmented Generation (RAG) is a cutting-edge technology that enhances the conversational capabilities of chatbots by incorporating context from diverse sources. It works by retrieving relevant information from a wide range of sources such as local and remote documents, web content, and even multimedia sources like YouTube videos. The retrieved text is then combined with a predefined RAG template and prefixed to the user's prompt, providing a more informed and contextually relevant response. One of the key advantages of RAG is its ability to access and integrate information from a variety of sources, making it an ideal solution for complex conversational scenarios. For instance, when a user asks a question related to a specific document or web page, RAG can retrieve and incorporate the relevant information from that source into the chat response. RAG can also retrieve and incorporate information from multimedia sources like YouTube videos. By analyzing the transcripts or captions of these videos, RAG can extract relevant information and incorporate it into the chat response. + + + ## Local and Remote RAG Integration Local documents must first be uploaded via the Documents section of the Workspace area to access them using the `#` symbol before a query. Click on the formatted URL in the that appears above the chat box. Once selected, a document icon appears above `Send a message`, indicating successful retrieval.