Merge pull request #487 from weisser-dev/api-endpoints

add ollama api proxy support to api doc
Tim Jaeryang Baek 2025-04-10 11:30:57 -07:00 committed by GitHub
commit 75b5971eba


@@ -66,6 +66,39 @@ To ensure secure access to the API, authentication is required 🛡️. You can
    return response.json()
```
### 🦙 Ollama API Proxy Support
If you want to interact directly with Ollama models—including for embedding generation or raw prompt streaming—Open WebUI offers a transparent passthrough to the native Ollama API via a proxy route.
- **Base URL**: `/ollama/<api>`
- **Reference**: [Ollama API Documentation](https://github.com/ollama/ollama/blob/main/docs/api.md)
#### 🔁 Generate Completion (Streaming)
```bash
curl http://localhost:3000/ollama/api/generate -d '{
"model": "llama3.2",
"prompt": "Why is the sky blue?"
}'
```
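By default this endpoint streams its output as newline-delimited JSON, one chunk per line, with the generated text in each chunk's `response` field and a final chunk marked `"done": true` (see the Ollama API reference linked above). As a minimal sketch, the helper below joins such a stream into the full completion; the commented usage assumes Open WebUI on `localhost:3000`, a valid API key, and the `requests` library, none of which are prescribed by this doc.

```python
import json

def accumulate_stream(lines):
    """Join the 'response' fragments from an Ollama NDJSON stream."""
    parts = []
    for line in lines:
        if not line:
            continue  # skip keep-alive blank lines
        chunk = json.loads(line)
        parts.append(chunk.get("response", ""))
        if chunk.get("done"):
            break
    return "".join(parts)

# Hypothetical usage with the `requests` library (not part of this doc):
# resp = requests.post(
#     "http://localhost:3000/ollama/api/generate",
#     headers={"Authorization": "Bearer YOUR_API_KEY"},
#     json={"model": "llama3.2", "prompt": "Why is the sky blue?"},
#     stream=True,
# )
# print(accumulate_stream(resp.iter_lines()))
```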
#### 📦 List Available Models
```bash
curl http://localhost:3000/ollama/api/tags
```
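The tags endpoint returns a JSON object with a `models` array (per the Ollama API reference linked above); a small sketch for pulling out just the model names from that body:

```python
import json

def model_names(tags_body):
    """Extract model names from an /api/tags response body (str or dict)."""
    data = json.loads(tags_body) if isinstance(tags_body, str) else tags_body
    return [m["name"] for m in data.get("models", [])]
```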
#### 🧠 Generate Embeddings
```bash
curl -X POST http://localhost:3000/ollama/api/embed -d '{
"model": "llama3.2",
"input": ["Open WebUI is great!", "Let's generate embeddings."]
}'
```
This is ideal for building search indexes, retrieval systems, or custom pipelines that use Ollama models behind Open WebUI.
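As a sketch of the search-index idea, once you have embedding vectors back from the endpoint above you can rank documents by cosine similarity to a query vector. The helpers below are illustrative only and make no assumptions about the embedding dimensionality:

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def rank(query_vec, doc_vecs):
    """Return document indices sorted by similarity to the query, best first."""
    scores = [cosine(query_vec, v) for v in doc_vecs]
    return sorted(range(len(doc_vecs)), key=lambda i: scores[i], reverse=True)
```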
### 🧩 Retrieval Augmented Generation (RAG)
The Retrieval Augmented Generation (RAG) feature allows you to enhance responses by incorporating data from external sources. Below, you will find the methods for managing files and knowledge collections via the API, and how to use them in chat completions effectively.