mirror of
https://github.com/open-webui/docs
synced 2025-06-10 00:27:43 +00:00
Merge pull request #477 from vox-hunter/main
Added Documentation for Mistral OCR and reorganized a bit.
This commit is contained in:
commit
b1315825f8
@ -1,4 +1,18 @@
|
||||
---
|
||||
sidebar_position: 1
|
||||
title: "😏 Chat Features"
|
||||
---
|
||||
title: "💬 Chat Features"
|
||||
---
|
||||
|
||||
# Chat Features Overview
|
||||
|
||||
Open WebUI provides a comprehensive set of chat features designed to enhance your interactions with AI models. This page provides an overview of the key chat capabilities, with links to dedicated pages for more detailed information.
|
||||
|
||||
## Core Chat Features
|
||||
|
||||
- **[🗂️ Conversation Organization](./conversation-organization.md)**: Organize chats with folders and tags to keep your workspace tidy and structured.
|
||||
|
||||
- **[🔗 URL Parameters](./url-params.md)**: Configure chat sessions through URL parameters, enabling quick setup of models, tools, and other features.
|
||||
|
||||
- **[⚙️ Chat Parameters](./chat-params.md)**: Control system prompts and advanced parameters at different levels (per-chat, per-account, or per-model).
|
||||
|
||||
- **[🗨️ Chat Sharing](./chatshare.md)**: Share conversations locally or via the Open WebUI Community platform with controllable privacy settings.
|
||||
|
@ -3,4 +3,14 @@ sidebar_position: 5
|
||||
title: "🐍 Code Execution"
|
||||
---
|
||||
|
||||
COMING SOON!
|
||||
Open WebUI offers powerful code execution capabilities directly within your chat interface, enabling you to transform ideas into actionable results without leaving the platform.
|
||||
|
||||
## Key Features
|
||||
|
||||
- **Python Code Execution**: Run Python scripts directly in your browser using Pyodide, with support for popular libraries like pandas and matplotlib no setup required.
|
||||
|
||||
- **MermaidJS Rendering**: Create and visualize flowcharts, diagrams, and other visual representations with MermaidJS syntax that automatically renders in your chat.
|
||||
|
||||
- **Interactive Artifacts**: Generate and interact with rich content like HTML websites, SVG graphics, and JavaScript visualizations directly within your conversations.
|
||||
|
||||
These execution capabilities bridge the gap between conversation and implementation, allowing you to explore ideas, analyze data, and create visual content seamlessly while chatting with AI models.
|
29
docs/features/document-extraction/index.md
Normal file
29
docs/features/document-extraction/index.md
Normal file
@ -0,0 +1,29 @@
|
||||
---
|
||||
sidebar_position: 6
|
||||
title: "📄 Document Extraction"
|
||||
---
|
||||
|
||||
# Document Extraction in Open WebUI
|
||||
|
||||
Open WebUI provides powerful document extraction capabilities that allow you to process and analyze various types of documents within your RAG (Retrieval Augmented Generation) workflows. Document extraction is essential for transforming unstructured document content into structured data that can be effectively used by language models.
|
||||
|
||||
## What is Document Extraction?
|
||||
|
||||
Document extraction refers to the process of automatically identifying and extracting text and data from various file formats, including:
|
||||
- PDFs (both text-based and scanned)
|
||||
- Images containing text
|
||||
- Handwritten documents
|
||||
- And more
|
||||
|
||||
With proper document extraction, Open WebUI can help you:
|
||||
- Convert image-based documents to searchable text
|
||||
- Preserve document structure and layout information
|
||||
- Extract data in structured formats for further processing
|
||||
- Support multilingual content recognition
|
||||
|
||||
## Available Extraction Methods
|
||||
|
||||
Open WebUI supports multiple document extraction engines to accommodate different needs and document types. Each extraction method has its own strengths and is suitable for different scenarios.
|
||||
|
||||
Explore the documentation for each available extraction method to learn how to set it up and use it effectively with your Open WebUI instance.
|
||||
|
52
docs/features/document-extraction/mistral-ocr.md
Normal file
52
docs/features/document-extraction/mistral-ocr.md
Normal file
@ -0,0 +1,52 @@
|
||||
---
|
||||
sidebar_position: 4000
|
||||
title: "👁️ Mistral OCR"
|
||||
---
|
||||
|
||||
:::warning
|
||||
This tutorial is a community contribution and is not supported by the Open WebUI team. It serves only as a demonstration on how to customize Open WebUI for your specific use case. Want to contribute? Check out the contributing tutorial.
|
||||
:::
|
||||
|
||||
## 👁️ Mistral OCR
|
||||
|
||||
This documentation provides a step-by-step guide to integrating Mistral OCR with Open WebUI. Mistral OCR is an optical character recognition library designed to extract text from a variety of image-based file formats—including scanned PDFs, images, and handwritten documents—into structured data such as JSON or plain text. With advanced support for multilingual text recognition, layout analysis, and handwriting interpretation, Mistral OCR simplifies the process of digitizing and processing documents for AI applications like search, summarization, and data extraction, all through a robust and customizable interface.
|
||||
|
||||
Prerequisites
|
||||
------------
|
||||
|
||||
* Open WebUI instance
|
||||
* Mistral AI account
|
||||
|
||||
Integration Steps
|
||||
----------------
|
||||
|
||||
### Step 1: Sign Up or Login to Mistral AI console
|
||||
|
||||
* Go to `https://console.mistral.ai`
|
||||
* Follow the instructions as instructed on the process
|
||||
* After successful authorization, you should be welcomed to the Console Home
|
||||
|
||||
### Step 2: Generate an API key
|
||||
|
||||
* Go to `API Keys` or `https://console.mistral.ai/api-keys`
|
||||
* Create a new key and make sure to copy it
|
||||
|
||||
### Step 3: Configure OpenWebUI to use Mistral OCR
|
||||
|
||||
* Log in to your Open WebUI instance.
|
||||
* Navigate to the `Admin Panel` settings menu.
|
||||
* Click on `Settings`.
|
||||
* Click on the `Documents` tab.
|
||||
* Change the `Default` content extraction engine dropdown to `Mistral OCR`.
|
||||
* Paste the API Key on the field
|
||||
* Save the Admin Panel.
|
||||
|
||||
Verifying Mistral OCR
|
||||
=====================================
|
||||
|
||||
To verify that Mistral OCR is working correctly in script, please refer to `https://docs.mistral.ai/capabilities/document/`
|
||||
|
||||
|
||||
### Conclusion
|
||||
|
||||
Integrating Mistral OCR with Open WebUI is a simple and effective way to enhance document processing and content extraction capabilities. By following the steps in this guide, you can set up Mistral OCR as the default extraction engine and leverage its advanced text recognition features. Once configured, Mistral OCR enables powerful, multilingual document parsing with support for various formats, enhancing AI-driven document analysis capabilities in Open WebUI.
|
@ -33,6 +33,8 @@ import { TopBanners } from "@site/src/components/TopBanners";
|
||||
|
||||
- 📚 **Local and Remote RAG Integration**: Dive into the future of chat interactions and explore your documents with our cutting-edge Retrieval Augmented Generation (RAG) technology within your chats. Documents can be loaded into the `Documents` tab of the Workspace, after which they can be accessed using the pound key [`#`] before a query, or by starting the prompt with the pound key [`#`], followed by a URL for webpage content integration.
|
||||
|
||||
- 📄 **Document Extraction**: Extract text and data from various document formats including PDFs, Word documents, Excel spreadsheets, PowerPoint presentations, and more. Our advanced document processing capabilities enable seamless integration with your knowledge base, allowing for accurate retrieval and generation of information from complex documents while preserving their structure and formatting.
|
||||
|
||||
- 🔍 **Web Search for RAG**: You can perform web searches using a selection of various search providers and inject the results directly into your local Retrieval Augmented Generation (RAG) experience.
|
||||
|
||||
- 🌐 **Web Browsing Capabilities**: Integrate websites seamlessly into your chat experience by using the `#` command followed by a URL. This feature enables the incorporation of web content directly into your conversations, thereby enhancing the richness and depth of your interactions.
|
||||
|
Loading…
Reference in New Issue
Block a user