Merge pull request #476 from vox-hunter/main

Update docling.md
This commit is contained in:
Timothy Jaeryang Baek 2025-04-06 21:46:45 -07:00 committed by GitHub
commit 7d33980ee9
No known key found for this signature in database
GPG Key ID: B5690EEEBB952194

View File

@ -9,7 +9,7 @@ This tutorial is a community contribution and is not supported by the Open WebUI
## 🐤 Docling Document Extraction
This documentation provides a step-by-step guide to integrating Apache Tika with Open WebUI. Docling is a document processing library designed to transform a wide range of file formats—including PDFs, Word documents, spreadsheets, HTML, and images—into structured data such as JSON or Markdown. With built-in support for layout detection, table parsing, and language-aware processing, Docling streamlines document preparation for AI applications like search, summarization, and retrieval-augmented generation, all through a unified and extensible interface.
This documentation provides a step-by-step guide to integrating Docling with Open WebUI. Docling is a document processing library designed to transform a wide range of file formats—including PDFs, Word documents, spreadsheets, HTML, and images—into structured data such as JSON or Markdown. With built-in support for layout detection, table parsing, and language-aware processing, Docling streamlines document preparation for AI applications like search, summarization, and retrieval-augmented generation, all through a unified and extensible interface.
Prerequisites
------------