Commit Graph

89 Commits

Author SHA1 Message Date
David Tippett
fb30b667e2 First pass at an OpenSearch connector 2024-10-30 15:51:56 -04:00
Timothy Jaeryang Baek
adfce28732
Merge pull request #6543 from execgit/main
Some checks are pending
Deploy to HuggingFace Spaces / check-secret (push) Waiting to run
Deploy to HuggingFace Spaces / deploy (push) Blocked by required conditions
Create and publish Docker images with specific build args / build-main-image (linux/amd64) (push) Waiting to run
Create and publish Docker images with specific build args / build-main-image (linux/arm64) (push) Waiting to run
Create and publish Docker images with specific build args / build-cuda-image (linux/amd64) (push) Waiting to run
Create and publish Docker images with specific build args / build-cuda-image (linux/arm64) (push) Waiting to run
Create and publish Docker images with specific build args / build-ollama-image (linux/amd64) (push) Waiting to run
Create and publish Docker images with specific build args / build-ollama-image (linux/arm64) (push) Waiting to run
Create and publish Docker images with specific build args / merge-main-images (push) Blocked by required conditions
Create and publish Docker images with specific build args / merge-cuda-images (push) Blocked by required conditions
Create and publish Docker images with specific build args / merge-ollama-images (push) Blocked by required conditions
Python CI / Format Backend (3.11) (push) Waiting to run
Frontend Build / Format & Build Frontend (push) Waiting to run
Frontend Build / Frontend Unit Tests (push) Waiting to run
Integration Test / Run Cypress Integration Tests (push) Waiting to run
Integration Test / Run Migration Tests (push) Waiting to run
fix: omit document contents in logging document embedding
2024-10-29 19:17:53 -07:00
execgit
03e9add96d retrieval.utils: omit logging file contents at level INFO 2024-10-29 14:33:37 +02:00
execgit
bc7622c0fe Avoid logging file contents at level INFO
I had problems with document handling in rootless containers. Long documents caused the container to hang. Reducing the verbosity of logging from retrieval.main seemed to fix the issues I was experiencing.
2024-10-29 14:31:47 +02:00
Timothy Jaeryang Baek
b32fbf7c58
Merge pull request #6504 from jeeteshchel/fork/dev
feat: enable bing support
2024-10-28 12:30:11 -07:00
Jeetesh Chellani
f7d8a6ccba feat: enable bing support 2024-10-28 16:51:31 +02:00
Timothy J. Baek
491cff8c63 refac: parse md files with TextLoader 2024-10-28 05:50:56 -07:00
Diwakar Singh Maurya
3d0c32f366 feature to support authentication in chromadb 2024-10-27 14:01:00 +07:00
Timothy J. Baek
f2c78ac0fb refac 2024-10-25 22:23:21 -07:00
Timothy J. Baek
47e4250f58 refac 2024-10-25 22:12:46 -07:00
Timothy J. Baek
076c54c486 refac 2024-10-25 21:47:47 -07:00
Timothy J. Baek
780591e991 refac 2024-10-25 21:46:14 -07:00
Timothy J. Baek
50dcad0f73 fix: tiktoken encoding model issue 2024-10-25 21:38:28 -07:00
Timothy J. Baek
e43e91edd3 refac 2024-10-20 23:45:15 -07:00
Timothy J. Baek
9936583477 chore: format 2024-10-20 18:38:06 -07:00
Timothy J. Baek
4d46bfe03b refac: file table migration 2024-10-20 17:45:37 -07:00
Timothy Jaeryang Baek
c9c79852a5
Merge branch 'dev' into dev 2024-10-17 13:25:39 -07:00
Timothy J. Baek
6336d34b59 fix: web attachment issue 2024-10-17 13:08:10 -07:00
Timothy J. Baek
8eb45acf10 refac 2024-10-14 18:47:41 -07:00
Jannik Streidl
f0f4de59eb Merge branch 'upstream-dev' into dev 2024-10-14 09:50:40 +02:00
Timothy J. Baek
586e005f0f enh: token text splitter support 2024-10-13 04:24:13 -07:00
Timothy J. Baek
8a0da6d376 enh: include file name to context 2024-10-13 03:58:51 -07:00
Timothy J. Baek
797afd0b72 fix: embedding metadata issue 2024-10-13 03:25:11 -07:00
Timothy J. Baek
92605fd59f refac 2024-10-13 03:16:18 -07:00
Timothy J. Baek
5ffd216fca refac 2024-10-13 03:02:02 -07:00
Timothy J. Baek
dff3732fcd enh: tiktoken/token splitter support 2024-10-13 02:07:50 -07:00
Timothy J. Baek
5273dc4535 refac 2024-10-13 00:21:06 -07:00
Timothy J. Baek
333317a7ce refac: add embedding metadata 2024-10-12 18:30:21 -07:00
Jannik Streidl
f47c9c69e3 Merge branch 'upstream-dev' into dev 2024-10-12 15:18:59 +02:00
Jannik Streidl
741230bcdb fix 2024-10-10 17:20:50 +02:00
Jannik Streidl
89c77f05a8 chromadb switch to cosine similarity 2024-10-10 16:46:14 +02:00
Timothy Jaeryang Baek
d7a71f3b34
Merge pull request #6050 from RobinBially/main
feat: add qdrant vector db connector
2024-10-09 12:12:48 -07:00
Timothy J. Baek
c157004e07 refac 2024-10-09 12:05:16 -07:00
Timothy J. Baek
451f1bae15 refac: embeddings function 2024-10-09 11:41:35 -07:00
Robin Bially
34150fc3ed add NO_LIMIT var 2024-10-09 18:34:04 +02:00
Robin Bially
54dc94317c improvements 2024-10-09 18:29:14 +02:00
Robin Bially
2c59f2dcaf fix critical bug 2024-10-09 13:23:04 +02:00
Robin Bially
b56f77ed47 improvements 2024-10-09 13:10:23 +02:00
Robin Bially
878a570a2c add qdrant as vector db 2024-10-09 12:51:43 +02:00
Timothy J. Baek
3391a855f0 fix 2024-10-08 17:03:42 -07:00
Timothy J. Baek
dcab991d44 fix: model dump issue 2024-10-08 11:05:57 -07:00
Peter De-Ath
885b9f1ece refactor: Update GenerateEmbeddingsForm to support batch processing
refactor: Update embedding batch size handling in RAG configuration

refactor: add query_doc query caching

refactor: update logging statements in generate_chat_completion function

change embedding_batch_size to Optional
2024-10-08 00:04:35 +01:00
Timothy J. Baek
48e7f47558 refac 2024-10-07 14:03:42 -07:00
Jannik Streidl
9fc813cfa6 fix: only append if distances are available 2024-10-07 21:13:13 +02:00
Jannik Streidl
b105efa05f enh: append citations with distance scores 2024-10-07 21:11:04 +02:00
Timothy J. Baek
ee22ba9676 fix: web, youtube attachment issue 2024-10-06 19:44:02 -07:00
Timothy J. Baek
05c15b017d fix: milvus 2024-10-06 17:58:09 -07:00
Timothy J. Baek
378223aedb refac: comments 2024-10-05 10:08:48 -07:00
Timothy J. Baek
61b147441c refac 2024-10-05 10:05:12 -07:00
Timothy J. Baek
1f9b5b6456 refac: retain metadata for collection 2024-10-05 09:58:46 -07:00