clearml-serving/clearml_serving/serving
2024-08-15 16:18:41 +03:00
..
__init__.py ClearML-Serving v2 initial working commit 2022-03-06 01:25:56 +02:00
Dockerfile Upgrade to python 3.11 2023-04-12 23:38:56 +03:00
endpoints.py Add Triton support for variable length requests, adds support for HuggingFace Transformers 2022-09-02 23:41:54 +03:00
entrypoint.sh add degug print 2024-08-14 19:47:58 +03:00
init.py Add clearml_serving_inference restart on CUDA OOM (#75) 2024-07-07 15:54:08 +03:00
main.py set status in exit 2024-08-15 16:00:40 +03:00
model_request_processor.py call gc on remove as well 2024-08-15 16:18:41 +03:00
preprocess_service.py call gc on remove as well 2024-08-15 16:18:41 +03:00
requirements.txt Fix requirements 2024-02-27 09:43:18 +02:00
uvicorn_mp_entrypoint.py Optimize async processing for increased speed 2022-10-08 02:12:04 +03:00