clearml-serving/clearml_serving/serving
allegroai f4eed33f10 Add Triton support for variable length requests, adds support for HuggingFace Transformers
Add triton_grpc_compression=False (default) for grpc connection compression control
2022-09-02 23:41:54 +03:00
..
__init__.py ClearML-Serving v2 initial working commit 2022-03-06 01:25:56 +02:00
Dockerfile ClearML-Serving v2 initial working commit 2022-03-06 01:25:56 +02:00
endpoints.py Add Triton support for variable length requests, adds support for HuggingFace Transformers 2022-09-02 23:41:54 +03:00
entrypoint.sh Change default log level to warning UVICORN_LOG_LEVEL 2022-06-07 00:19:51 +03:00
main.py Optimize request serving statistics reporting 2022-06-07 00:20:33 +03:00
model_request_processor.py Add Triton support for variable length requests, adds support for HuggingFace Transformers 2022-09-02 23:41:54 +03:00
preprocess_service.py Add Triton support for variable length requests, adds support for HuggingFace Transformers 2022-09-02 23:41:54 +03:00
requirements.txt Add pandas to the default serving container, update triton client package 2022-06-05 16:12:22 +03:00