clearml-serving/clearml_serving/engines/triton
allegroai f4eed33f10 Add Triton support for variable length requests, adds support for HuggingFace Transformers
Add triton_grpc_compression=False (default) for grpc connection compression control
2022-09-02 23:41:54 +03:00
..
__init__.py ClearML-Serving v2 initial working commit 2022-03-06 01:25:56 +02:00
Dockerfile Update base triton container to v22.04 (Nvidia driver version 510+) 2022-06-05 16:15:03 +03:00
entrypoint.sh Add CLEARML_EXTRA_PYTHON_PACKAGES for additional runtime package installaiton 2022-06-05 16:15:34 +03:00
requirements.txt Add pandas to the default serving container, update triton client package 2022-06-05 16:12:22 +03:00
triton_helper.py Add Triton support for variable length requests, adds support for HuggingFace Transformers 2022-09-02 23:41:54 +03:00