|`--id`|Serving Service (Control plane) Task ID to configure (if not provided, automatically detect the running control plane Task) | <imgsrc="/docs/latest/icons/ico-optional-no.svg"alt="No"className="icon size-md center-md"/> |
|`--tags` |Serving service's user tags. The serving service can be labeled, which can be useful for organizing | <imgsrc="/docs/latest/icons/ico-optional-yes.svg"alt="Yes"className="icon size-md center-md"/>|
|`--endpoint`|Metric endpoint name including version (e.g. `"model/1"` or a prefix `"model/*"`). Notice: it will override any previous endpoint logged metrics| <imgsrc="/docs/latest/icons/ico-optional-no.svg"alt="No"className="icon size-md center-md"/>|
|`--log-freq`|Logging request frequency, between 0.0 to 1.0. Example: 1.0 means all requests are logged, 0.5 means half of the requests are logged if not specified. To use global logging frequency, see [`config --metric-log-freq`](#config)| <imgsrc="/docs/latest/icons/ico-optional-yes.svg"alt="Yes"className="icon size-md center-md"/>|
|`--variable-scalar`|Add float (scalar) argument to the metric logger, `<name>=<histogram>`. Example: with specific buckets: `"x1=0,0.2,0.4,0.6,0.8,1"` or with min/max/num_buckets `"x1=0.0/1.0/5"`. Notice: In cases where 1000s of requests per second reach the serving, it makes no sense to display every datapoint. So scalars can be divided in buckets, and for each minute for example. Then it's possible to calculate what % of the total traffic fell in bucket 1, bucket 2, bucket 3 etc. The Y axis represents the buckets, color is the value in % of traffic in that bucket, and X is time. | <imgsrc="/docs/latest/icons/ico-optional-yes.svg"alt="Yes"className="icon size-md center-md"/>|
|`--endpoint`| Metric endpoint name including version (e.g. `"model/1"` or a prefix `"model/*"`) |<imgsrc="/docs/latest/icons/ico-optional-no.svg"alt="No"className="icon size-md center-md"/>|
|`--variable`| Remove (scalar/enum) argument from the metric logger, `<name>` example: `"x1"` |<imgsrc="/docs/latest/icons/ico-optional-yes.svg"alt="Yes"className="icon size-md center-md"/>|
|`--base-serving-url`|External base serving service url. Example: `http://127.0.0.1:8080/serve`|<imgsrc="/docs/latest/icons/ico-optional-yes.svg"alt="Yes"className="icon size-md center-md"/>|
|`--metric-log-freq`|Set default metric logging frequency between 0.0 to 1.0. 1.0 means that 100% of all requests are logged|<imgsrc="/docs/latest/icons/ico-optional-yes.svg"alt="Yes"className="icon size-md center-md"/>|
|`--name`|Specifying the model name to be registered in| <imgsrc="/docs/latest/icons/ico-optional-no.svg"alt="No"className="icon size-md center-md"/>|
|`--tags`| Add tags to the newly created model| <imgsrc="/docs/latest/icons/ico-optional-yes.svg"alt="Yes"className="icon size-md center-md"/>|
|`--project`| Specify the project for the model to be registered in| <imgsrc="/docs/latest/icons/ico-optional-no.svg"alt="No"className="icon size-md center-md"/>|
|`--publish`| Publish the newly created model (change model state to "published" (i.e. locked and ready to deploy)|<imgsrc="/docs/latest/icons/ico-optional-yes.svg"alt="Yes"className="icon size-md center-md"/>|
|`--path`|Specify a model file/folder to be uploaded and registered| <imgsrc="/docs/latest/icons/ico-optional-yes.svg"alt="Yes"className="icon size-md center-md"/>|
|`--url`| Specify an already uploaded model url (e.g. `s3://bucket/model.bin`, `gs://bucket/model.bin`)|<imgsrc="/docs/latest/icons/ico-optional-yes.svg"alt="Yes"className="icon size-md center-md"/>|
|`--destination`|Specify the target destination for the model to be uploaded (e.g. `s3://bucket/folder/`, `gs://bucket/folder/`)|<imgsrc="/docs/latest/icons/ico-optional-yes.svg"alt="Yes"className="icon size-md center-md"/>|
</div>
### canary
Add model Canary/A/B endpoint.
```bash
clearml-serving model canary [-h] [--endpoint ENDPOINT] [--weights WEIGHTS [WEIGHTS ...]]
|`--endpoint`| Model canary serving endpoint name (e.g. `my_model/latest`)| <imgsrc="/docs/latest/icons/ico-optional-yes.svg"alt="Yes"className="icon size-md center-md"/>|
|`--weights`| Model canary weights (order matching model ep), (e.g. 0.2 0.8) |<imgsrc="/docs/latest/icons/ico-optional-yes.svg"alt="Yes"className="icon size-md center-md"/>|
|`--input-endpoints`|Model endpoint prefixes, can also include version (e.g. `my_model`, `my_model/v1`)| <imgsrc="/docs/latest/icons/ico-optional-yes.svg"alt="Yes"className="icon size-md center-md"/>|
|`--input-endpoint-prefix`| Model endpoint prefix, lexicographic order or by version `<int>` (e.g. `my_model/1`, `my_model/v1`), where the first weight matches the last version.|<imgsrc="/docs/latest/icons/ico-optional-yes.svg"alt="Yes"className="icon size-md center-md"/>|
</div>
### auto-update
Add/Modify model auto-update service.
```bash
clearml-serving model auto-update [-h] [--endpoint ENDPOINT] --engine ENGINE
|`--max-versions`|Max versions to store (and create endpoints) for the model. Highest number is the latest version | <imgsrc="/docs/latest/icons/ico-optional-yes.svg"alt="Yes"className="icon size-md center-md"/>|
|`--name`| Specify model name to be selected and auto-updated (notice regexp selection use `"$name^"` for exact match) | <imgsrc="/docs/latest/icons/ico-optional-yes.svg"alt="Yes"className="icon size-md center-md"/>|
|`--tags`|Specify tags to be selected and auto-updated |<imgsrc="/docs/latest/icons/ico-optional-yes.svg"alt="Yes"className="icon size-md center-md"/>|
|`--project`|Specify model project to be selected and auto-updated | <imgsrc="/docs/latest/icons/ico-optional-yes.svg"alt="Yes"className="icon size-md center-md"/>|
|`--published`| Only select published model for auto-update |<imgsrc="/docs/latest/icons/ico-optional-yes.svg"alt="Yes"className="icon size-md center-md"/>|
|`--preprocess` |Specify Pre/Post processing code to be used with the model (point to local file / folder) - this should hold for all the models |<imgsrc="/docs/latest/icons/ico-optional-yes.svg"alt="Yes"className="icon size-md center-md"/>|
|`--input-size`| Specify the model matrix input size [Rows x Columns X Channels etc ...] | <imgsrc="/docs/latest/icons/ico-optional-yes.svg"alt="Yes"className="icon size-md center-md"/>|
|`--input-type`| Specify the model matrix input type. Examples: uint8, float32, int16, float16 etc. |<imgsrc="/docs/latest/icons/ico-optional-yes.svg"alt="Yes"className="icon size-md center-md"/>|
|`--input-name`|Specify the model layer pushing input into. Example: layer_0 | <imgsrc="/docs/latest/icons/ico-optional-yes.svg"alt="Yes"className="icon size-md center-md"/>|
|`--output-size`|Specify the model matrix output size [Rows x Columns X Channels etc ...]|<imgsrc="/docs/latest/icons/ico-optional-yes.svg"alt="Yes"className="icon size-md center-md"/>|
|`--output_type`| Specify the model matrix output type. Examples: uint8, float32, int16, float16 etc. | <imgsrc="/docs/latest/icons/ico-optional-yes.svg"alt="Yes"className="icon size-md center-md"/>|
|`--output-name`|Specify the model layer pulling results from. Examples: layer_99| <imgsrc="/docs/latest/icons/ico-optional-yes.svg"alt="Yes"className="icon size-md center-md"/>|
|`--aux-config`| Specify additional engine specific auxiliary configuration in the form of key=value. Example: `platform=onnxruntime_onnx response_cache.enable=true max_batch_size=8`. Notice: you can also pass a full configuration file (e.g. Triton "config.pbtxt")|<imgsrc="/docs/latest/icons/ico-optional-yes.svg"alt="Yes"className="icon size-md center-md"/>|
</div>
### add
Add/Update model.
```bash
clearml-serving model add [-h] --engine ENGINE --endpoint ENDPOINT [--version VERSION]
|`--preprocess` |Specify Pre/Post processing code to be used with the model (point to local file / folder) - this should hold for all the models |<imgsrc="/docs/latest/icons/ico-optional-yes.svg"alt="Yes"className="icon size-md center-md"/>|
|`--input-size`| Specify the model matrix input size [Rows x Columns X Channels etc ...] | <imgsrc="/docs/latest/icons/ico-optional-yes.svg"alt="Yes"className="icon size-md center-md"/>|
|`--input-type`| Specify the model matrix input type. Examples: uint8, float32, int16, float16 etc. |<imgsrc="/docs/latest/icons/ico-optional-yes.svg"alt="Yes"className="icon size-md center-md"/>|
|`--input-name`|Specify the model layer pushing input into. Example: layer_0 | <imgsrc="/docs/latest/icons/ico-optional-yes.svg"alt="Yes"className="icon size-md center-md"/>|
|`--output-size`|Specify the model matrix output size [Rows x Columns X Channels etc ...]|<imgsrc="/docs/latest/icons/ico-optional-yes.svg"alt="Yes"className="icon size-md center-md"/>|
|`--output_type`| Specify the model matrix output type. Examples: uint8, float32, int16, float16 etc. | <imgsrc="/docs/latest/icons/ico-optional-yes.svg"alt="Yes"className="icon size-md center-md"/>|
|`--output-name`|Specify the model layer pulling results from. Examples: layer_99| <imgsrc="/docs/latest/icons/ico-optional-yes.svg"alt="Yes"className="icon size-md center-md"/>|
|`--aux-config`| Specify additional engine specific auxiliary configuration in the form of key=value. Example: `platform=onnxruntime_onnx response_cache.enable=true max_batch_size=8`. Notice: you can also pass a full configuration file (e.g. Triton "config.pbtxt")|<imgsrc="/docs/latest/icons/ico-optional-yes.svg"alt="Yes"className="icon size-md center-md"/>|
|`--name`| Instead of specifying `--model-id` select based on model name | <imgsrc="/docs/latest/icons/ico-optional-yes.svg"alt="Yes"className="icon size-md center-md"/>|
|`--tags`|Specify tags to be selected and auto-updated |<imgsrc="/docs/latest/icons/ico-optional-yes.svg"alt="Yes"className="icon size-md center-md"/>|
|`--project`|Instead of specifying `--model-id` select based on model project | <imgsrc="/docs/latest/icons/ico-optional-yes.svg"alt="Yes"className="icon size-md center-md"/>|
|`--published`| Instead of specifying `--model-id` select based on model published |<imgsrc="/docs/latest/icons/ico-optional-yes.svg"alt="Yes"className="icon size-md center-md"/>|