clearml-docs/docs/webapp/webapp_model_endpoints.md
2025-01-19 09:50:47 +02:00

90 lines
4.1 KiB
Markdown

---
title: Model Endpoints
---
The Model Endpoints table lists all currently live (active, and being brought up) model endpoints, allowing you to view
endpoint details and monitor status over time. Whenever you deploy a model through the [ClearML Deploy UI applications](applications/apps_overview.md#deploy),
it will be listed in the table.
View the model endpoints in table view <img src="/docs/latest/icons/ico-table-view.svg" alt="Table view" className="icon size-md space-sm" />
or in details view <img src="/docs/latest/icons/ico-split-view.svg" alt="Details view" className="icon size-md space-sm" />
using the buttons on the top left of the page. Use the table view for a comparative view of your endpoints according to
columns of interest. Use the details view to access a selected endpoint's details. Details view can also be accessed by
double-clicking a specific endpoint in the table view to open its details view.
Download the model endpoint table as a CSV file by clicking **Download** <img src="/docs/latest/icons/ico-download.svg" alt="Download" className="icon size-md space-sm" />.
## Endpoint Tables
Active Endpoints are displayed in the **Active** tab. The table provides the following information:
* Endpoint - Endpoint name
* Model - Model Name
* URL - Endpoint URL
* \# Instances - Number of model instances
* Uptime - Longest duration that any of the model instances has been running
* \# Requests - Total number of requests to the endpoint
* Requests / MIN (avg) - Average request rate in the last minute
* Latency (avg) - Average endpoint response latency
![Active endpoints](../img/webapp_model_endpoints_active_table.png)
The **Loading** tab shows endpoints that are being set up, but are not yet active. The table provides the following
information:
* Instance ID - The model deployment application instance ID
* Model - Model Name
* Uptime - Time since this endpoint has started setting up
* Preprocess artifact - Preprocessing code used for the endpoint
* Input type - Model matrix input type (e.g. uint8, float32, int16, float16)
* Input size - Model matrix input size
![Loading endpoints](../img/webapp_model_endpoints_loading_table.png)
You can apply column filters by clicking <img src="/docs/latest/icons/ico-filter-off.svg" alt="Filter" className="icon size-md" />,
and sort endpoints by clicking <img src="/docs/latest/icons/ico-sort-off.svg" alt="Sort order" className="icon size-md space-sm" /> on the relevant column.
## Active Endpoint Details
Clicking on a model endpoint opens it in detailed view.
### Details
The **Details** tab displays the model endpoint information:
* Endpoint Name
* Endpoint URL
* Model name (click to go to ClearML or HuggingFace model page)
* Uptime - Duration of longest running endpoint instance
* Preprocess artifact - Preprocessing code used for the endpoint
* Input type - Model matrix input type (e.g. uint8, float32, int16, float16)
* Input size - Model matrix input size
* Model instances list - Model instances servicing the endpoint. The following information is displayed for each instance:
* Instance ID - Click instance ID to go to the model endpoint's ClearML Application instance dashboard .
* Uptime
* Number of requests - Total count
* Request rate - Average requests per minute
* CPU Count
* GPU Count
* Latency - Average request latency in the last minute
![Endpoints details](../img/webapp_model_endpoints_details.png)
### Monitor
The Monitor tab displays the endpoint's operational metrics and resource usage over time graphs:
* Total number of requests
* Average requests/min
* Average latency
* Machine utilization metrics:
* CPU and GPU Usage
* Memory Usage
* Video Memory Usage
* Network Usage
![Endpoints monitor](../img/webapp_model_endpoints_monitor.png)
The graphs' time span can be controlled through the menu at its top right corner.
Click <img src="/docs/latest/icons/ico-show.svg" alt="Eye Show" className="icon size-md space-sm" /> to control which
plots to display.
For example, to display specific plots, click **HIDE ALL**, and then click <img src="/docs/latest/icons/ico-show.svg" alt="Eye Show" className="icon size-md space-sm" />
on each plot you want to view.