7.2 KiB
| title |
|---|
| Masks |
When applicable, sources contains masks, a list of dictionaries used to connect a special type of
source data to the ClearML Enterprise platform. That source data is a mask.
Masks are used in deep learning for semantic segmentation.
Masks correspond to raw data where the objects to be detected are marked with colors in the masks. The colors are RGB values and represent the objects, which are labeled for segmentation.
In frames used for semantic segmentation, the metadata connecting the mask files / images to the ClearML Enterprise platform, and the RGB values and labels used for segmentation are separate. They are contained in two different dictionaries of a SingleFrame:
-
masks(plural) is insourcesand contains the mask files / imagesURI(in addition to other keys and values). -
mask(singular) is in theroisarray of a Frame.Each
roisdictionary contains:-
RGB values and labels of a mask (in addition to other keys and values)
-
Metadata and data for the labeled area of an image
-
See Example 1, which shows masks in sources, mask in rois, and the key-value pairs used to relate
a mask to its source in a frame.
Masks Structure
The chart below explains the keys and values of the masks dictionary (in the sources
section of a Frame).
| Key | Value Description |
|---|---|
id |
Type: integer.
|
content_type |
Type: string.
|
timestamp |
Type: integer.
|
uri |
Type: string.
|
Examples
Example 1
This example demonstrates an original image, its masks, and its frame containing
the sources and ROI metadata.
Example 1: View the frame
This frame contains the masks list of dictionaries in sources,
and the rois array, as well as several top-level key-value pairs.
{
"timestamp": 1234567889,
"context_id": "car_1",
"meta": {
"velocity": "60"
},
"sources": [
{
"id": "front",
"content_type": "video/mp4",
"width": 800,
"height": 600,
"uri": "https://s3.amazonaws.com/my_cars/car_1/front.mp4",
"timestamp": 1234567889,
"meta" :{
"angle":45,
"fov":129
},
"masks": [
{
"id": "seg",
"content_type": "video/mp4",
"uri": "https://s3.amazonaws.com/seg_masks/car_1/front_seg.mp4",
"timestamp": 123456789
},
{
"id": "seg_instance",
"content_type": "video/mp4",
"uri": "https://s3.amazonaws.com/seg_masks/car_1/front_instance_seg.mp4",
"timestamp": 123456789
}
]
}
],
"rois": [
{
"sources":["front"],
"label": ["seg"],
"mask": {
"id": "car",
"value": [210,210,120]
}
},
{
"sources":["front"],
"label": ["seg"],
"mask": {
"id": "person",
"value": [147,44,209]
}
},
{
"sources":["front"],
"label": ["seg"],
"mask": {
"id": "road",
"value": [197,135,146]
}
},
{
"sources":["front"],
"label": ["seg"],
"mask": {
"id": "street",
"value": [135,198,145]
}
},
{
"sources":["front"],
"label": ["seg"],
"mask": {
"id": "building",
"value": [72,191,65]
}
}
]
}
- In
sources:- The source ID is
front. - In the
masksdictionary, the source contains mask sources with IDs ofsegandseg_instance.
- The source ID is
- In
rois:- Each ROI source is
front, relating the ROI to its original source image. - Each ROI has a label of
seg, indicating segmentation. - Each
maskhas anid(car,person,road,street, andbuilding) and a unique RGBvalue(color-coding).
- Each ROI source is
Example 2
This example shows two masks for video from a camera. The masks label cars and the road.
Example 2: View the frame
"sources": [
{
"id": "front",
"content_type": "video/mp4",
"width": 800,
"height": 600,
"uri": "https://s3.amazonaws.com/my_cars/car_1/front.mp4",
"timestamp": 1234567889,
"meta" :{
"angle":45,
"fov":129
},
"masks": [
{
"id": "car",
"content_type": "video/mp4",
"uri": "https://s3.amazonaws.com/seg_masks/car_1/front_seg.mp4",
"timestamp": 123456789
},
{
"id": "road",
"content_type": "video/mp4",
"uri": "https://s3.amazonaws.com/seg_masks/car_1/front_instance_seg.mp4",
"timestamp": 123456789
}
]
}
],
"rois": [
{
"sources":["front"],
"label": ["right_lane"],
"mask": {
"id": "car",
"value": [210,210,120]
}
},
{
"sources":["front"],
"label": ["right_lane"],
"mask": {
"id": "road",
"value": [197,135,146]
}
}
- In
sources:- The source ID is
front. - The source contains mask sources with IDs of
carandroad.
- The source ID is
- In
rois:- Each ROI source is
frontrelating the ROI to its original source image. - Each ROI has a label of
right_laneindicating the ROI object. - Each
maskhas anid(car,person) and a unique RGBvalue(color-coding).
- Each ROI source is

