ClearML Hyper-Datasets
Hyper-Datasets is a data management system designed for unstructured data such as text, audio, or visual data. It is part of the ClearML Enterprise offering and adds a number of capabilities beyond the open source clearml-data.
The main conceptual difference between the two is that Hyper-Datasets decouples the metadata from the raw data files. This lets you manipulate the metadata on its own, while abstracting away the logistics of handling large amounts of data.
To leverage Hyper-Datasets, users define Dataviews, which are sophisticated queries that connect specific data from one or more datasets to an experiment in the Experiment Manager. In essence, a Dataview creates and manages a local view of remote datasets.
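To make the idea of decoupled metadata and query-based views more concrete, here is a minimal conceptual sketch in plain Python. It does not use the ClearML Enterprise SDK: `FrameRecord`, `build_view`, the dataset names, and the `s3://` URIs are all hypothetical and exist only for illustration. Each record holds metadata plus a pointer to the raw file, and the "view" is simply a filter over those records, so no raw data is touched while selecting.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FrameRecord:
    """Hypothetical metadata record: it points at raw data but never holds it."""
    dataset: str        # name of the dataset the frame belongs to
    source_uri: str     # location of the raw file (not downloaded here)
    labels: tuple = ()  # annotation labels attached to the frame


def build_view(records, datasets, required_label):
    """Select records from the given datasets that carry required_label.

    Only metadata is inspected, which mimics querying without moving raw files.
    """
    wanted = set(datasets)
    return [
        r for r in records
        if r.dataset in wanted and required_label in r.labels
    ]


# Illustrative metadata only; these datasets and URIs are made up.
records = [
    FrameRecord("street_scenes", "s3://bucket/img_001.jpg", ("car", "person")),
    FrameRecord("street_scenes", "s3://bucket/img_002.jpg", ("bicycle",)),
    FrameRecord("dashcam", "s3://bucket/clip_17.mp4", ("car",)),
    FrameRecord("indoor", "s3://bucket/room_3.jpg", ("car",)),
]

# A view spanning two datasets, restricted to frames labeled "car".
view = build_view(records, ["street_scenes", "dashcam"], "car")
for r in view:
    print(r.dataset, r.source_uri)
```

In this sketch the experiment would only fetch the files behind the selected URIs, which is the practical benefit of keeping metadata separate from the raw data.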
Examples Overview
- Hyper-Dataset registration into ClearML Enterprise
- Hyper-Dataset usage examples: retrieving frames using the Dataview class and connecting them to a PyTorch DataLoader
Further Resources
Learn about ClearML Hyper-Datasets
Watch the video tutorial