clearml/examples/hyperdatasets/README.md
allegroai 47ea919ddc PEP8
2022-12-13 16:07:51 +02:00

19 lines
1.2 KiB
Markdown
Raw Blame History

This file contains ambiguous Unicode characters

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

# ClearML HyperDatasets #
Hyper-Datasets is a data management system thats designed for unstructured data such as text, audio, or visual data. It is part of the ClearML enterprise offering, which means it includes quite a few upgrades over the open source clearml-data.
The main conceptual difference between the two is that Hyper-Datasets decouples the metadata from the raw data files. This allows you to manipulate the metadata in all kinds of ways, while abstracting away the logistics of having to deal with large amounts of data.
To leverage Hyper-Datasets power, users define Dataviews which are sophisticated queries connecting specific data from one or more datasets to an experiment in the Experiment Manager. Essentially it creates and manages local views of remote Datasets.
![Dataview in the UI](../../docs/screenshots/hpd.png)
## Examples Overview ##
- Hyperdataset registration into ClearML Enterprise
- Hypderdataset usage exmaples, retrieving frames using the Dataview Class and connecting to pytorch dataloader
## Further Resources ##
Learn about ClearML [HyperDataset](https://clear.ml/docs/latest/docs/hyperdatasets/overview)
Watch video [Tutorial](https://www.youtube.com/watch?v=1VliYRexeLU)