clearml/examples/datasets/dataset_creation.py
pollfly 23b0c500f1
Add Dataset examples (#485)
* Add data_management example

* split dataset_management into two - dataset creation and data ingerstion examples

* add underscore to project name

* delete space

* black format
2021-10-24 14:31:20 +03:00

22 lines
556 B
Python

# Download CIFAR dataset and create a dataset with ClearML's Dataset class
from clearml import StorageManager, Dataset
manager = StorageManager()
dataset_path = manager.get_local_copy(
remote_url="https://www.cs.toronto.edu/~kriz/cifar-10-python.tar.gz"
)
dataset = Dataset.create(
dataset_name="cifar_dataset", dataset_project="dataset_examples"
)
# Prepare and clean data here before it is added to the dataset
dataset.add_files(path=dataset_path)
# Dataset is uploaded to the ClearML Server by default
dataset.upload()
dataset.finalize()