mirror of
https://github.com/clearml/clearml-docs
synced 2025-02-25 05:24:39 +00:00
82 lines
4.1 KiB
Markdown
82 lines
4.1 KiB
Markdown
|
---
|
||
|
id: overview
|
||
|
title: What is ClearML?
|
||
|
slug: /
|
||
|
---
|
||
|
|
||
|
# ClearML Documentation
|
||
|
|
||
|
## Overview
|
||
|
Welcome to the documentation for ClearML, the end to end platform for streamlining AI development and deployment. ClearML consists of three essential layers:
|
||
|
1. [**Infrastructure Control Plane**](#infrastructure-control-plane) (Cloud/On-Prem Agnostic)
|
||
|
2. [**AI Development Center**](#ai-development-center)
|
||
|
3. [**GenAI App Engine**](#genai-app-engine)
|
||
|
|
||
|
Each layer provides distinct functionality to ensure an efficient and scalable AI workflow from development to deployment.
|
||
|
|
||
|
data:image/s3,"s3://crabby-images/aedc5/aedc5e9bb9b5ee7006daa572395705e5bc6bce4f" alt="Webapp gif"
|
||
|
data:image/s3,"s3://crabby-images/9fcea/9fcea62d5b70b92fa00b5d5aa23a110bd0d74d1f" alt="Webapp gif"
|
||
|
|
||
|
---
|
||
|
|
||
|
## Infrastructure Control Plane
|
||
|
The Infrastructure Control Plane serves as the foundation of the ClearML platform, offering compute resource provisioning and management, enabling administrators to make the compute available through GPUaaS capabilities and no-hassle configuration.
|
||
|
Utilizing the Infrastructure Control Plane, DevOps and IT teams can manage and optimize GPU resources to ensure high performance and cost efficiency.
|
||
|
|
||
|
#### Features
|
||
|
- **Resource Management:** Automates the allocation and management of GPU resources.
|
||
|
- **Workload Autoscaling:** Seamlessly scale GPU resources based on workload demands.
|
||
|
- **Monitoring and Logging:** Provides comprehensive monitoring and logging for GPU utilization and performance.
|
||
|
- **Cost Optimization:** Consolidate cloud and on-prem compute into a seamless GPUaaS offering
|
||
|
- **Deployment Flexibility:** Easily run your workloads on both cloud and on-premises compute.
|
||
|
|
||
|
data:image/s3,"s3://crabby-images/b8bf7/b8bf7bd10158af7fc21f535e5df01d6ad420d0a9" alt="Infrastructure control plane"
|
||
|
data:image/s3,"s3://crabby-images/a96be/a96be991991c8b58da33208cb8d497b75a767ee4" alt="Infrastructure control plane"
|
||
|
|
||
|
---
|
||
|
|
||
|
## AI Development Center
|
||
|
The AI Development Center offers a robust environment for developing, training, and testing AI models. It is designed to be cloud and on-premises agnostic, providing flexibility in deployment.
|
||
|
|
||
|
#### Features
|
||
|
- **Integrated Development Environment:** A comprehensive IDE for training, testing, and debugging AI models.
|
||
|
- **Model Training:** Scalable and distributed model training and hyperparameter optimization.
|
||
|
- **Data Management:** Tools for data preprocessing, management, and versioning.
|
||
|
- **Experiment Tracking:** Track metrics, artifacts and log. manage versions, and compare results.
|
||
|
- **Workflow Automation:** Build pipelines to formalize your workflow
|
||
|
|
||
|
data:image/s3,"s3://crabby-images/4f86b/4f86b68dd855ea4f7484d1f739ed0f4d6ebc1a57" alt="AI Dev center"
|
||
|
data:image/s3,"s3://crabby-images/41d5d/41d5d9cd5ba13d853eecac5b6d3d64e1d633128a" alt="AI Dev center"
|
||
|
|
||
|
---
|
||
|
|
||
|
## GenAI App Engine
|
||
|
The GenAI App Engine is designed to deploy large language models (LLM) into GPU clusters and manage various AI workloads, including Retrieval-Augmented Generation (RAG) tasks. This layer also handles networking, authentication, and role-based access control (RBAC) for deployed services.
|
||
|
|
||
|
#### Features
|
||
|
- **LLM Deployment:** Seamlessly deploy LLMs into GPU clusters.
|
||
|
- **RAG Workloads:** Efficiently manage and execute RAG workloads.
|
||
|
- **Networking and Authentication:** Deploy GenAI through secure, authenticated network endpoints
|
||
|
- **RBAC:** Implement RBAC to control access to deployed services.
|
||
|
|
||
|
data:image/s3,"s3://crabby-images/52421/5242113361f51c306170ac2f3d3bc58e3c924011" alt="GenAI engine"
|
||
|
data:image/s3,"s3://crabby-images/42a70/42a7012ef7f61f903e383901450871c289747c72" alt="GenAI engine"
|
||
|
|
||
|
---
|
||
|
|
||
|
## Getting Started
|
||
|
To begin using the ClearML, follow these steps:
|
||
|
1. **Set Up Infrastructure Control Plane:** Allocate and manage your GPU resources.
|
||
|
2. **Develop AI Models:** Use the AI Development Center to develop and train your models.
|
||
|
3. **Deploy AI Models:** Deploy your models using the GenAI App Engine.
|
||
|
|
||
|
For detailed instructions on each step, refer to the respective sections in this documentation.
|
||
|
|
||
|
---
|
||
|
|
||
|
## Support
|
||
|
For feature requests or bug reports, see ClearML on [GitHub](https://github.com/clearml/clearml/issues).
|
||
|
|
||
|
If you have any questions, join the discussion on the **ClearML** [Slack channel](https://joinslack.clear.ml), or tag your questions on [stackoverflow](https://stackoverflow.com/questions/tagged/clearml) with the **clearml** tag.
|
||
|
|
||
|
Lastly, you can always find us at [support@clearml.ai](mailto:support@clearml.ai?subject=ClearML).
|