A RetroSearch Logo

Home - News ( United States | United Kingdom | Italy | Germany ) - Football scores

Search Query:

Showing content from https://run-ai-docs.nvidia.com/self-hosted/workloads-in-nvidia-run-ai/assets/compute-resources below:

Compute Resources | Run:ai Documentation

Compute Resources | Run:ai Documentation
  1. Workloads in NVIDIA Run:ai
  2. Workload Assets
Compute Resources

This article explains what compute resources are and how to create and use them.

Compute resources are one type of workload assets. A compute resource is a template that simplifies how workloads are submitted and can be used by AI practitioners when they submit their workloads.

A compute resource asset is a preconfigured building block that encapsulates all the specifications of compute requirements for the workload including:

The Compute resource table can be found under Workload manager in the NVIDIA Run:ai UI.

The Compute resource table provides a list of all the compute resources defined in the platform and allows you to manage them.

The Compute resource table consists of the following columns:

The name of the compute resource

A description of the essence of the compute resource

GPU devices request per pod

The number of requested physical devices per pod of the workload that uses this compute resource

GPU memory request per device

The amount of GPU memory per requested device that is granted to each pod of the workload that uses this compute resource

The minimum amount of CPU memory per pod of the workload that uses this compute resource

The maximum amount of CPU memory per pod of the workload that uses this compute resource

The minimum number of CPU cores per pod of the workload that uses this compute resource

The maximum number of CPU cores per pod of the workload that uses this compute resource

The scope of this compute resource within the organizational tree. Click the name of the scope to view the organizational tree diagram

The list of workloads associated with the compute resource

The list of workload templates that use this compute resource

The name of the user who created the compute resource

The timestamp of when the compute resource was created

The timestamp of when the compute resource was last updated

The cluster that the compute resource is associated with

Workloads Associated with the Compute Resource

Click one of the values in the Workload(s) column to view the list of workloads and their parameters.

The workload that uses the compute resource

Workspace/Training/Inference

Customizing the Table View Adding a New Compute Resource

To add a new compute resource:

  1. Go to the Compute resource table

  2. Click +NEW COMPUTE RESOURCE

  3. Select under which cluster to create the compute resource

  4. Enter a name for the compute resource. The name must be unique.

  5. Optional: Provide a description of the essence of the compute resource

  6. Set the resource types needed within a single node. The NVIDIA Run:ai Scheduler tries to match a single node that complies with the compute resource for each of the workload’s pods.

  7. Optional: More settings

  8. Click CREATE COMPUTE RESOURCE

Editing a Compute Resource

To edit a compute resource:

  1. Select the compute resource you want to edit

  2. Update the compute resource and click SAVE COMPUTE RESOURCE

Note

The already bound workload that is using this asset will not be affected.

Copying a Compute Resource

To copy an existing compute resource:

  1. Select the compute resource you want to copy

  2. Enter a name for the compute resource. The name must be unique.

  3. Update the compute resource and click CREATE COMPUTE RESOURCE

Deleting a Compute Resource
  1. Select the compute resource you want to delete

  2. On the dialog, click DELETE to confirm

Note

The already bound workload that is using this asset will not be affected.

Go to the Compute resources API reference to view the available actions


RetroSearch is an open source project built by @garambo | Open a GitHub Issue

Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo

HTML: 3.2 | Encoding: UTF-8 | Version: 0.7.4