The NVIDIA Run:ai Administrator (runai-adm
) is a lightweight tool designed to support infrastructure administrators by simplifying two key tasks:
Collecting logs for troubleshooting and sharing with NVIDIA Run:ai support.
Configuring node roles in the cluster for optimal performance and reliability.
This section outlines the installation and usage of the NVIDIA Run:ai Administrator CLI to help you get started quickly.
Before installing the CLI, review the following:
Operating system: The CLI is supported on Mac and Linux.
Kubectl: The Kubernetes command-line interface must be installed and configured to access your cluster. Follow the official guide .
Cluster administrative permissions: The CLI requires a Kubernetes profile with administrative privileges.
To install the NVIDIA Run:ai Administrator CLI, ensure that the CLI version matches the version of your NVIDIA Run:ai cluster. You can either install the latest version or a specific version from the list .
Installing the Latest VersionUse the following commands to download and install the latest version of the CLI:
Macwget --content-disposition https://app.run.ai/v1/k8s/admin-cli/darwin
chmod +x runai-adm
sudo mv runai-adm /usr/local/bin/runai-adm
Linux
wget --content-disposition https://app.run.ai/v1/k8s/admin-cli/linux
chmod +x runai-adm
sudo mv runai-adm /usr/local/bin/runai-adm
Installing a Specific Version
To install a specific version of the Administrator CLI that matches your NVIDIA Run:ai cluster version, append the version number to the download URL. Refer to the list of available versions linked above for the correct version number.
Macwget --content-disposition https://app.run.ai/v1/k8s/admin-cli/<version>/darwin # Replace <version> with the desired version in the format vX.X.X (e.g., v2.19.5)
chmod +x runai-adm
sudo mv runai-adm /usr/local/bin/runai-adm
Linux
wget --content-disposition https://app.run.ai/v1/k8s/admin-cli/<version>/linux # Replace <version> with the desired version in the format vX.X.X (e.g., v2.19.5)
chmod +x runai-adm
sudo mv runai-adm /usr/local/bin/runai-adm
Verify your installation completed successfully by running the following command:
To set or remove node rules using the runai-adm
tool, run the following:
runai-adm set node-role [--runai-system-worker | --gpu-worker | --cpu-worker] <node-name>
runai-adm remove node-role [--runai-system-worker | --gpu-worker | --cpu-worker] <node-name>
Note
Use the --all
flag to set or remove a role to all nodes.
To collect logs using the runai-adm
tool:
Run the following command:
Locate the generated compressed log file.
RetroSearch is an open source project built by @garambo | Open a GitHub Issue
Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo
HTML:
3.2
| Encoding:
UTF-8
| Version:
0.7.4