A RetroSearch Logo

Home - News ( United States | United Kingdom | Italy | Germany ) - Football scores

Search Query:

Showing content from http://cloud.google.com/compute/docs/gpus/create-vm-with-gpus below:

Overview of creating an instance with attached GPUs | Compute Engine Documentation

Overview of creating an instance with attached GPUs

Stay organized with collections Save and categorize content based on your preferences.

Linux Windows

This document provides an overview of the steps required to create a Compute Engine instance with attached graphics processing units (GPUs). You can use GPUs to accelerate specific workloads, such as machine learning and data processing.

You can also use some GPU machine types on AI Hypercomputer. AI Hypercomputer is a supercomputing system that is optimized to support your artificial intelligence (AI) and machine learning (ML) workloads. This option is recommended for creating a densely allocated, performance-optimized infrastructure that has integrations for Google Kubernetes Engine (GKE) and Slurm schedulers.

For more information about GPUs on Compute Engine, see About GPUs.

Select the GPU model

For a list of GPU models that are available, see GPU platforms. Also make a note of the machine type that is supported for the selected GPU model.

For each model, it might also be helpful to review the following:

Limitations

In addition to the restrictions for all instances with GPUs, each machine series with attached GPUs has the following limitations:

A4 instances A3 Ultra instances A3 Mega instances A3 High instances A3 Edge instances A2 Standard instances A2 Ultra instances G4 instances G2 instances N1+GPU instances

To learn about the limitations for N1 instances with GPUs, see features for the N1 machine series and GPUs for the N1 machine series.

Choose an operating system

If you are using GPUs for machine learning, use one of the following operating systems:

Alternatively, you can use a public or custom image. For most public images or custom images, you need to install NVIDIA drivers and CUDA Toolkit. To help identify which drivers are appropriate for your GPU model, see installing GPU drivers.

Check GPU quota

To protect Compute Engine systems and users, new projects have a global GPU quota, which limits the total number of GPUs you can create in any supported zone. To review GPU quota, see GPU quota.

Note: Some regions might display quotas even though GPUs are not currently available in that region. Ensure that the region that you are requesting quotas for support GPUs. For a list of regions with GPUs, see GPUs regions and zone availability.

If you need additional GPU quota, request a quota increase. When you request GPU quota, you must request quota for the GPU types that you want to create in each region and an additional global quota for the total number of GPUs of all types in all zones.

If your project has an established billing history, it will receive quota automatically after you submit the request.

GPU instances and preemptible allocation quotas

Instances that use the standard provisioning model typically can't use preemptible allocation quotas. Preemptible quotas are for temporary workloads and are usually more available. If your project doesn't have preemptible quota, and you have never requested it, then all instances in your project consume standard allocation quotas.

If you request preemptible allocation quota, then instances that use the standard provisioning model must meet all of the following criteria to consume preemptible allocation quota:

When you consume preemptible allocation for time-bound GPU workloads, you can benefit from both uninterrupted run time and the high obtainability of preemptible allocation quota. For more information, see Preemptible quotas.

Create an instance that has attached GPUs

To create an instance that has attached GPUs, complete the following steps:

  1. Create the instance. The method used to create an instance depends on the GPU model selected.

  2. For the instance to use the GPU, you need to install the GPU driver on your instance. If you enabled an NVIDIA RTX virtual workstation (formerly known as NVIDIA GRID), install a driver for virtual workstation.

What's next?

Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.

Last updated 2025-08-14 UTC.

[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Hard to understand","hardToUnderstand","thumb-down"],["Incorrect information or sample code","incorrectInformationOrSampleCode","thumb-down"],["Missing the information/samples I need","missingTheInformationSamplesINeed","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2025-08-14 UTC."],[[["Compute Engine allows you to add GPUs to your VMs for accelerating workloads like machine learning and data processing."],["Choosing the right GPU model involves considering supported regions, pricing, and restrictions, and you can opt for a Deep Learning VM Image that comes with pre-installed GPU drivers."],["New projects have a global GPU quota, which can be reviewed and increased if needed, to limit the number of GPUs created in any supported zone."],["VMs with attached GPUs that are set to run for seven days or less and do not use reservations can consume preemptible allocation quota, offering a balance of uninterrupted run time and improved obtainability."],["Creating a VM with attached GPUs involves selecting the appropriate VM creation method based on the chosen GPU model and installing the necessary GPU driver for the operating system."]]],[]]


RetroSearch is an open source project built by @garambo | Open a GitHub Issue

Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo

HTML: 3.2 | Encoding: UTF-8 | Version: 0.7.4