Stay organized with collections Save and categorize content based on your preferences.
You can choose the amount of memory to provide for your Cloud Run service. This page describes how to specify the amount of memory available for your service.
Understand memory usageCloud Run instances that exceed their allowed memory limit are terminated.
The available memory for your instance needs to be sufficient for:
The size of the deployed container image does not affect memory that is available for the instance.
Set and update memory limitsYou can set memory limits on Cloud Run services. By default, the memory allocated to each instance of a revision is 512 MiB, and the memory allocated for a function is 256 MiB.
Required minimum CPUsThe amount of allocated memory you choose corresponds to an amount of minimum CPU for your service. When setting a memory limit, the following minimum CPU limits are required:
Memory Minimum CPUs required 128 MiB .083 vCPU 256 MiB .167 vCPU 512 MiB .333 vCPU 1 GiB .583 vCPU 2 GiB 1 vCPU More than 4 GiB 2 vCPU More than 8 GiB 4 vCPU More than 16 GiB 6 vCPU More than 24 GiB 8 vCPUIf you use less than 512 MiB of memory, you should use the
first generation execution environment.
Important: If your single-threaded application's memory needs require you to select a configuration with 2 or more vCPUs, be aware that this can lead to potential "vCPU hotspots". resulting in inefficient CPU-based autoscaling. In such scenarios, carefully tuning concurrency settings becomes essential for managing load and achieving desired scaling behavior. Maximum amount of memoryThe maximum amount of memory you can configure is 32 gibibyte (32 Gi
).
You can enforce a maximum memory limit by using
custom organization policies.
Minimum memoryThe minimum memory setting varies depending on whether you use first generation or second generation execution environment:
To get the permissions that you need to configure and deploy Cloud Run services, ask your administrator to grant you the following IAM roles:
roles/run.developer
) on the Cloud Run serviceroles/iam.serviceAccountUser
) on the service identityFor a list of IAM roles and permissions that are associated with Cloud Run, see Cloud Run IAM roles and Cloud Run IAM permissions. If your Cloud Run service interfaces with Google Cloud APIs, such as Cloud Client Libraries, see the service identity configuration guide. For more information about granting roles, see deployment permissions and manage access.
Configure memory limitsAny configuration change leads to the creation of a new revision. Subsequent revisions will also automatically get this configuration setting unless you make explicit updates to change it.
For Cloud Run services, you can set memory limits using the Google Cloud console, the gcloud command line, or a YAML file when you create a new service or deploy a new revision:
ConsoleIn the Google Cloud console, go to Cloud Run:
Select Services from the menu, and click Deploy container to configure a new service. If you are configuring an existing service, click the service, then click Edit and deploy new revision.
If you are configuring a new service, fill out the initial service settings page, then click Container(s), Volumes, Networking, Security to expand the service configuration page.
Click the Container tab.
Click Create or Deploy.
gcloud run deploy
, and have the required roles granted to you.
You can update the memory allocation of a given service by using the following command:
gcloud run services update SERVICE --memory SIZE
Replace SERVICE with the name of your service and SIZE with the desired memory size. The format for size is a fixed or floating point number followed by a unit: G
or M
corresponding to gigabyte or megabyte, respectively, or use the power-of-two equivalents: Gi
or Mi
corresponding to gibibyte or mebibyte respectively.
You can also set memory limits during deployment using the command:
gcloud run deploy --image IMAGE_URL --memory SIZE
Replace the following:
us-docker.pkg.dev/cloudrun/container/hello:latest
. If you use Artifact Registry, the repository REPO_NAME must already be created. The URL follows the format of LOCATION-docker.pkg.dev/PROJECT_ID/REPO_NAME/PATH:TAG
.If you are creating a new service, skip this step. If you are updating an existing service, download its YAML configuration:
gcloud run services describe SERVICE --format export > service.yaml
Update the memory
attribute:
apiVersion: serving.knative.dev/v1 kind: Service metadata: name: SERVICE spec: template: metadata: name: REVISION spec: containers: - image: IMAGE resources: limits: memory: SIZE
Replace the following:
us-docker.pkg.dev/cloudrun/container/hello:latest
. If you use Artifact Registry, the repository REPO_NAME must already be created. The URL follows the format of LOCATION-docker.pkg.dev/PROJECT_ID/REPO_NAME/PATH:TAG
.G
or M
corresponding to gigabyte or megabyte, respectively, or use the power-of-two equivalents: Gi
or Mi
corresponding to gibibyte or mebibyte respectively.SERVICE-
-
-
Create or update the service using the following command:
gcloud run services replace service.yaml
To learn how to apply or remove a Terraform configuration, see Basic Terraform commands.
Add the following to agoogle_cloud_run_v2_service
resource in your Terraform configuration:
Under template.containers.resources.limits
, replace 512Mi
with your service's desired memory limit.
For a Cloud Run service, you can determine the peak memory requirement for a service using the following: (Standing Memory) + (Memory per Request) * (Service Concurrency)
Accordingly,
If you raise the concurrency of your service, you should also increase the memory limit to account for peak usage.
If you lower the concurrency of your service, consider reducing the memory limit to save on memory usage costs.
For more guidance on minimizing per request memory usage read Development Tips on Global Variables.
View memory limit settingsTo view the current memory limit settings for your Cloud Run service:
ConsoleIn the Google Cloud console, go to Cloud Run:
Click the service you are interested in to open the Service details page.
Click the Revisions tab.
In the details panel at the right, the memory limit setting is listed under the Container tab.
Use the following command:
gcloud run services describe SERVICE
Locate the memory limit setting in the returned configuration.
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2025-08-07 UTC.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Hard to understand","hardToUnderstand","thumb-down"],["Incorrect information or sample code","incorrectInformationOrSampleCode","thumb-down"],["Missing the information/samples I need","missingTheInformationSamplesINeed","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2025-08-07 UTC."],[],[]]
RetroSearch is an open source project built by @garambo | Open a GitHub Issue
Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo
HTML:
3.2
| Encoding:
UTF-8
| Version:
0.7.4