Stay organized with collections Save and categorize content based on your preferences.
Supervised fine-tuning helps you adapt a Gemini model to your specific needs. By providing a labeled dataset of examples, you can adjust the model's weights to optimize its performance for a particular task. This method is a good option when you have a well-defined task. It's particularly effective for domain-specific applications where the language or content differs significantly from the data the base model was trained on. You can tune models with text, image, audio, and document data.
This page describes the following topics:
To learn more about the benefits of tuning, see When to use supervised fine-tuning for Gemini and Hundreds of organizations are fine-tuning Gemini models. Here's their favorite use cases.
Use cases for supervised fine-tuningFoundation models are a good choice when the expected output or task can be clearly and concisely defined in a prompt, and the prompt consistently produces the expected output. If you want a model to learn something niche or specific that deviates from general patterns, consider tuning that model. For example, you can use model tuning to teach the model the following:
The following examples are use cases that are difficult to capture with only prompt instructions:
Classification: The expected response is a specific word or phrase. Tuning the model can help prevent it from generating verbose responses.
Summarization: The summary follows a specific format. For example, you might need to remove personally identifiable information (PII) in a chat summary. This formatting of replacing names with #Person1
and #Person2
is difficult to describe in a prompt, and the foundation model might not naturally produce such a response.
Extractive question answering: The question is about a context and the answer is a substring of the context.
Chat: You need to customize model response to follow a persona, role, or character.
You can also tune a model in the following situations:
The following Gemini models support supervised fine-tuning:
For models that support thinking, set the thinking budget to off or its lowest value. This can improve performance and reduce costs for tuned tasks. During supervised fine-tuning, the model learns from the training data and omits the thinking process. Therefore, the resulting tuned model can perform tuned tasks effectively without a thinking budget.
Limitations Gemini 2.5 FlashWhen you run a tuning job, your data, including the transformed dataset and the final tuned model, is stored in the region you specify. To use available hardware accelerators, computation might be offloaded to other regions within the US
or EU
multi-regions. This process is transparent and doesn't change where your data is stored.
You can specify the region for a tuning job in the following ways:
Vertex AI SDK: Specify the region when you initialize the client.
import vertexai
vertexai.init(project='myproject', location='us-central1')
REST API: If you create a supervised fine-tuning job by sending a POST
request to the tuningJobs.create
method, use the URL to specify the region. Replace both instances of TUNING_JOB_REGION
with the region where the job runs.
https://TUNING_JOB_REGION-aiplatform.googleapis.com/v1/projects/PROJECT_ID/locations/TUNING_JOB_REGION/tuningJobs
Google Cloud console: In the model creation workflow, select the region from the Region drop-down list on the Model details page.
Quotas limit the number of concurrent tuning jobs that you can run. Each project has a default quota to run at least one tuning job. This is a global quota, shared across all available regions and supported models. To run more jobs concurrently, you need to request additional quota for Global concurrent tuning jobs
.
For pricing details, see Vertex AI pricing.
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2025-08-18 UTC.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Hard to understand","hardToUnderstand","thumb-down"],["Incorrect information or sample code","incorrectInformationOrSampleCode","thumb-down"],["Missing the information/samples I need","missingTheInformationSamplesINeed","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2025-08-18 UTC."],[],[]]
RetroSearch is an open source project built by @garambo | Open a GitHub Issue
Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo
HTML:
3.2
| Encoding:
UTF-8
| Version:
0.7.4