Stay organized with collections Save and categorize content based on your preferences.
Preview
This feature is subject to the "Pre-GA Offerings Terms" in the General Service Terms section of the Service Specific Terms. Pre-GA features are available "as is" and might have limited support. For more information, see the launch stage descriptions.
Vertex AI Model Optimizer is a dynamic endpoint designed to simplify model selection by automatically applying the Gemini model which best meets your needs. This lets you point your prompts at a single meta-endpoint and the service intelligently selects the most appropriate Gemini model for your query (Pro, Flash, etc.) based on your cost and quality preferences.
For more information on Model Optimizer pricing, see Pricing.
BenefitsModel Optimizer lets you:
Model Optimizer supports all languages that are also supported by the Gemini models. (See Gemini Language support)
ModalityModel Optimizer supports text use cases, including:
For limitations and how to handle them, see Handling unsupported features.
Getting startedTo get started with Model Optimizer, see our quickstart Colab notebook.
Using Vertex AI Model Optimizer Python Installpip install --upgrade google-genai
To learn more, see the SDK reference documentation.
Set environment variables to use the Gen AI SDK with Vertex AI:
# Replace the `GOOGLE_CLOUD_PROJECT` and `GOOGLE_CLOUD_LOCATION` values # with appropriate values for your project. export GOOGLE_CLOUD_PROJECT=GOOGLE_CLOUD_PROJECT export GOOGLE_CLOUD_LOCATION=global export GOOGLE_GENAI_USE_VERTEXAI=TrueHandling unsupported features
Model Optimizer only supports text input and output. However, the request could include different modalities or tools that aren't supported. The following sections cover how Model Optimizer handles these unsupported features.
Multimodal requestsRequests that include prompts with multimodal data, such as video, images or audio, will throw an INVALID_ARGUMENT
error.
Model Optimizer only supports function declaration for requests. If a request contains other tool types including google_maps
, google_search
, enterprise_web_search
, retrieval
, or browse
, an INVALID_ARGUMENT
error is thrown.
To send feedback about your experience with Model Optimizer, fill out our feedback survey.
If you have questions, technical issues, or feedback about Model Optimizer, contact model-optimizer-support@google.com.
Customer discussion groupTo connect directly with the development team, you can join the Vertex AI Model Optimizer Listening Group, where you can learn about the product and help us understand how to make the features work better for you. The group's activities include:
Activities are offered about once every 6-8 weeks. You can take part in as many or as few as you'd like, or you can opt out entirely at any time. To join the group, complete the Vertex AI Model Optimizer discussion group sign up form.
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2025-08-14 UTC.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Hard to understand","hardToUnderstand","thumb-down"],["Incorrect information or sample code","incorrectInformationOrSampleCode","thumb-down"],["Missing the information/samples I need","missingTheInformationSamplesINeed","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2025-08-14 UTC."],[],[]]
RetroSearch is an open source project built by @garambo | Open a GitHub Issue
Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo
HTML:
3.2
| Encoding:
UTF-8
| Version:
0.7.4