Stay organized with collections Save and categorize content based on your preferences.
MySQL | PostgreSQL | SQL Server To use the GA version of vector search, the instance maintenance version should be versionMYSQL_8_0_version.R20241208.01_00
or newer, where version is the minor version number. For information about how to upgrade your instance to a newer version that supports GA vector embeddings, see Self-service maintenance.
For information about the syntax and behavior of vector embeddings for the Public Preview version of this feature, see Work with vector embeddings (Preview).
This page describes how to configure memory for vector indexes, and create, tune, monitor, and drop vector indexes.
Before you beginBefore you create a vector index, you must load data into the base table with vector embedding values. Your base table must have at least 1,000 rows. If you have more data points available, you can get better partitioning and training of the index.
Configure memory allocation for vector indexesThe cloudsql_vector_max_mem_size
database flag controls how much memory your Cloud SQL instance dedicates to vector indexes. This is a static flag that requires a restart of your instance. This memory serves two main purposes:
Storing the vector index structure: the non-leaf portion of the vector index (the TREE_MEMORY
) resides in this memory. The approximate size of this tree depends on the number of leaf nodes (num_leaves
) and the dimensions of your vectors:
Approximate TREE_MEMORY = num_leaves * vector dimensions * 4 * 2
For example, an index with 1000 leaves and 768 dimensions would have an approximate TREE_MEMORY
of 1000 * 768 * 4 * 2 or 6144000 bytes. You can also check the actual TREE_MEMORY
using the information_schema.innodb_vector_indexes
table. Cloud SQL manages that memory. You don't need to allocate space for all vector indexes simultaneously, as inactive indexes are unloaded to make room for other requests.
Memory for index creation (training data): during vector index creation, memory is needed to process a sample of the data from your base table to build the index. This memory is used only during the index creation process and is freed afterward. The approximate size of the memory needed for training is:
approximate_training_memory = num_rows in base table * 0.1 * 4 * vector dimensions
For example, with a table of 1,000,000 rows and 768 dimensions, the training_memory
would be 1000000 * 0.1 * 768 * 4 or 307,200,000 bytes. Only 10% of the base table data is sampled to compute the centroids for the tree.
When you enable the cloudsql_vector
flag, Cloud SQL automatically sets a default cloudsql_vector_max_mem_size
based on your VM size. This default usually suffices for typical workloads. Cloud SQL reduces the innodb_buffer_pool_size
flag to allocate this memory. The default maximum value for cloudsql_vector_max_mem_size
is 16GB. If you need to tune your memory size, you can dynamically adjust cloudsql_vector_max_mem_size
based on your vector index usage.
Important: If you increase cloudsql_vector_max_mem_size
, you must correspondingly decrease innodb_buffer_pool_size
to avoid memory issues.
cloudsql_vector_max_mem_size
values VM size cloudsql_vector_max_mem_size 4GB 194MB 8GB 515MB 16GB 1.2GB 32GB 2.56GB 64GB 5.12GB 128GB 10.24GB 256GB+ 16GB
The range of vector index memory allocated is the following:
You can adjust the memory later, as needed. For more information, see Enable the database flag for vector embeddings.
For information about monitoring the size of your vector index, see Monitor vector indexes.
To update the memory allocated for vector indexes on the instance, use the following command:
gcloud sql instances patch INSTANCE_NAME \
--database-flags= cloudsql_vector_max_mem_size=NEW_MEMORY_VALUE;
Replace the following:
This change takes effect immediately after a database restart.
Create a vector indexThere are two ways to create a vector index:
CREATE VECTOR INDEX
statement, a Cloud SQL extension to standard MySQL syntax.ALTER TABLE
statement with the Cloud SQL ADD VECTOR INDEX
clause extension. You can't run this statement simultaneously with other DDL statements on thetable.Use the following syntax to create a vector index using CREATE VECTOR INDEX
:
CREATE
VECTOR INDEX INDEX_NAME
ON TABLE_NAME(COLUMN_NAME)
USING
SCANN[QUANTIZER = SQ8]
DISTANCE_MEASURE
= L2_SQUARED | COSINE | DOT_PRODUCT[NUM_LEAVES = INT_VALUE { '</var>' }}];
The following are the index options:
USING SCANN
: optional. Indicates the index type to use. SCANN is the only supported value.QUANTIZER
: optional. Maps a high-dimensional vector to a compressed representation. SQ8 is the only supported value.DISTANCE_MEASURE
: required. Specifies a mathematical formula to use to calculate the similarity of two vectors. You must set the same distance measure in this parameter as the distance you set in the approx_distance
search options. The supported literals are:
L2_SQUARED
COSINE
DOT_PRODUCT
NUM_LEAVES
: optional. Specifies how many partitions (leaves) to build. Only change this setting from its default setting if you have a good understanding of ANN search and your dataset. The number specified can't be larger than the number of embeddings in the base table.For example, to create a vector index, run the following:
CREATE
VECTOR INDEX vectorIndex
ON dbname.books(embeddings) DISTANCE_MEASURE = L2_SQUARED;
While the CREATE
statement is running, the base table is put into a read-only mode and no DMLs are allowed on the base table.
You can use the following syntax to create an index on an existing table:
ALTER TABLE tbl_name
ADD VECTOR INDEX index_name(key_part)[index_option];
For example, to create an index on an existing table:
ALTER TABLE t1 ADD VECTOR INDEX index1(j)
USING SCANN QUANTIZER = SQ8 DISTANCE_MEASURE = l2_squared NUM_LEAVES = 10;
Tune the vector index
This section gives further information about the parameters that you use to build the vector index. To tune the vector index, use this information to determine how to influence the build process.
Parameter Description Default Scope Impactcloudsql_vector_max_mem_size
Memory allocated for index training. Varies Instance Insufficient memory can lead to build failures. See Configure memory allocation for vector indexes. innodb_ddl_threads
Degree of parallelism for index training and build. 4 Session Higher values reduce build time but increase CPU load. Set this value to the number of CPUs you can spare without adversely affecting database operations.
Ensure cloudsql_vector_max_mem_size
is configured appropriately for training. Adjust innodb_ddl_threads
to balance build time and CPU load, considering the impact on concurrent database operations. Monitor CPU utilization during the build.
To drop a vector index, use the SQL DROP INDEX
or ALTER TABLE
statements with the index name you want to drop, as shown in the following:
DROP INDEX index_name ON books;
ALTER TABLE table_name
DROP INDEX index_name;
Monitor vector indexes
Cloud SQL provides the following information schema tables with real-time information about vector indexes that are loaded in its memory:
information_schema.innodb_vector_indexes
lists all the vector indexes that are opened in the memory after restart.information_schema.innodb_all_vector_indexes
lists all the vector indexes that exists on the instance (even if they aren't opened in the memory yet).information_schema.innodb_vector_indexes_memory
provides information about overall memory usage of vector indexes in the instance.For more detailed information, see the Information schema.
To view information in the innodb_vector_indexes
table, run the following command:
SELECT * FROM information_schema.innodb_vector_indexes \ G;
The output looks similar to the following:
INDEX_NAME: t1_vec_index
TABLE_NAME: test.t1
INDEX_TYPE: TREE_SQ
DIMENSION: 3
DIST_MEASURE: COSINE
STATUS: Ready
STATE: INDEX_READY_TO_USE
NUM_LEAVES: 10
NUM_LEAVES_TO_SEARCH: 10
QUERIES: 1
MUTATIONS: 1
TREE_MEMORY: 443
What's next
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2025-08-14 UTC.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Hard to understand","hardToUnderstand","thumb-down"],["Incorrect information or sample code","incorrectInformationOrSampleCode","thumb-down"],["Missing the information/samples I need","missingTheInformationSamplesINeed","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2025-08-14 UTC."],[],[]]
RetroSearch is an open source project built by @garambo | Open a GitHub Issue
Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo
HTML:
3.2
| Encoding:
UTF-8
| Version:
0.7.4