CREATE VECTOR INDEX (Transact-SQL) (Preview)

Applies to: SQL Server 2025 (17.x) Azure SQL Database SQL database in Microsoft Fabric

Create an approximate index on a vector column to improve performances of nearest neighbors search. To learn more about how vector indexing and vector search works, and the differences between exact and approximate search, refer to Vector search and vector indexes in the SQL Database Engine.

Azure SQL Database and SQL database in Fabric

The feature is in preview. Check Limitations and considerations before proceeding.

Note

As a preview feature, the technology presented in this article is subject to Supplemental Terms of Use for Microsoft Azure Previews.

Warning

Deprecation notice: Vector indexes created using an earlier data structure are supported in the current release but will be retired in a future version. To ensure future compatibility and access to the latest vector search capabilities, migrate existing vector indexes using the steps in the Migrating from earlier vector index versions section.

Regional availability

This feature is being deployed across Azure SQL Database and SQL database in Microsoft Fabric. During the rollout, availability and behavior might vary by region and by index version. If a feature or syntax isn't available, it becomes available automatically as deployment completes. For current regional availability status, see Feature availability by region.

SQL Server 2025 Preview feature

In SQL Server 2025 this function is in preview and is subject to change. In order to use this feature, you must enable the PREVIEW_FEATURES database scoped configuration.

Make sure to check out the current limitations before using it.

Note

The latest version of Vector Indexes is only available in Azure SQL Database and SQL database in Microsoft Fabric currently.

Syntax

Transact-SQL syntax conventions

CREATE VECTOR INDEX index_name
ON object ( vector_column )
[ WITH (
    [ , ] METRIC = { 'cosine' | 'dot' | 'euclidean' }
    [ [ , ] TYPE = 'DiskANN' ]
    [ [ , ] MAXDOP = max_degree_of_parallelism ]
) ]
[ ON { filegroup_name | "default" } ]
[;]

Arguments

index_name

The name of the index. Index names must be unique within a table but don't have to be unique within a database. Index names must follow the rules of identifiers.

object

Table on which the index is created. It must be a base table. Views, temporary tables, both local and global, aren't supported.

vector_column

Column to use to create the vector index. It must be of vector type.

METRIC

A string with the name of the distance metric to use to calculate the distance between the two given vectors. The following distance metrics are supported:

cosine - Cosine distance
euclidean - Euclidean distance
dot - (Negative) Dot product

TYPE

The type of ANN algorithm used to build the index. Only DiskANN is currently supported. DiskANN is the default value.

MAXDOP

Overrides the max degree of parallelism configuration option for the index operation. For more information, see Server configuration: max degree of parallelism. Use MAXDOP to limit the degree of parallelism and the resulting resource consumption for an index build operation.

max_degree_of_parallelism can be:

1

Suppresses parallel plan generation.
>1

Restricts the maximum degree of parallelism used in a parallel index operation to the specified number or less based on the current system workload.
0 (default)

Uses the degree of parallelism specified at the server, database, or workload group level, unless reduced based on the current system workload.

For more information, see Configure parallel index operations.

Note

Parallel index operations aren't available in every edition of SQL Server. For a list of features that are supported by the editions of SQL Server, see Editions and supported features of SQL Server 2022 or Editions and supported features of SQL Server 2025.

Upgrade vector indexes to the latest version

Important

Newly created vector indexes automatically use the latest data structure, which provides:

Full DML support: Removes the previous limitation that made vector-indexed tables read-only after index creation. You can now perform INSERT, UPDATE, DELETE, and MERGE operations while maintaining vector index functionality with automatic, real-time index maintenance
Iterative filtering: Predicates in the WHERE clause are applied during the vector search process, not after retrieval
Optimizer-driven: The query optimizer automatically determines whether to use the DiskANN index or kNN search based on query characteristics
Advanced quantization: Vector quantization techniques have been integrated to provide better storage efficiency and faster query performance, with these optimizations being transparent to users

For details on earlier vector index version limitations, see the Limitations and considerations section.

Migrating from earlier vector index versions

Vector indexes created using an earlier version must be dropped and recreated to enable the latest capabilities. This section explains how to identify, migrate, and verify vector index versions.

Step 1: Identify existing vector indexes

Use the following query to identify vector indexes that require migration:

SELECT
    i.name AS index_name,
    t.name AS table_name,
    JSON_VALUE(v.build_parameters, '$.Version') AS index_version,
    CASE
        WHEN JSON_VALUE(v.build_parameters, '$.Version') >= '3'
            THEN 'Uses latest version (no migration required)'
        WHEN JSON_VALUE(v.build_parameters, '$.Version') < '3'
            THEN 'Created using an earlier version (migration recommended)'
        ELSE 'Unknown format'
    END AS migration_status
FROM sys.vector_indexes AS v
    INNER JOIN sys.indexes AS i
        ON v.object_id = i.object_id
        AND v.index_id = i.index_id
    INNER JOIN sys.tables AS t
        ON v.object_id = t.object_id
ORDER BY t.name, i.name;

How to interpret the results

Uses latest version

Already supports iterative filtering, full DML support, optimizer-driven execution and improved quantization
No migration required

Created using an earlier version

Uses legacy post-filter behavior
Doesn't support the latest vector search capabilities
Migration is strongly recommended to ensure future compatibility

Step 2: Drop and recreate the vector index

Vector indexes created using an earlier format can't be upgraded in place. To enable the latest DiskANN capabilities, drop and recreate the index.

Warning

Service impact: Dropping a vector index immediately disables approximate vector search on the affected table until the index is recreated. Plan migrations during maintenance windows for production systems.

Drop the existing index

DROP INDEX vec_idx ON dbo.wikipedia_articles;

Recreate the index

CREATE VECTOR INDEX vec_idx
    ON dbo.wikipedia_articles (title_vector)
    WITH (
        TYPE = 'DISKANN',
        METRIC = 'COSINE'
    );

Note

Vector indexes created using the current CREATE VECTOR INDEX statement automatically use the latest DiskANN format. No additional options or flags are required.

Step 3: Verify the index version

After recreation, verify the index is using the latest version:

SELECT
    i.name AS index_name,
    t.name AS table_name,
    JSON_VALUE(v.build_parameters, '$.Version') AS index_version
FROM sys.vector_indexes AS v
    INNER JOIN sys.indexes AS i
        ON v.object_id = i.object_id
        AND v.index_id = i.index_id
    INNER JOIN sys.tables AS t
        ON v.object_id = t.object_id
WHERE i.name = 'vec_idx';

The index_version column should display 3 for the latest version.

Error behavior with version incompatibility

If you attempt to use the TOP_N parameter in VECTOR_SEARCH with a latest version vector index, SQL Server returns the following error:

Msg 42274, Level 16, State 1
Vector search with version 3 index does not support explicit TOP_N parameter.

To resolve this error, remove the TOP_N parameter from VECTOR_SEARCH and use the SELECT TOP (N) WITH APPROXIMATE syntax instead. For detailed information, see Error using legacy syntax.

Limitations and considerations

Earlier vector index version limitations

Earlier vector index versions have the following additional limitations. To check your index version, see Verify the index version.

Post-filtering only: Predicates are applied only after vector retrieval, not during the search process. This can result in fewer rows returned than expected when filters are applied.
Read-only tables: Tables with vector indexes are read-only. No DML operations (INSERT, UPDATE, DELETE, MERGE) are allowed after the vector index is created. Use the ALLOW_STALE_VECTOR_INDEX database scoped configuration to enable DML operations if you can tolerate stale search results.
Manual TOP_N tuning: You must manually adjust the TOP_N parameter in VECTOR_SEARCH to compensate for post-filtering, often requiring oversized values to get the desired number of results.

Current limitations (applies to the latest version too)

The current preview has the following limitations:

Vector indexes can't be partitioned. No partition support.
The table must have a primary key clustered index.
Vector indexes aren't replicated to subscribers.
Tables with vector indexes can't be truncated using TRUNCATE TABLE. To remove all data, drop the vector index first, truncate the table, repopulate with at least 100 rows, then recreate the index. For more information, see TRUNCATE TABLE restrictions.

Minimum data requirements

Vector indexes require a minimum number of rows with non-NULL vector values before the index can be created.

Minimum row count: At least 100 rows with non-NULL vector values must exist in the table.
Error behavior: Attempting to create a vector index on a table with fewer than 100 rows fails with error Msg 42266.

Example error:

Msg 42266, Level 16, State 1
Cannot create a vector index. The table contains only 8 rows with non-null vectors, 
but at least 100 are required for vector index creation.

Best practice: Populate the table with at least 100 rows before creating the vector index. For development and testing scenarios where fewer rows are needed, VECTOR_SEARCH works without an index using a brute-force scan approach, though performance degrades with larger datasets.

DML support

Once a DiskANN vector index is created using the latest version, the table is no longer read-only. You can freely modify data using standard data manipulation language (DML) operations, and changes are automatically reflected in vector search results.

This capability makes vector search suitable for live, transactional workloads where data changes over time.

Behavior notes

DML operations don't require dropping or rebuilding the vector index.
Changes are visible to vector search queries after the transaction commits.
For large-scale data replacement (for example, deleting most rows and inserting an entirely new set of embeddings), consider dropping and recreating the vector index after the data load to ensure optimal search quality.

Note

DML support is only available with vector indexes created using the latest version. Earlier versions require tables to be read-only or use the ALLOW_STALE_VECTOR_INDEX database scoped configuration.

Monitoring vector index maintenance

Vector indexes perform background maintenance to incorporate DML changes. Use the sys.dm_db_vector_indexes dynamic management view to monitor index health and maintenance task status.

Combining vector indexes with traditional indexes

Vector indexes work alongside traditional B-tree indexes to provide optimal query performance. When using iterative filtering with VECTOR_SEARCH, consider creating traditional indexes on columns used in filter predicates.

For detailed information about iterative filtering behavior and how it differs from earlier versions, see Iterative filtering behavior.

Tip

The query optimizer automatically selects the best execution strategy (approximate nearest neighbor index vs. kNN search). To force the use of the approximate nearest neighbor index, use the FORCE_ANN_ONLY table hint. For more information, see Table hints for vector search.

Example scenario:

-- Create vector index for similarity search
CREATE VECTOR INDEX idx_embeddings_vector
ON product_embeddings(embedding)
WITH (METRIC = 'cosine');

-- Create traditional index for filter columns
CREATE NONCLUSTERED INDEX idx_embeddings_filters
ON product_embeddings(category);

Performance benefit:

When executing queries with iterative filtering, the SQL Server query optimizer uses both index types:

DECLARE @qv VECTOR(1536) = AI_GENERATE_EMBEDDINGS(N'wireless headphones' USE MODEL EmbeddingModel);

SELECT TOP (10) WITH APPROXIMATE
    p.name,
    p.price,
    vs.distance
FROM products p
INNER JOIN VECTOR_SEARCH(
    TABLE = product_embeddings AS e,
    COLUMN = embedding,
    SIMILAR_TO = @qv,
    METRIC = 'cosine'
) AS vs ON p.id = e.product_id
WHERE e.approved = 1             
  AND e.category = 'Electronics'  -- Can use traditional index
ORDER BY vs.distance;

In this query:

The vector index identifies similar embeddings based on the query vector
The traditional index on (category) filters candidates efficiently during the iterative search process

This composite strategy can improve query performance significantly compared to using only a vector index, particularly when filter predicates have high selectivity.

Data quality and maintenance guidance for vector indexes

Avoid datasets with high duplicate embeddings

Vector indexing works best when embeddings represent diverse semantic content. Datasets with a high proportion of duplicate vectors aren't recommended for vector indexing.

High duplication can lead to:

Poor result quality: Duplicate vectors appear repeatedly in results, crowding out more relevant semantic matches.
Reduced effectiveness: Duplicate embeddings displace better neighbors, lowering the usefulness of similarity search.
Unnecessary resource usage: Vector indexes are expensive to build and maintain, and duplicates add cost without adding value.

Best practice: Deduplicate embeddings before creating a vector index to improve both performance and result quality.

Large-scale data replacement scenarios

Vector indexes support inserts, updates, and deletes. However, when most or all embeddings are replaced—for example, re-embedding a dataset with a new model—the existing index might no longer reflect the new data distribution.

In large-scale replacement scenarios:

Vector search queries continue to return valid results
But Recall and Ranking quality may degrade, because the index structure was built for a different embedding distribution.

Best practice: When performing near-complete data replacement (delete and insert of fresh embeddings), drop and recreate the vector index after loading the new data. Recreating the index ensures it's optimized for the new embedding distribution and restores predictable query behavior.

Known issues

For more information, review Known issues.

Permissions

The user must have ALTER permission on the table.

Examples

Download and import the Wikipedia article with vector embeddings sample.

Examples assume the existence of a table named wikipedia_articles with a column title_vector of type vector that stores title's embeddings of Wikipedia articles. title_vector is assumed to be an embedding generated with an embedding model like text-embedding-ada-002 or text-embedding-3-small, which returns vectors with 1,536 dimensions.

For more examples, including end-to-end solutions, go to the Azure SQL Database Vector Search Samples GitHub repo.

Example 1

The following example creates a vector index on the title_vector column using the cosine metric.

CREATE VECTOR INDEX vec_idx
    ON [dbo].[wikipedia_articles] ([title_vector])
        WITH (METRIC = 'COSINE', TYPE = 'DISKANN');

Example 2

The following example creates a vector index on the title_vector column using the (negative) dot product metric, limiting the parallelism to 8 and storing the vector in the SECONDARY filegroup.

CREATE VECTOR INDEX vec_idx
    ON [dbo].[wikipedia_articles] ([title_vector])
        WITH (METRIC = 'DOT', TYPE = 'DISKANN', MAXDOP = 8)
    ON [SECONDARY];

Example 3

A basic end-to-end example using CREATE VECTOR INDEX and the related VECTOR_SEARCH function. The embeddings are mocked. In a real world scenario, embeddings are generated using an embedding model and AI_GENERATE_EMBEDDINGS, or an external library such as OpenAI SDK.

Note

Latest version vector indexes require at least 100 rows of data before index creation. This example inserts 100 rows to meet this requirement. For more information, see Minimum data requirements.

The following code block demonstrates CREATE VECTOR INDEX with mock embeddings:

Enables the preview feature (required for SQL Server 2025 only; not needed for Azure SQL Database or SQL database in Fabric).
Create a sample table dbo.Articles with a column embedding with data type vector(5).
Insert 100 rows of sample data with mock embedding data.
Create a vector index on dbo.Articles.embedding.
Demonstrate the vector similarity search with the VECTOR_SEARCH function.

-- Step 0: Enable Preview Feature (SQL Server 2025 only)
ALTER DATABASE SCOPED CONFIGURATION
SET PREVIEW_FEATURES = ON;
GO

-- Step 1: Create a sample table with a VECTOR(5) column
CREATE TABLE dbo.Articles
(
    id INT PRIMARY KEY,
    title NVARCHAR(100),
    content NVARCHAR(MAX),
    embedding VECTOR(5) -- mocked embeddings
);
GO

-- Step 2: Insert sample data (100 rows required for latest version indexes)
INSERT INTO Articles (id, title, content, embedding)
SELECT
    value AS id,
    'Article ' + CAST(value AS NVARCHAR(10)),
    'Content for article ' + CAST(value AS NVARCHAR(10)),
    CAST(JSON_ARRAY(
        CAST(value * 0.01 AS FLOAT),
        CAST(value * 0.02 AS FLOAT),
        CAST(value * 0.03 AS FLOAT),
        CAST(value * 0.04 AS FLOAT),
        CAST(value * 0.05 AS FLOAT)
    ) AS VECTOR(5))
FROM GENERATE_SERIES(1, 100);
GO

-- Step 3: Create a vector index on the embedding column
CREATE VECTOR INDEX vec_idx ON Articles(embedding)
WITH (METRIC = 'cosine', TYPE = 'diskann');
GO

-- Step 4: Perform a vector similarity search
DECLARE @qv VECTOR(5) = '[0.3, 0.3, 0.3, 0.3, 0.3]';
SELECT TOP(3) WITH APPROXIMATE
    t.id,
    t.title,
    t.content,
    s.distance
FROM
    VECTOR_SEARCH(
        TABLE = Articles AS t,
        COLUMN = embedding,
        SIMILAR_TO = @qv,
        METRIC = 'cosine'
    ) AS s
ORDER BY s.distance, t.title;

The syntax for querying depends on the vector index version:

Vector index version	Syntax example
Latest version	Use `SELECT TOP (N) WITH APPROXIMATE` without `TOP_N` parameter
Earlier versions (deprecated)	Use `TOP_N` parameter in `VECTOR_SEARCH` function

For earlier version indexes (deprecated syntax):

DECLARE @qv VECTOR(5) = '[0.3, 0.3, 0.3, 0.3, 0.3]';
SELECT TOP(3)
    t.id,
    t.title,
    t.content,
    s.distance
FROM
    VECTOR_SEARCH(
        TABLE = Articles AS t,
        COLUMN = embedding,
        SIMILAR_TO = @qv,
        METRIC = 'cosine',
        TOP_N = 3
    ) AS s
ORDER BY s.distance, t.title;

Example 4: Working with DML operations

The following examples demonstrate DML operations on a table with a vector index created using the latest version.

Delete rows

Deleting rows removes them from both the table and vector search results.

DELETE FROM dbo.wikipedia_articles
WHERE id = 12345;

After the delete completes, the removed row no longer appears in vector search queries.

Insert new rows

You can insert new rows with embeddings, and they become immediately searchable without rebuilding the index.

INSERT INTO dbo.wikipedia_articles (id, title, title_vector)
VALUES (
    99999,
    N'Quantum Computing Basics',
    AI_GENERATE_EMBEDDINGS(N'Quantum Computing Basics' USE MODEL Ada2Embeddings)
);

Newly inserted embeddings are automatically incorporated into the vector index and can be returned by subsequent vector search queries.

Update existing rows

Updating vector or non-vector columns is fully supported.

DECLARE @new_embedding VECTOR(1536);
SET @new_embedding = AI_GENERATE_EMBEDDINGS(N'Updated article title' USE MODEL Ada2Embeddings);

UPDATE dbo.wikipedia_articles
SET title_vector = @new_embedding,
    title = N'Updated article title'
WHERE id = 50000;

If a vector column is updated, the index is updated accordingly so future vector searches use the new embedding.

Use MERGE for complex operations

The MERGE statement allows you to perform insert, update, and delete operations in a single statement.

MERGE INTO dbo.wikipedia_articles AS target
USING (
    SELECT 
        id,
        title,
        AI_GENERATE_EMBEDDINGS(title USE MODEL Ada2Embeddings) AS title_vector
    FROM dbo.staging_articles
) AS source
ON target.id = source.id
WHEN MATCHED THEN
    UPDATE SET 
        title = source.title,
        title_vector = source.title_vector
WHEN NOT MATCHED BY TARGET THEN
    INSERT (id, title, title_vector)
    VALUES (source.id, source.title, source.title_vector)
WHEN NOT MATCHED BY SOURCE AND target.id > 100000 THEN
    DELETE;

The vector index is automatically updated to reflect all changes made by the MERGE statement.

Feedback

Was this page helpful?

Last updated on 2026-03-18

Share via

CREATE VECTOR INDEX (Transact-SQL) (Preview)

Azure SQL Database and SQL database in Fabric

Regional availability

SQL Server 2025 Preview feature

Syntax

Arguments

index_name

object

vector_column

METRIC

TYPE

MAXDOP

Upgrade vector indexes to the latest version

Migrating from earlier vector index versions

Step 1: Identify existing vector indexes

How to interpret the results

Step 2: Drop and recreate the vector index

Drop the existing index

Recreate the index

Step 3: Verify the index version

Error behavior with version incompatibility

Limitations and considerations

Earlier vector index version limitations

Current limitations (applies to the latest version too)

Minimum data requirements

DML support

Behavior notes

Monitoring vector index maintenance

Combining vector indexes with traditional indexes

Data quality and maintenance guidance for vector indexes

Avoid datasets with high duplicate embeddings

Large-scale data replacement scenarios

Known issues

Permissions

Examples

Example 1

Example 2

Example 3

Example 4: Working with DML operations

Delete rows

Insert new rows

Update existing rows

Use MERGE for complex operations

Related content

Feedback

Additional resources