Indexing
Indexing is the process of building a data structure that allows for efficient search. pgvecto.rs
supports three indexing algorithms: brute force (FLAT), IVF, and HNSW. The default algorithm is HNSW.
Assuming there is a table items
and there is a column named embedding
of type vector(n)
, you can create a vector index for squared Euclidean distance with the following SQL.
CREATE INDEX ON items USING vectors (embedding vector_l2_ops);
The vector_l2_ops
is an operator class for squared Euclidean distance. There are all operator classes for each vector type and distance type.
Vector type | Distance type | Operator class |
---|---|---|
vector | squared Euclidean distance | vector_l2_ops |
vector | negative dot product | vector_dot_ops |
vector | cosine distance | vector_cos_ops |
vecf16 | squared Euclidean distance | vecf16_l2_ops |
vecf16 | negative dot product | vecf16_dot_ops |
vecf16 | cosine distance | vecf16_cos_ops |
Now you can perform a KNN search with the following SQL again, this time the vector index is used for searching.
SELECT * FROM items ORDER BY embedding <-> '[3,2,1]' LIMIT 5;
Details
pgvecto.rs
constructs the index asynchronously. When you insert new rows into the table, they will first be placed in an append-only file. The background thread will periodically merge the newly inserted row to the existing index. When a user performs any search prior to the merge process, it scans the append-only file to ensure accuracy and consistency.
Options
We utilize TOML syntax to express the index's configuration. Here's what each key in the configuration signifies:
Key | Type | Description |
---|---|---|
segment | table | Options for segments. |
optimizing | table | Options for background optimizing. |
indexing | table | The algorithm to be used for indexing. |
Options for table segment
.
Key | Type | Default | Description |
---|---|---|---|
max_growing_segment_size | integer | 20_000 | Maximum size of unindexed vectors. |
max_sealed_segment_size | integer | 1_000_000 | Maximum size of vectors for indexing. |
Options for table optimizing
.
Key | Type | Default | Description |
---|---|---|---|
optimizing_threads | integer | sqrt of the number of cores | Maximum threads for indexing. |
sealing_secs | integer | 60 | See below. |
sealing_size | integer | 1 | See below. |
If a write segment larger than sealing_size
do not accept new data for sealing_secs
seconds, the write segment will be turned to a sealed segment.
Options for table indexing
.
Key | Type | Description |
---|---|---|
flat | table | If this table is set, brute force algorithm will be used for the index. |
ivf | table | If this table is set, IVF will be used for the index. |
hnsw | table | If this table is set, HNSW will be used for the index. |
You can choose only one algorithm in above indexing algorithms. Default value is hnsw
.
Options for table flat
.
Key | Type | Description |
---|---|---|
quantization | table | The algorithm to be used for quantization. |
Options for table ivf
.
Key | Type | Default | Description |
---|---|---|---|
nlist | integer | 1000 | Number of cluster units. |
nsample | integer | 65536 | Number of samples for K-Means clustering. |
least_iterations | integer | 16 | Least iterations for K-Means clustering. |
iterations | integer | 500 | Max iterations for K-Means clustering. |
quantization | table | The quantization algorithm to be used. |
Options for table hnsw
.
Key | Type | Default | Description |
---|---|---|---|
m | integer | 12 | Maximum degree of the node. |
ef_construction | integer | 300 | Search scope in building. |
quantization | table | The quantization algorithm to be used. |
Options for table quantization
.
Key | Type | Description |
---|---|---|
trivial | table | If this table is set, no quantization is used. |
scalar | table | If this table is set, scalar quantization is used. |
product | table | If this table is set, product quantization is used. |
You can choose only one algorithm in above indexing algorithms. Default value is trivial
.
Options for table product
.
Key | Type | Default | Description |
---|---|---|---|
sample | integer | 65535 | Samples to be used for quantization. |
ratio | string | enum("x4", "x8", "x16", "x32", "x64") | Compression ratio for quantization. |
Compression ratio is how many memory you are saved for saving vectors. For example, "x4"
is compared to before. If you are creating indexes on vecf16
, "x4"
is compared to before.
Examples
There are some examples.
-- HNSW algorithm, default settings.
CREATE INDEX ON items USING vectors (embedding vector_l2_ops);
--- Or using bruteforce with PQ.
CREATE INDEX ON items USING vectors (embedding vector_l2_ops)
WITH (options = $$
[indexing.flat]
quantization.product.ratio = "x16"
$$);
--- Or using IVFPQ algorithm.
CREATE INDEX ON items USING vectors (embedding vector_l2_ops)
WITH (options = $$
[indexing.ivf]
quantization.product.ratio = "x16"
$$);
-- Use more threads for background building the index.
CREATE INDEX ON items USING vectors (embedding vector_l2_ops)
WITH (options = $$
optimizing.optimizing_threads = 16
$$);
-- Prefer smaller HNSW graph.
CREATE INDEX ON items USING vectors (embedding vector_l2_ops)
WITH (options = $$
segment.max_growing_segment_size = 200000
$$);