Work with data
Multivector Search
Working with multiple vectors per document in LanceDB Enterprise
A complete example for Multivector search is in this notebook
Multivector Type
LanceDB natively supports multivector data types, enabling advanced search scenarios where a single data item is represented by multiple embeddings (e.g., using models like ColBERT or CoLPali). In this framework, documents and queries are encoded as collections of contextualized vectors—precomputed for documents and indexed for queries.
Key features:
- Indexing on multivector column: store and index multiple vectors per row.
- Supporint query being a single vector or multiple vectors
- Optimized search performance with XTR with improved recall.
Currently, only the cosine
metric is supported for multivector search.
The vector value type can be float16
, float32
, or float64
.
Using Multivector in Python
Currently, multivector search is only support in our Python SDK. Below is an example of using multivector search in LanceDB:
Python