We support lightning fast vector search on massive scale data. Following performance data shows search latency from a 1M dataset with warmed up cache.

PercentileLatency
P5025ms
P9026ms
P9935ms
Max49ms

Other than latency, users can also tune the following parameters for better search quality.

  • nprobes: the number of partitions to search (probe)
  • refine factor: a multiplier to control how many additional rows are taken during the refine step
  • distance range: search for vectors within the distance range

Metadata filtering combined with the vector search is also supported, with as low as 65ms query latency on a 15M dataset. Our benchmark tests have more details.