Use the `read_consistency_interval` parameter on the connection to control how often reads check for updates from other writers.
There are three possible settings for `read_consistency_interval` when working with `LanceTable`:
- Unset (default): no automatic cross-process refresh checks.
- Zero seconds: check for updates on every read (strongest freshness).
- Non-zero interval: check for updates after the interval elapses (eventual refresh).
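The behavior of these three settings can be sketched with a small helper that decides whether a read should first check other writers for updates. This is a simplified model for illustration, not the library's actual implementation; the function name and elapsed-time argument are hypothetical:

```python
from datetime import timedelta
from typing import Optional

def should_check_for_updates(
    interval: Optional[timedelta], seconds_since_last_check: float
) -> bool:
    """Simplified model of the three read_consistency_interval settings."""
    if interval is None:
        # Unset (default): no automatic cross-process refresh checks.
        return False
    if interval == timedelta(0):
        # Zero seconds: strong consistency -- check on every read.
        return True
    # Non-zero: eventual refresh -- check once the interval has elapsed.
    return seconds_since_last_check >= interval.total_seconds()
```

Note that with the unset default, freshness is entirely in the caller's hands; the manual refresh path described below is the only way to see other writers' changes.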
Consistency in Remote Tables
In LanceDB Enterprise (where you use `RemoteTable`), consistency is deployment-configured (via a `weak_read_consistency_interval_seconds` parameter in the cluster setup) and is not an end-user setting in the SDK. You can still use `checkout_latest` / `checkoutLatest` for explicit manual refresh.
Configure Consistency Parameters
To set strong consistency, set the interval to 0. For eventual consistency, use a non-zero interval. With the default unset interval, tables do not auto-refresh from other writers; to manually check for updates, use `checkout_latest` / `checkoutLatest`.
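In the Python SDK, the three modes can be sketched as follows. This is a hedged sketch: it assumes the `lancedb` package is installed and that `connect` accepts a `read_consistency_interval` timedelta, as in the OSS Python SDK; `connect_with_consistency` and `to_interval` are hypothetical helper names:

```python
from datetime import timedelta
from typing import Optional

def to_interval(seconds: Optional[float]) -> Optional[timedelta]:
    """Map a setting in seconds onto the timedelta the SDK expects."""
    return None if seconds is None else timedelta(seconds=seconds)

def connect_with_consistency(uri: str, seconds: Optional[float] = None):
    """Hypothetical helper: open a connection with a chosen consistency mode.

    seconds=None -> default: no automatic cross-process refresh checks
    seconds=0    -> strong consistency: check for updates on every read
    seconds=5    -> eventual consistency: check after 5 seconds elapse
    """
    import lancedb  # assumed installed: pip install lancedb
    return lancedb.connect(uri, read_consistency_interval=to_interval(seconds))

# With the default (unset) interval, refresh manually instead:
#   db = connect_with_consistency("data/sample-lancedb")
#   table = db.open_table("my_table")
#   table.checkout_latest()  # explicit check for updates from other writers
```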
Handle bad vectors
This section is currently specific to the Python SDK.
Use the `on_bad_vectors` parameter to choose how invalid vector values are handled. Vectors are invalid if:
- They are the wrong dimension
- They contain NaN values
- They are null but are on a non-nullable field
The available options are:
- `drop`: Ignore rows with bad vectors.
- `fill`: Replace bad values (NaNs) or missing values (too few dimensions) with the fill value specified in the `fill_value` parameter. An input like `[1.0, NaN, 3.0]` will be replaced with `[1.0, 0.0, 3.0]` if `fill_value=0.0`.
- `null`: Replace bad vectors with null (only works if the column is nullable). A bad vector `[1.0, NaN, 3.0]` will be replaced with `null` if the column is nullable. If the vector column is non-nullable, then bad vectors will cause an error.
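A rough pure-Python sketch of what these policies do to a single vector, for illustration only (not the library's implementation — in the real SDK you pass `on_bad_vectors` and `fill_value` to table-creation or add methods; `DIM` and `apply_policy` are assumptions of this sketch):

```python
import math

DIM = 3  # expected vector dimension (assumed for this sketch)

def apply_policy(vec, on_bad_vectors, fill_value=0.0, nullable=True):
    """Apply one of the documented bad-vector policies to a single vector.

    Returns the (possibly repaired) vector, or None for a dropped row or a
    nulled-out vector; raises for a bad vector in a non-nullable column.
    """
    bad = vec is None or len(vec) != DIM or any(math.isnan(v) for v in vec)
    if not bad:
        return vec
    if on_bad_vectors == "drop":
        return None  # caller skips this row entirely
    if on_bad_vectors == "fill":
        vec = [] if vec is None else list(vec)[:DIM]
        vec = [fill_value if math.isnan(v) else v for v in vec]
        vec += [fill_value] * (DIM - len(vec))  # pad missing dimensions
        return vec
    if on_bad_vectors == "null":
        if not nullable:
            raise ValueError("bad vector in a non-nullable column")
        return None  # stored as null
    raise ValueError(f"unknown policy: {on_bad_vectors}")
```

For example, `apply_policy([1.0, float("nan"), 3.0], "fill", fill_value=0.0)` repairs the vector to `[1.0, 0.0, 3.0]`, mirroring the `fill` behavior described above.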