Documentation Index
Fetch the complete documentation index at: https://docs.lancedb.com/llms.txt
Use this file to discover all available pages before exploring further.
How to find metrics
Job metrics can be found in the Geneva Console UI, by clicking on a job’s ID to get to the “Job details” page.Core diagnostic metrics
| Metric | What it means | Common signal |
|---|---|---|
rows_checkpointed | Rows finished by read/UDF/checkpoint stage. | High value means upstream compute is progressing. |
rows_ready_for_commit | Rows ready for atomic commit (becoming visible to other DB connections). | If much lower than rows_checkpointed, writer path is likely bottlenecked. |
rows_committed | Rows already visible to other DB connections. | If lagging far behind rows_ready_for_commit, commit stage may be bottlenecked. |
cnt_geneva_workers_active | Current parallel UDF executors. | Lower than expected means reduced effective parallelism. |
cnt_geneva_workers_pending | Deficit from desired parallelism. | Persistently high value usually means scheduling/resource pressure. |
read_io_time_ms | Cumulative read IO time. | Dominant value suggests storage/read bottleneck. |
udf_processing_time | Cumulative UDF execution time. | Dominant value suggests compute/UDF bottleneck. |
batch_checkpointing_time | Cumulative batch checkpoint overhead. | High value suggests checkpoint overhead is expensive. |
writer_write_time | Cumulative writer output time. | High value often points to object storage throughput/throttling issues. |
writer_queue_wait_time_ms | Cumulative writer queue wait time. | High value can indicate writer starvation/backpressure. |
commit_time_ms | Cumulative commit time. | High value means commit itself is expensive. |
commit_conflict_retries | Commit retries due to version conflicts. | Non-trivial counts indicate commit contention. |
commit_backoff_time_ms | Time spent backing off during commit retries. | High value indicates contention/retry pressure. |
commit_concurrent_writer_retries | Retries from “Too many concurrent writers”. | High value indicates writer concurrency contention. |
Quick diagnosis workflow
- Check
rows_checkpointedvsrows_ready_for_commit.- If
rows_checkpointedis high butrows_ready_for_commitis low, fragment writer is usually the bottleneck. - This often indicates object storage read/write pressure (for example S3).
- If
- Compare read, UDF, and checkpoint timing.
- High
read_io_time_ms: storage or scan bottleneck. - High
udf_processing_time: UDF compute bottleneck. - High
batch_checkpointing_time: checkpoint overhead bottleneck. - Typical mitigations: increase
checkpoint_size, increasemax_checkpoint_size, or compact the table to produce larger fragments.
- High
- Check writer timing.
- High
writer_write_timeis commonly object storage throttling/throughput limit. - Typical mitigations: use higher network-bandwidth node types, and keep object storage and compute nodes in the same region.
- High
- Check commit pressure.
- High
commit_conflict_retries,commit_backoff_time_ms, orcommit_concurrent_writer_retriesindicates commit contention.
- High
- Check parallelism deficit.
- If
cnt_geneva_workers_pendingstays high whilecnt_geneva_workers_activestays low, the job is running below desired parallelism due to cluster/resource constraints.
- If
Notes
- Timing metrics are cumulative and may overlap; do not sum them as exact wall time.
- For completed jobs, row counters should settle to stable final values.