Documentation Index
Fetch the complete documentation index at: https://docs.lancedb.com/llms.txt
Use this file to discover all available pages before exploring further.
View on Hugging Face
Source dataset card and downloadable files for
lance-format/gqa-testdev-balanced-lance.testdev_balanced slice — 12,578 compositional VQA questions joined with the matching 398 images — sourced from lmms-lab/GQA.
lmms-lab/GQA exposes instructions and images as separate parquet configs; this Lance dataset joins them on imageId, so each row has the question, the answer, the GQA reasoning-program tags, and the image bytes inline.
Splits
| Split | Rows | Distinct images |
|---|---|---|
testdev.lance | 12,578 | 398 |
Train (train_balanced_instructions×train_balanced_images, ~943k Q’s × 72k images, ~10 GB images) and val splits are not bundled by default — pass--instr-config/--images-configtogqa/dataprep.pyto extend.
Schema
| Column | Type | Notes |
|---|---|---|
id | int64 | Row index |
image | large_binary | Inline JPEG bytes (image is duplicated across rows that share an image_id) |
image_id | string | GQA scene-graph image id |
question_id | string | GQA question id |
question | string | Compositional natural-language question |
answers | list<string> | One-element list (the GQA short answer) |
answer | string | Same short answer (canonical / FTS target) |
full_answer | string? | Full sentence answer |
structural | string? | One of verify, query, compare, choose, logical |
semantic | string? | One of attr, cat, global, obj, rel |
detailed | string? | Fine-grained type (e.g. weatherVerifyC) |
is_balanced | bool | GQA balanced subset flag |
group_global / group_local | string? | GQA reasoning-group ids |
semantic_str | string? | Compact description of the reasoning program |
image_emb | fixed_size_list<float32, 512> | CLIP image embedding (cosine-normalized) |
question_emb | fixed_size_list<float32, 512> | CLIP text embedding of the question |
Pre-built indices
IVF_PQonimage_embandquestion_emb—metric=cosineINVERTED(FTS) onquestionandanswerBITMAPonstructural,semantic,detailedBTREEonimage_id,question_id
Quick start
Load with LanceDB
These tables can also be consumed by LanceDB, the multimodal lakehouse and embedded search library built on top of Lance, for simplified vector search and other queries.LanceDB vector search
LanceDB full-text search
Filter by reasoning type
Filter with LanceDB
Why Lance?
- One dataset for the joined image + question + answer + reasoning-program metadata + dual embeddings + indices — no instructions/images parquet split to keep in sync.
- Schema evolution: add columns (alternate scene graphs, model predictions) without rewriting the data.
Source & license
Converted fromlmms-lab/GQA. GQA is released under CC BY 4.0 by Hudson and Manning (Stanford NLP).