Documentation Index
Fetch the complete documentation index at: https://docs.lancedb.com/llms.txt
Use this file to discover all available pages before exploring further.
View on Hugging Face
Source dataset card and downloadable files for
lance-format/chartqa-lance.lmms-lab/ChartQA.
Splits
| Split | Rows |
|---|---|
test.lance | 2,500 |
Thelmms-lab/ChartQAredistribution exposes test only. Train and validation live in the original release (https://github.com/vis-nlp/ChartQA); add them viachartqa/dataprep.py --splitsonce a parquet mirror is identified.
Schema
| Column | Type | Notes |
|---|---|---|
id | int64 | Row index |
image | large_binary | Inline chart image bytes |
image_id / question_id | string? | (Source does not assign explicit ids — null for now) |
question | string | Natural-language question |
answers | list<string> | Reference answer (typically a single string) |
answer | string | First answer — used as canonical |
type | string? | Question type (human vs augmented) |
image_emb | fixed_size_list<float32, 512> | CLIP image embedding (cosine-normalized) |
question_emb | fixed_size_list<float32, 512> | CLIP text embedding of the question |
Pre-built indices
IVF_PQonimage_embandquestion_emb—metric=cosineINVERTED(FTS) onquestionandanswerBITMAPontype
Quick start
Load with LanceDB
These tables can also be consumed by LanceDB, the multimodal lakehouse and embedded search library built on top of Lance, for simplified vector search and other queries.LanceDB vector search
LanceDB full-text search
Source & license
Converted fromlmms-lab/ChartQA. The original ChartQA dataset is released under the GNU GPL-3.0 license by Masry et al.