City Research Online

Capturing Visualization Design Rationale

Hutchinson, M. ORCID: 0009-0008-0983-7524, Jianu, R. ORCID: 0000-0002-5834-2658, Slingsby, A. ORCID: 0000-0003-3941-553X , Wood, J. ORCID: 0000-0001-9270-247X & Madhyastha, P. ORCID: 0000-0002-4438-8161 (2025). Capturing Visualization Design Rationale. In: 2025 IEEE Visualization and Visual Analytics (VIS). 2025 IEEE Visualization and Visual Analytics (VIS), 1-7 Nov 2025, Vienna, Austria. doi: 10.1109/vis60296.2025.00052

Abstract

Prior natural language datasets for data visualization have focused on tasks such as visualization literacy assessment, insight generation, and visualization generation from natural language instructions. These studies often rely on controlled setups with purpose-built visualizations and artificially constructed questions. As a result, they tend to prioritize the interpretation of visualizations, focusing on decoding visualizations rather than understanding their encoding. In this paper, we present a new dataset and methodology for probing visualization design rationale through natural language. We leverage a unique source of real-world visualizations and natural language narratives: literate visualization notebooks created by students as part of a data visualization course. These notebooks combine visual artifacts with design exposition, in which students make explicit the rationale behind their design decisions. We also use large language models (LLMs) to generate and categorize question-answer-rationale triples from the narratives and articulations in the notebooks. We then carefully validate the triples and curate a dataset that captures and distills the visualization design choices and corresponding rationales of the students.

Publication Type: Conference or Workshop Item (Paper)
Additional Information: © 2025 IEEE. This accepted manuscript is made available under the terms of the Creative Commons Attribution License (CC-BY), which permits unrestricted use, distribution and reproduction in any medium, provided the original work is properly cited.
Publisher Keywords: Visual analytics, Large language models, Natural languages, Data visualization, Focusing, Encoding , Decoding
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Departments: School of Science & Technology
School of Science & Technology > Department of Computer Science
SWORD Depositor:
[thumbnail of VIS_25_Revisions_FINAL.pdf]
Preview
Text - Accepted Version
Available under License Creative Commons Attribution.

Download (9MB) | Preview

Export

Add to AnyAdd to TwitterAdd to FacebookAdd to LinkedinAdd to PinterestAdd to Email

Downloads

Downloads per month over past year

View more statistics

Actions (login required)

Admin Login Admin Login