The State of the Art in Creating Visualization Corpora for Automated Chart Analysis

Thumbnail Image
Date
2023
Journal Title
Journal ISSN
Volume Title
Publisher
The Eurographics Association and John Wiley & Sons Ltd.
Abstract
We present a state-of-the-art report on visualization corpora in automated chart analysis research. We survey 56 papers that created or used a visualization corpus as the input of their research techniques or systems. Based on a multi-level task taxonomy that identifies the goal, method, and outputs of automated chart analysis, we examine the property space of existing chart corpora along five dimensions: format, scope, collection method, annotations, and diversity. Through the survey, we summarize common patterns and practices of creating chart corpora, identify research gaps and opportunities, and discuss the desired properties of future benchmark corpora and the required tools to create them.
Description

CCS Concepts: Computing methodologies -> Machine learning; Human-centered computing -> Visualization

        
@article{
10.1111:cgf.14855
, journal = {Computer Graphics Forum}, title = {{
The State of the Art in Creating Visualization Corpora for Automated Chart Analysis
}}, author = {
Chen, Chen
and
Liu, Zhicheng
}, year = {
2023
}, publisher = {
The Eurographics Association and John Wiley & Sons Ltd.
}, ISSN = {
1467-8659
}, DOI = {
10.1111/cgf.14855
} }
Citation
Collections