An Empirical Study on the Reliability of Perceiving Correlation Indices using Scatterplots
dc.contributor.author | Sher, Varshita | en_US |
dc.contributor.author | Bemis, Karen G. | en_US |
dc.contributor.author | Liccardi, Ilaria | en_US |
dc.contributor.author | Chen, Min | en_US |
dc.contributor.editor | Heer, Jeffrey and Ropinski, Timo and van Wijk, Jarke | en_US |
dc.date.accessioned | 2017-06-12T05:22:19Z | |
dc.date.available | 2017-06-12T05:22:19Z | |
dc.date.issued | 2017 | |
dc.description.abstract | Scatterplots have been in use for about two centuries, primarily for observing the relationship between two variables and commonly for supporting correlation analysis. In this paper, we report an empirical study that examines how humans' perception of correlation using scatterplots relates to the Pearson's product-moment correlation coefficient (PPMCC) - a commonly used statistical measure of correlation. In particular, we study human participants' estimation of correlation under different conditions, e.g., different PPMCC values, different densities of data points, different levels of symmetry of data enclosures, and different patterns of data distribution. As the participants were instructed to estimate the PPMCC of each stimulus scatterplot, the difference between the estimated and actual PPMCC is referred to as an offset. The results of the study show that varying PPMCC values, symmetry of data enclosure, or data distribution does have an impact on the average offsets, while only large variations in density cause an impact that is statistically significant. This study indicates that humans' perception of correlation using scatterplots does not correlate with computed PPMCC in a consistent manner. The magnitude of offsets may be affected not only by the difference between individuals, but also by geometric features of data enclosures. It suggests that visualizing scatterplots does not provide adequate support to the task of retrieving their corresponding PPMCC indicators, while the underlying model of humans' perception of correlation using scatterplots ought to feature other variables in addition to PPMCC. The paper also includes a theoretical discussion on the cost-benefit of using scatterplots. | en_US |
dc.description.number | 3 | |
dc.description.sectionheaders | Evaluating Visualization | |
dc.description.seriesinformation | Computer Graphics Forum | |
dc.description.volume | 36 | |
dc.identifier.doi | 10.1111/cgf.13168 | |
dc.identifier.issn | 1467-8659 | |
dc.identifier.pages | 061-072 | |
dc.identifier.uri | https://doi.org/10.1111/cgf.13168 | |
dc.identifier.uri | https://diglib.eg.org:443/handle/10.1111/cgf13168 | |
dc.publisher | The Eurographics Association and John Wiley & Sons Ltd. | en_US |
dc.title | An Empirical Study on the Reliability of Perceiving Correlation Indices using Scatterplots | en_US |