Exploring and Comparing Clusterings of Multivariate Data Sets Using Persistent Homology

dc.contributor.authorRieck, Bastianen_US
dc.contributor.authorLeitte, Heikeen_US
dc.contributor.editorKwan-Liu Ma and Giuseppe Santucci and Jarke van Wijken_US
dc.date.accessioned2016-06-09T09:32:36Z
dc.date.available2016-06-09T09:32:36Z
dc.date.issued2016en_US
dc.description.abstractClustering algorithms support exploratory data analysis by grouping inputs that share similar features. Especially the clustering of unlabelled data is said to be a fiendishly difficult problem, because users not only have to choose a suitable clustering algorithm but also a suitable number of clusters. The known issues of existing clustering validity measures comprise instabilities in the presence of noise and restrictive assumptions about cluster shapes. In addition, they cannot evaluate individual clusters locally. We present a new measure for assessing and comparing different clusterings both on a global and on a local level. Our measure is based on the topological method of persistent homology, which is stable and unbiased towards cluster shapes. Based on our measure, we also describe a new visualization that displays similarities between different clusterings (using a global graph view) and supports their comparison on the individual cluster level (using a local glyph view). We demonstrate how our visualization helps detect different—-but equally valid-clusterings of data sets from multiple application domains.en_US
dc.description.number3en_US
dc.description.sectionheadersStructures, Clusters, and Patternsen_US
dc.description.seriesinformationComputer Graphics Forumen_US
dc.description.volume35en_US
dc.identifier.doi10.1111/cgf.12884en_US
dc.identifier.issn1467-8659en_US
dc.identifier.pages081-090en_US
dc.identifier.urihttps://doi.org/10.1111/cgf.12884en_US
dc.identifier.urihttps://diglib.eg.org:443/handle/10
dc.publisherThe Eurographics Association and John Wiley & Sons Ltd.en_US
dc.subjectI.3.6 [Computer Graphics]en_US
dc.subjectMethodology and Techniquesen_US
dc.subjectInteraction techniquesen_US
dc.titleExploring and Comparing Clusterings of Multivariate Data Sets Using Persistent Homologyen_US
Files