Topic Tomographies (TopTom): a Visual Approach to Distill Information From Media Streams
dc.contributor.author | Gobbo, Beatrice | en_US |
dc.contributor.author | Balsamo, Duilio | en_US |
dc.contributor.author | Mauri, Michele | en_US |
dc.contributor.author | Bajardi, Paolo | en_US |
dc.contributor.author | Panisson, André | en_US |
dc.contributor.author | CIUCCARELLI, PAOLO | en_US |
dc.contributor.editor | Gleicher, Michael and Viola, Ivan and Leitte, Heike | en_US |
dc.date.accessioned | 2019-06-02T18:28:46Z | |
dc.date.available | 2019-06-02T18:28:46Z | |
dc.date.issued | 2019 | |
dc.description.abstract | In this paper we present TopTom, a digital platform whose goal is to provide analytical and visual solutions for the exploration of a dynamic corpus of user-generated messages and media articles, with the aim of i) distilling the information from thousands of documents in a low-dimensional space of explainable topics, ii) cluster them in a hierarchical fashion while allowing to drill down to details and stories as constituents of the topics, iii) spotting trends and anomalies. TopTom implements a batch processing pipeline able to run both in near-real time with time stamped data from streaming sources and on historical data with a temporal dimension in a cold start mode. The resulting output unfolds along three main axes: time, volume and semantic similarity (i.e. topic hierarchical aggregation). To allow the browsing of data in a multiscale fashion and the identification of anomalous behaviors, three visual metaphors were adopted from biological and medical fields to design visualizations, i.e. the flowing of particles in a coherent stream, tomographic cross sectioning and contrast-like analysis of biological tissues. The platform interface is composed by three main visualizations with coherent and smooth navigation interactions: calendar view, flow view, and temporal cut view. The integration of these three visual models with the multiscale analytic pipeline proposes a novel system for the identification and exploration of topics from unstructured texts. We evaluated the system using a collection of documents about the emerging opioid epidemics in the United States. | en_US |
dc.description.number | 3 | |
dc.description.sectionheaders | Geospatial and Social Data | |
dc.description.seriesinformation | Computer Graphics Forum | |
dc.description.volume | 38 | |
dc.identifier.doi | 10.1111/cgf.13714 | |
dc.identifier.issn | 1467-8659 | |
dc.identifier.pages | 609-621 | |
dc.identifier.uri | https://doi.org/10.1111/cgf.13714 | |
dc.identifier.uri | https://diglib.eg.org:443/handle/10.1111/cgf13714 | |
dc.publisher | The Eurographics Association and John Wiley & Sons Ltd. | en_US |
dc.subject | Human | |
dc.subject | centered computing | |
dc.subject | Visualization | |
dc.subject | Information systems | |
dc.subject | Document topic models | |
dc.subject | Expert search | |
dc.title | Topic Tomographies (TopTom): a Visual Approach to Distill Information From Media Streams | en_US |