AVOCADO: Visualization of Workflow-Derived Data Provenance for Reproducible Biomedical Research

dc.contributor.authorStitz, Holgeren_US
dc.contributor.authorLuger, Stefanen_US
dc.contributor.authorStreit, Marcen_US
dc.contributor.authorGehlenborg, Nilsen_US
dc.contributor.editorKwan-Liu Ma and Giuseppe Santucci and Jarke van Wijken_US
dc.date.accessioned2016-06-09T09:33:07Z
dc.date.available2016-06-09T09:33:07Z
dc.date.issued2016en_US
dc.description.abstractA major challenge in data-driven biomedical research lies in the collection and representation of data provenance information to ensure that findings are reproducibile. In order to communicate and reproduce multi-step analysis workflows executed on datasets that contain data for dozens or hundreds of samples, it is crucial to be able to visualize the provenance graph at different levels of aggregation. Most existing approaches are based on node-link diagrams, which do not scale to the complexity of typical data provenance graphs. In our proposed approach, we reduce the complexity of the graph using hierarchical and motif-based aggregation. Based on user action and graph attributes, a modular degree-of-interest (DoI) function is applied to expand parts of the graph that are relevant to the user. This interest-driven adaptive approach to provenance visualization allows users to review and communicate complex multi-step analyses, which can be based on hundreds of files that are processed by numerous workflows. We have integrated our approach into an analysis platform that captures extensive data provenance information, and demonstrate its effectiveness by means of a biomedical usage scenario.en_US
dc.description.number3en_US
dc.description.sectionheadersStory, History, and Evolutionen_US
dc.description.seriesinformationComputer Graphics Forumen_US
dc.description.volume35en_US
dc.identifier.doi10.1111/cgf.12924en_US
dc.identifier.issn1467-8659en_US
dc.identifier.pages481-490en_US
dc.identifier.urihttps://doi.org/10.1111/cgf.12924en_US
dc.identifier.urihttps://diglib.eg.org:443/handle/10
dc.publisherThe Eurographics Association and John Wiley & Sons Ltd.en_US
dc.subjectH.5.2 [Information Systems]en_US
dc.subjectInformation Interfaces and Presentationen_US
dc.subjectUser Interfacesen_US
dc.subjectGraphical user interfaces (GUI)en_US
dc.subjecten_US
dc.subjectJ.3 [Life and Medical Science]en_US
dc.subjectBiology and geneticsen_US
dc.titleAVOCADO: Visualization of Workflow-Derived Data Provenance for Reproducible Biomedical Researchen_US
Files