Visual Exploration of Indirect Bias in Language Models

Louis-Alexandre, Judith; Waldner, Manuela

Visual Exploration of Indirect Bias in Language Models

dc.contributor.author	Louis-Alexandre, Judith	en_US
dc.contributor.author	Waldner, Manuela	en_US
dc.contributor.editor	Hoellt, Thomas	en_US
dc.contributor.editor	Aigner, Wolfgang	en_US
dc.contributor.editor	Wang, Bei	en_US
dc.date.accessioned	2023-06-10T06:34:30Z
dc.date.available	2023-06-10T06:34:30Z
dc.date.issued	2023
dc.description.abstract	Language models are trained on large text corpora that often include stereotypes. This can lead to direct or indirect bias in downstream applications. In this work, we present a method for interactive visual exploration of indirect multiclass bias learned by contextual word embeddings. We introduce a new indirect bias quantification score and present two interactive visualizations to explore interactions between multiple non-sensitive concepts (such as sports, occupations, and beverages) and sensitive attributes (such as gender or year of birth) based on this score.	en_US
dc.description.sectionheaders	VA and Perception
dc.description.seriesinformation	EuroVis 2023 - Short Papers
dc.identifier.doi	10.2312/evs.20231034
dc.identifier.isbn	978-3-03868-219-6
dc.identifier.pages	1-5
dc.identifier.pages	5 pages
dc.identifier.uri	https://doi.org/10.2312/evs.20231034
dc.identifier.uri	https://diglib.eg.org:443/handle/10.2312/evs20231034
dc.publisher	The Eurographics Association	en_US
dc.rights	Attribution 4.0 International License
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/
dc.subject	CCS Concepts: Human-centered computing -> Visual analytics; Computing methodologies -> Natural language processing
dc.subject	Human centered computing
dc.subject	Visual analytics
dc.subject	Computing methodologies
dc.subject	Natural language processing
dc.title	Visual Exploration of Indirect Bias in Language Models	en_US