<div class="csl-bib-body">
<div class="csl-entry">Louis-Alexandre Dit Petit-Frere, J., & Waldner, M. (2023). Visual Exploration of Indirect Bias in Language Models. In T. Hoelt, W. Aigner, & B. Wang (Eds.), <i>EuroVis 2023 - Short Papers</i>. The Eurographics Association. https://doi.org/10.2312/evs.20231034</div>
</div>
-
dc.identifier.uri
http://hdl.handle.net/20.500.12708/187890
-
dc.description.abstract
Language models are trained on large text corpora that often include stereotypes. This can lead to direct or indirect bias in downstream applications. In this work, we present a method for interactive visual exploration of indirect multiclass bias learned by contextual word embeddings. We introduce a new indirect bias quantification score and present two interactive visualizations to explore interactions between multiple non-sensitive concepts (such as sports, occupations, and beverages) and sensitive attributes (such as gender or year of birth) based on this score.
en
dc.description.sponsorship
FWF Fonds zur Förderung der wissenschaftlichen Forschung (FWF)
-
dc.language.iso
en
-
dc.rights.uri
http://creativecommons.org/licenses/by/4.0/
-
dc.subject
visual exploration
en
dc.subject
language models
en
dc.subject
bias
en
dc.title
Visual Exploration of Indirect Bias in Language Models