Nematov, I., Sacharidis, D., Hose, K., & Sagi, T. (2024). The Susceptibility of Example-Based Explainability Methods to Class Outliers. arXiv. https://doi.org/10.48550/arXiv.2407.20678
This study explores the impact of class outliers on the effectiveness of example-based explainability methods for black-box machine learning models. We reformulate existing explainability evaluation metrics, such as correctness and relevance, specifically for example-based methods, and introduce a new metric, distinguishability. Using these metrics, we highlight the shortcomings of current example-based explainability methods, including those that attempt to suppress class outliers. We conduct experiments on two datasets, a text classification dataset and an image classification dataset, and evaluate the performance of four state-of-the-art explainability methods. Our findings underscore the need for robust techniques to tackle the challenges posed by class outliers.
Research Areas:
- Information Systems Engineering: 90%
- Logic and Computation: 10%