<div class="csl-bib-body">
<div class="csl-entry">Naseer, M., Prabakaran, B. S., Hasan, O., & Shafique, M. (2023). UnbiasedNets: a dataset diversification framework for robustness bias alleviation in neural networks. <i>Machine Learning</i>. https://doi.org/10.1007/s10994-023-06314-z</div>
</div>
-
dc.identifier.issn
0885-6125
-
dc.identifier.uri
http://hdl.handle.net/20.500.12708/192607
-
dc.description.abstract
The performance of trained neural network (NN) models, in terms of testing accuracy, has improved remarkably over the past several years, especially with the advent of deep learning. However, even the most accurate NNs can be biased toward a specific output classification due to the inherent bias in the available training datasets, which may propagate to real-world implementations. This paper deals with robustness bias, i.e., the bias a trained NN exhibits when its robustness to noise is significantly larger for a certain output class than for the remaining output classes. The bias is shown to result from imbalanced datasets, i.e., datasets in which not all output classes are equally represented. To address this, we propose the UnbiasedNets framework, which leverages K-means clustering and the NN’s noise tolerance to diversify the given training dataset, even when starting from relatively small datasets. This generates balanced datasets and reduces the bias within the datasets themselves. To the best of our knowledge, this is the first framework catering to the robustness bias problem in NNs. We use real-world datasets to demonstrate the efficacy of UnbiasedNets for data diversification, for both binary and multi-label classifiers. The results are compared to well-known tools aimed at generating balanced datasets, and illustrate how existing works have limited success in addressing the robustness bias. In contrast, UnbiasedNets provides a notable improvement over existing works, and even reduces the robustness bias significantly in some cases, as observed by comparing the NNs trained on the diversified and original datasets.
en
dc.description.sponsorship
European Commission
-
dc.language.iso
en
-
dc.publisher
Springer
-
dc.relation.ispartof
Machine Learning
-
dc.rights.uri
http://creativecommons.org/licenses/by/4.0/
-
dc.subject
Bias
en
dc.subject
Data-centric bias alleviation
en
dc.subject
K-means clustering
en
dc.subject
Neural networks
en
dc.subject
Noise tolerance
en
dc.title
UnbiasedNets: a dataset diversification framework for robustness bias alleviation in neural networks