<div class="csl-bib-body">
<div class="csl-entry">Pichler, G., Romanelli, M., Manivannan, D. P., Krishnamurthy, P., Khorrami, F., & Garg, S. (2024). On the (In)feasibility of ML Backdoor Detection as an Hypothesis Testing Problem. In <i>Proceedings of The 27th International Conference on Artificial Intelligence and Statistics</i> (pp. 4051–4059). PMLR. https://doi.org/10.34726/8503</div>
</div>
-
dc.identifier.uri
http://hdl.handle.net/20.500.12708/210566
-
dc.identifier.uri
https://doi.org/10.34726/8503
-
dc.description.abstract
We introduce a formal statistical definition for the problem of backdoor detection in machine learning systems and use it to analyze the feasibility of such problems, providing evidence for the utility and applicability of our definition. The main contributions of this work are an impossibility result and an achievability result for backdoor detection. We show a no-free-lunch theorem, proving that universal (adversary-unaware) backdoor detection is impossible, except for very small alphabet sizes. Thus, we argue, that backdoor detection methods need to be either explicitly, or implicitly adversary-aware. However, our work does not imply that backdoor detection cannot work in specific scenarios, as evidenced by successful backdoor detection methods in the scientific literature. Furthermore, we connect our definition to the probably approximately correct (PAC) learnability of the out-of-distribution detection problem.
en
dc.language.iso
en
-
dc.relation.ispartofseries
Proceedings of Machine Learning Research
-
dc.rights.uri
http://rightsstatements.org/vocab/InC/1.0/
-
dc.subject
backdoor attacks
en
dc.subject
backdoor detection
en
dc.subject
Out-of-distribution
en
dc.subject
Statistics
en
dc.subject
hypothesis testing
en
dc.subject
PAC learning
en
dc.title
On the (In)feasibility of ML Backdoor Detection as an Hypothesis Testing Problem
en
dc.type
Inproceedings
en
dc.type
Konferenzbeitrag
de
dc.rights.license
Urheberrechtsschutz
de
dc.rights.license
In Copyright
en
dc.identifier.doi
10.34726/8503
-
dc.contributor.affiliation
New York University, United States of America (the)
-
dc.contributor.affiliation
New York University, United States of America (the)
-
dc.contributor.affiliation
New York University, United States of America (the)
-
dc.contributor.affiliation
New York University, United States of America (the)
-
dc.contributor.affiliation
New York University, United States of America (the)
-
dc.description.startpage
4051
-
dc.description.endpage
4059
-
dc.rights.holder
Copyright 2024 by the author(s)
-
dc.type.category
Full-Paper Contribution
-
tuw.booktitle
Proceedings of The 27th International Conference on Artificial Intelligence and Statistics