On the (In)feasibility of ML Backdoor Detection as an Hypothesis Testing Problem

Pichler, Georg; Romanelli, Marco; Manivannan, Divya Prakash; Krishnamurthy, Prashanth; Khorrami, Farshad; Garg, Siddharth

doi:10.34726/8503

DC Field

Value

Language

dc.contributor.author

Pichler, Georg

dc.contributor.author

Romanelli, Marco

dc.contributor.author

Manivannan, Divya Prakash

dc.contributor.author

Krishnamurthy, Prashanth

dc.contributor.author

Khorrami, Farshad

dc.contributor.author

Garg, Siddharth

dc.date.accessioned

2025-02-03T13:17:09Z

dc.date.available

2025-02-03T13:17:09Z

dc.date.issued

2024-05-02

dc.identifier.citation

Pichler, G., Romanelli, M., Manivannan, D. P., Krishnamurthy, P., Khorrami, F., & Garg, S. (2024). On the (In)feasibility of ML Backdoor Detection as an Hypothesis Testing Problem. In Proceedings of The 27th International Conference on Artificial Intelligence and Statistics (pp. 4051–4059). PMLR. https://doi.org/10.34726/8503

dc.identifier.uri

http://hdl.handle.net/20.500.12708/210566

dc.identifier.uri

https://doi.org/10.34726/8503

dc.description.abstract

We introduce a formal statistical definition for the problem of backdoor detection in machine learning systems and use it to analyze the feasibility of such problems, providing evidence for the utility and applicability of our definition. The main contributions of this work are an impossibility result and an achievability result for backdoor detection. We show a no-free-lunch theorem, proving that universal (adversary-unaware) backdoor detection is impossible, except for very small alphabet sizes. Thus, we argue that backdoor detection methods need to be either explicitly or implicitly adversary-aware. However, our work does not imply that backdoor detection cannot work in specific scenarios, as evidenced by successful backdoor detection methods in the scientific literature. Furthermore, we connect our definition to the probably approximately correct (PAC) learnability of the out-of-distribution detection problem.

dc.language.iso

dc.relation.ispartofseries

Proceedings of Machine Learning Research

dc.rights.uri

http://rightsstatements.org/vocab/InC/1.0/

dc.subject

backdoor attacks

dc.subject

backdoor detection

dc.subject

Out-of-distribution

dc.subject

Statistics

dc.subject

hypothesis testing

dc.subject

PAC learning

dc.title

On the (In)feasibility of ML Backdoor Detection as an Hypothesis Testing Problem

dc.type

Inproceedings

dc.type

Conference contribution

dc.rights.license

Copyright protection

dc.rights.license

In Copyright

dc.identifier.doi

10.34726/8503

dc.contributor.affiliation