A Theory of Interpretable Approximations

Bressan, Marco; Cesa-Bianchi, Nicolò; Esposito, Emmanuel; Mansour, Yishay; Moran, Shay; Thiessen, Maximilian

Record link:

http://hdl.handle.net/20.500.12708/199886

Title:

A Theory of Interpretable Approximations

Citation:

Bressan, M., Cesa-Bianchi, N., Esposito, E., Mansour, Y., Moran, S., & Thiessen, M. (2024). A Theory of Interpretable Approximations. In S. Agrawal & A. Roth (Eds.), Proceedings of Thirty Seventh Conference on Learning Theory. http://hdl.handle.net/20.500.12708/199886

CatalogPlus:

AC17356795

Publication Type:

Inproceedings - Full-Paper Contribution

Language:

English

Authors:

Bressan, Marco
Cesa-Bianchi, Nicolò
Esposito, Emmanuel
Mansour, Yishay
Moran, Shay
Thiessen, Maximilian

Organisational Unit:

E194-06 - Forschungsbereich Machine Learning

Published in:

Proceedings of Thirty Seventh Conference on Learning Theory

Volume:

247

Date (published):

2024

Event name:

37th Annual Conference on Learning Theory

Event date:

30-Jun-2024 - 3-Jul-2024

Event place:

Edmonton, Canada

Number of Pages:

Peer reviewed:

Yes

Keywords:

Machine Learning; Interpretability; Learning Theory; Decision Trees

Abstract:

Can a deep neural network be approximated by a small decision tree based on simple features? This question and its variants are behind the growing demand for machine learning models that are \emph{interpretable} by humans. In this work we study such questions by introducing \emph{interpretable approximations}, a notion that captures the idea of approximating a target concept $c$ by a small aggregation of concepts from some base class $\mathcal{H}$. In particular, we consider the approximation of a binary concept $c$ by decision trees based on a simple class $\mathcal{H}$ (e.g., of bounded VC dimension), and use the tree depth as a measure of complexity. Our primary contribution is the following remarkable trichotomy. For any given pair of $\mathcal{H}$ and $c$, exactly one of these cases holds: (i) $c$ cannot be approximated by $\mathcal{H}$ with arbitrary accuracy; (ii) $c$ can be approximated by $\mathcal{H}$ with arbitrary accuracy, but there exists no universal rate that bounds the complexity of the approximations as a function of the accuracy; or (iii) there exists a constant $\kappa$ that depends only on $\mathcal{H}$ and $c$ such that, for \emph{any} data distribution and \emph{any} desired accuracy level, $c$ can be approximated by $\mathcal{H}$ with a complexity not exceeding $\kappa$. This taxonomy stands in stark contrast to the landscape of supervised classification, which offers a complex array of distribution-free and universally learnable scenarios. We show that, in the case of interpretable approximations, even a slightly nontrivial a-priori guarantee on the complexity of approximations implies approximations with constant (distribution-free and accuracy-free) complexity. We extend our trichotomy to classes $\mathcal{H}$ of unbounded VC dimension and give characterizations of interpretability based on the algebra generated by $\mathcal{H}$.

Project (external):

European Union Horizon Europe
Robert J. Shillman Fellowship
European Research Council

Project ID:

101120237
1225/20 ; 2018385
101039692 ; 882396

Research Areas:

Information Systems Engineering: 100%

Science Branch:

1020 - Informatik: 100%

License:

In Copyright

Appears in Collections:

Conference Paper