Eiben, E., Ordyniak, S., Paesani, G., & Szeider, S. (2023). Learning Small Decision Trees with Large Domain. In Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence (IJCAI-23) (pp. 3184–3192). https://doi.org/10.24963/ijcai.2023/355
E192-01 - Research Unit Algorithms and Complexity
Published in:
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence (IJCAI-23)
ISBN:
978-1-956792-03-4
Date (published):
2023
Event name:
Thirty-Second International Joint Conference on Artificial Intelligence
Event date:
19-Aug-2023 - 25-Aug-2023
Event place:
Macao, China
Number of Pages:
9
Peer reviewed:
Yes
Keywords:
Computational complexity of reasoning; Knowledge Representation; Machine Learning
Abstract:
One prefers decision trees (DTs) of the smallest size or depth to facilitate explainability and interpretability. However, learning such an optimal DT from data is well known to be NP-hard. To overcome this complexity barrier, Ordyniak and Szeider (AAAI 21) initiated the study of optimal DT learning from the parameterized complexity perspective. They showed that solution size (i.e., the number of nodes or the depth of the DT) is insufficient to obtain fixed-parameter tractability (FPT). They therefore proposed an FPT algorithm that utilizes two auxiliary parameters: the maximum difference (a structural property of the data set) and the maximum domain size. They left open the question of whether bounding the maximum domain size is necessary. The main result of this paper answers this question: we present FPT algorithms for learning a smallest or lowest-depth DT from data, with solution size and maximum difference as the only parameters. Our algorithms are thus significantly more powerful than the one by Ordyniak and Szeider, as they can handle problem inputs with features that range over unbounded domains. We also close several gaps concerning the quality of approximation obtained by considering only DTs based on minimum support sets.
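Note (illustrative sketch, not taken from the paper): assuming the standard definition from Ordyniak and Szeider (AAAI 21), the maximum difference of a data set is the largest number of features on which two differently-classified examples disagree. The Python sketch below computes this parameter for a hypothetical toy data set; the function name and data are illustrative assumptions, not artifacts of the paper.

import itertools

def max_difference(examples):
    # examples: list of (feature_vector, label) pairs.
    # The parameter only concerns pairs with different labels, since a DT
    # must separate such a pair on one of the features where they differ.
    delta = 0
    for (x, cx), (y, cy) in itertools.combinations(examples, 2):
        if cx != cy:
            delta = max(delta, sum(a != b for a, b in zip(x, y)))
    return delta

# Toy data set with integer features ranging over a large domain:
data = [
    ((0, 7, 3), 1),
    ((0, 7, 5), 0),  # differs from the first example in 1 feature
    ((9, 2, 3), 1),
    ((9, 4, 5), 0),  # differs from the third example in 2 features
]
print(max_difference(data))  # prints 3, e.g. (0, 7, 3) vs. (9, 4, 5)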