<div class="csl-bib-body">
<div class="csl-entry">Meghdouri, F., Iglesias Vazquez, F., & Zseby, T. (2022). Modeling Data with Observers. <i>Intelligent Data Analysis</i>, <i>26</i>(3), 785–803. https://doi.org/10.3233/ida-215741</div>
</div>
-
dc.identifier.issn
1088-467X
-
dc.identifier.uri
http://hdl.handle.net/20.500.12708/138696
-
dc.description.abstract
Compact data models have become relevant due to the massive, ever-increasing generation of data. We propose Observers-based Data Modeling (ODM), a lightweight algorithm to extract low density data models (aka coresets) that are suitable for both static and stream data analysis. ODM coresets keep data internal structures while alleviating computational costs of machine learning during evaluation phases accounting for a O(n log n) worst-case complexity. We compare ODM with previous proposals in classification, clustering, and outlier detection. Results show the preponderance of ODM for obtaining the best trade-off in accuracy, versatility, and speed.