Statistical classification of multispectral imaging data

Appe, Andrea

Record link:

https://resolver.obvsg.at/urn:nbn:at:at-ubtuw:1-50703
http://hdl.handle.net/20.500.12708/9878

Title:

Statistical classification of multispectral imaging data

Citation:

Appe, A. (2012). Statistical classification of multispectral imaging data [Diploma Thesis, Technische Universität Wien]. reposiTUm. https://resolver.obvsg.at/urn:nbn:at:at-ubtuw:1-50703

CatalogPlus:

AC07813489

Publication Type:

Thesis - Diplomarbeit

Language:

English

Authors:

Appe, Andrea

Advisor:

Filzmoser, Peter

Organisational Unit:

E105 - Institut für Statistik und Wahrscheinlichkeitstherorie

Date (published):

2012

Number of Pages:

Keywords:

Zwei-Gruppen-Klassifikation; überwachtes Lernen; Diskriminanzanalyse; Support Vector Machine

two-group classification; supervised learning; discriminant analysis; support vector machine

Abstract:

Multispektrale Daten, die mit Hilfe eines tragbaren Aufnahmegeräts gesammelt wurden, werden mittels verschiedener statistischer Klassifikationsmethoden einer von zwei möglichen Gruppen zugeordnet. Die für die Analyse verwendeten Daten sind Pixel-Bilder bei neun Wellenlängen von sechs Objekten und werden zu Matrizen umgeformt: die Spalten sind die für die Datengewinnung verwendeten Wellenlängen, die Zeilen sind die Beobachtungen, deren Anzahl sich aus der Dimension der Bilder ergibt. Die Leistung der folgenden Methoden der überwachten Klassifikation wird ausgewertet: Lineare Diskriminanzanalyse (LDA), Quadratische Diskriminanzanalyse (QDA), robuste LDA, robuste QDA und Support Vector Machines (SVM) mit linearen und radial basis Kernen. Misklassifikationsraten werden verwendet um die Modelle zu evaluieren und Receiver Operator Characteristics helfen die Ergebnisse zu veranschaulichen. Die Ergebnisse zeigen, dass die robuste LDA für diese Art von Daten, welche Ausreißer beinhaltet, am besten funktioniert. Um die Leistung auf Teilen des Gesamtbildes zu analysieren wird Mehrheitsvoting verwendet. Damit wird auch dem Gesamtbild ein Label zugeordnet anstatt jedem einzelnen Pixel. Für die Datenaufbereitung und die statistische Analyse wurde die Software Matlab bzw. R verwendet.<br />

Multispectral data which have been collected by a hand-held device are classified into two groups by employing different statistical classification methods. The data used for the analysis are pixel images at nine wavelenghts of six objects which have been reshaped to matrices: the columns are the wavelengths used for data acquisition, the rows are the number of observations resulting from the dimension of the images. The performance of the following supervised classication methods is evaluated: Linear Discriminant Analysis (LDA), Quadratic Discriminant Analysis (QDA), robust LDA, robust QDA and Support Vector Machines (SVM) with linear and radial basis kernels. Misclassication rates are used to evaluate the models and Receiver Operator Characteristics help to visualise their performance. The results show that robust LDA works best for this kind of data, which contain outliers. Majority voting is applied to analyse the performance on parts of the images and to assign exactly one label to the whole image instead of each pixel. Data preprocessing and the statistical analysis are done with the software Matlab and R, respectively.

License:

In Copyright

Appears in Collections:

Thesis