UUM Electronic Theses and Dissertation
UUM ETD | Universiti Utara Malaysian Electronic Theses and Dissertation
FAQs | Feedback | Search Tips | Sitemap

Robust linear discriminant rules with coordinatewise and distance based approaches

Lim, Yai Fung (2020) Robust linear discriminant rules with coordinatewise and distance based approaches. Doctoral thesis, Universiti Utara Malaysia.

[thumbnail of Deposit Permission_s900800.pdf] Text
Deposit Permission_s900800.pdf
Restricted to Repository staff only

Download (1MB) | Request a copy
[thumbnail of s900800_01.pdf] Text

Download (3MB)
[thumbnail of s900800_references.docx] Text

Download (90kB)


Linear discriminant analysis (LDA) is one of the supervised classification techniques to deal with relationship between a categorical variable and a set of continuous variables. The main objective of LDA is to create a function to distinguish between groups and allocating future observations to previously defined groups. Under the assumptions of normality and homoscedasticity, the LDA yields optimal linear discriminant rule (LDR) between two or more groups. However, the optimality of LDA highly relies on the sample mean and sample covariance matrix which are known to be sensitive to outliers. To abate these conflicts, robust location and scale estimators via coordinatewise and distance based approaches have been applied in constructing new robust LDA. These robust estimators were used to replace the classical sample mean and sample covariance to form robust linear discriminant rules (RLDR). A total of six RLDR, namely four coordinatewise (RLDRM, RLDRMw, RLDRW, RLDRWw) and two distance based (RLDRV, RLDRT) approaches have been proposed and implemented in this study. Simulation and real data study were conducted to investigate on the performance of the proposed RLDR, measured in terms of misclassification error rates and computational time. Several data conditions such as non-normality, heteroscedasticity, balanced and unbalanced data set were manipulated in the simulation study to evaluate the performance of these proposed RLDR. In real data study, a set of diabetes data was used. This data set violated the assumptions of normality as well as homoscedasticity. The results showed that the novel RLDRV is the best proposed RLDR to solve classification problem since it provides as much as 91.03% accuracy in classification as shown in the real data study. The proposed RLDR are good alternatives to the classical LDR as well as existing RLDR since these RLDR perform well in classification problems even under contaminated data.

Item Type: Thesis (Doctoral)
Supervisor : Syed Yahaya, Sharipah and Ali, Hazlina
Item ID: 8799
Uncontrolled Keywords: Linear discriminant analysis, Coordinatewise based robust estimators, Distance based robust estimators, Misclassification error rates
Subjects: Q Science > QA Mathematics > QA299.6-433 Analysis
Divisions: Awang Had Salleh Graduate School of Arts & Sciences
Date Deposited: 14 Nov 2021 07:40
Last Modified: 14 Nov 2021 07:40
Department: Awang Had Salleh Graduate School of Arts & Sciences
Name: Syed Yahaya, Sharipah and Ali, Hazlina
URI: https://etd.uum.edu.my/id/eprint/8799

Actions (login required)

View Item
View Item