ADAPTIVE VARIABLE EXTRACTIONS WITH LDA FOR CLASSIFICATION OF MIXED VARIABLES, AND APPLICATIONS TO MEDICAL DATA

Published:2021-06-11 Issue:Number 3 Volume:20 Page:305-327
ISSN:2180-3862
Container-title:Journal of Information and Communication Technology
language:en
Short-container-title:JICT

Author:

Hamid Hashibah¹,Mahat Nor Idayu¹,Ibrahim Safwati²

Affiliation:

1. School of Quantitative Sciences, UUM College of Arts and Sciences, 06010 UUM Sintok, Kedah

2. Institute of Engineering Mathematics, Universiti Malaysia Perlis, 02600 UniMAP Arau, Perlis

Abstract

This paper examines strategies for extracting variables from mixed data when building a Linear Discriminant Analysis (LDA) model. Two methods were employed to extract the most important variables from a dataset containing both categorical and continuous variables: multiple correspondence analysis (MCA) and principal component analysis (PCA). Neither MCA nor PCA can be applied directly to mixed variables, because each method is restricted in the data structure it can handle. The paper therefore introduces some adjustments, including a strategy for managing mixed variables so that they are equivalent in values. With these adjustments, both MCA and PCA can be performed on the mixed variables simultaneously. The extracted variables were then used to construct the LDA model, which was in turn applied to classify future objects. The proposed models were tested on three real medical datasets. The results indicated that combining MCA and PCA for extraction with LDA could reduce the model’s size, which had a positive effect on classification and improved model performance by lowering the leave-one-out error rate. Accordingly, the proposed models, together with the adapted strategy, produced better results than the full LDA model. The indicators used to extract and retain variables in the model, namely cumulative variance explained (CVE), eigenvalue, and a non-significant shift in the CVE (constant change), could serve as a useful reference or guideline for practitioners facing similar issues in future.
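The three retention indicators named in the abstract (CVE, eigenvalue, and constant change in the CVE) can be illustrated with a minimal sketch. The function names, the thresholds (0.80 for CVE, 1.0 for eigenvalues, 0.10 for the change in CVE), and the sample eigenvalues are all assumptions chosen for illustration; they are not taken from the paper, and the paper's actual cut-offs may differ.

```python
from itertools import accumulate


def cumulative_variance_explained(eigenvalues):
    """Return the CVE after each component, with eigenvalues sorted descending."""
    ordered = sorted(eigenvalues, reverse=True)
    total = sum(ordered)
    return [running / total for running in accumulate(ordered)]


def retain_by_cve(eigenvalues, threshold=0.80):
    """Smallest number of components whose CVE reaches the (assumed) threshold."""
    for k, cve in enumerate(cumulative_variance_explained(eigenvalues), start=1):
        if cve >= threshold:
            return k
    return len(eigenvalues)


def retain_by_eigenvalue(eigenvalues, cutoff=1.0):
    """Number of components whose eigenvalue exceeds the (assumed) cutoff."""
    return sum(1 for ev in eigenvalues if ev > cutoff)


def retain_by_constant_change(eigenvalues, tol=0.10):
    """Keep components until the gain in CVE from the next one falls below tol."""
    cve = cumulative_variance_explained(eigenvalues)
    for k in range(1, len(cve)):
        if cve[k] - cve[k - 1] < tol:
            return k
    return len(cve)


# Hypothetical eigenvalues from an extraction step (PCA or MCA).
sample = [4.0, 2.0, 1.0, 0.5, 0.5]
print(cumulative_variance_explained(sample))
print(retain_by_cve(sample), retain_by_eigenvalue(sample), retain_by_constant_change(sample))
```

The three rules can disagree on how many components to keep, which is one reason to treat them as guidelines rather than fixed criteria.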

Publisher

UUM Press, Universiti Utara Malaysia

Subject

General Mathematics,General Computer Science
