A Classification Study in High-Dimensional Data of Linear Discriminant Analysis and Regularized Discriminant Analysis
Author:
Araveeporn Autcha1, Banditvilai Somsri1
Affiliation:
1. Department of Statistics, School of Science, King Mongkut's Institute of Technology Ladkrabang 10520, Bangkok, THAILAND
Abstract
The objective of this work is to compare linear discriminant analysis (LDA) and regularized discriminant analysis (RDA) for classification in high-dimensional data. This dataset consists of the response variable as a binary or dichotomous variable and the explanatory as a continuous variable. The LDA and RDA methods are well-known in statistical and probabilistic learning classification. The LDA has created the decision boundary as a linear function where the covariance of two classes is equal. Then the RDA is extended from the LDA to resolve the estimated covariance when the number of observations exceeds the explanatory variables, or called high-dimensional data. The explanatory dataset is generated from the normal distribution, contaminated normal distribution, and uniform distribution. The binary of the response variables is computed from the logit function depending on the explanatory variable. The highest average accuracy percentage evaluates to propose the performance of the classification methods in several situations. Through simulation results, the LDA was successful when using large sample sizes, but the RDA performed when using the most sample sizes.
Publisher
World Scientific and Engineering Academy and Society (WSEAS)
Subject
General Mathematics
Reference18 articles.
1. T. Ramayah, N.H. Ahmad, H. A. Halim, S, R. M. Zanai, M. H. Lo, Discriminant analysis: An illustrate example, African Journal of Business Management, Vol.4, No.9, 2010, pp. 1654-1667. 2. C. Liu, Discriminant analysis and similarity measure, Pattern Recognition, Vol.47, No.1, 2014, pp.359 -367. 3. A. Tharwat, T. Gaber, A. Ibrahim, A. E. Hassanien, Linear discriminant analysis: detailed tutorial, Ai Communications, Vol.30, No.2, 2017, pp.169-190. 4. A. Sharma, K. K. Paliwal, Linear discriminant analysis for the small sample size problem: an overview, International Journal of Machine Learning and Cybernetics, Vol.6, 2017, pp.443-454. 5. F. Zhu, J. Gao, J. Yang, N. Ye, Neighborhood linear discriminant analysis, Pattern Recognition, Vol.123, No.2, 2022, Article no. 108422.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|