Improved method for correcting sample Mahalanobis distance without estimating population eigenvalues or eigenvectors of covariance matrix
-
Published:2019-12-07
Issue:2
Volume:10
Page:121-134
-
ISSN:2364-415X
-
Container-title:International Journal of Data Science and Analytics
-
language:en
-
Short-container-title:Int J Data Sci Anal
Abstract
AbstractThe recognition performance of the sample Mahalanobis distance (SMD) deteriorates as the number of learning samples decreases. Therefore, it is important to correct the SMD for a population Mahalanobis distance (PMD) such that it becomes equivalent to the case of infinite learning samples. In order to reduce the computation time and cost for this main purpose, this paper presents a correction method that does not require the estimation of the population eigenvalues or eigenvectors of the covariance matrix. In short, this method only requires the sample eigenvalues of the covariance matrix, number of learning samples, and dimensionality to correct the SMD for the PMD. This method involves the summation of the SMD’s principal components (each of which is divided by its expectation obtained using the delta method), Lawley’s bias estimation, and the variances of the sample eigenvectors. A numerical experiment demonstrates that this method works well for various cases of learning sample number, dimensionality, population eigenvalues sequence, and non-centrality. The application of this method also shows improved performance of estimating a Gaussian mixture model using the expectation–maximization algorithm.
Funder
Japan Society for the Promotion of Science
Publisher
Springer Science and Business Media LLC
Subject
Applied Mathematics,Computational Theory and Mathematics,Computer Science Applications,Modeling and Simulation,Information Systems
Reference41 articles.
1. Mahalanobis, P.C.: On tests and measures of group divergence. J. Proc. Asiat. Soc. Bengal 26, 541–588 (1930) 2. Hotelling, H.: The generalization of Student’s ratio. Ann. Math. Stat. 2, 360–378 (1931) 3. Patterson, M.T., Anderson, N., Bennett, C., Bruggemann, J., Grossman, R.L., Handy, M., Ly, V., Mandl, D.J., Pederson, S., Pivarski, J., Powell, R., Spring, J., Wells, W., Xia, J.: The Matsu Wheel: a reanalysis framework for Earth satellite imagery in data commons. Int. J. Data Sci. Anal. 4, 251–264 (2017) 4. Cudney, E.A., Hong, J., Jugulum, R., Paryani, K., Ragsdell, K.M., Taguchi, G.: An evaluation of Mahalanobis-Taguchi system and neural network for multivariate pattern recognition. J. Ind. Syst. Eng. 1, 139–150 (2007) 5. Su, C.-T., Wang, P.-C., Chen, Y.-C., Chen, L.-F.: Data mining techniques for assisting the diagnosis of pressure ulcer development in surgical patients. J. Med. Syst. 36, 2387–2399 (2012)
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|