Abstract
The recent trend of collecting huge datasets poses a great challenge: high dimensionality, aggravated by the presence of irrelevant dimensions. Machine learning models for regression are recognized as a convenient way of improving the estimation of empirical models, and a popular such model is support vector regression (SVR). The use of principal component analysis (PCA) as a variable-reduction method alongside SVR is suggested here: PCA helps build a predictive model that is both simple, since it contains the smallest number of variables, and efficient. In this paper, we investigate the competence of SVR combined with PCA and explore its performance for more accurate estimation. In a simulation study and on Renal Failure (RF) data, SVR was optimized with four different kernel functions (linear, polynomial, radial basis, and sigmoid) using R software, version R x64 3.2.5, to compare the behavior of ε-SVR and ν-SVR models for sample sizes ranging from small to moderate to large (n = 50, 100, and 150). The performance criteria, root mean squared error (RMSE) and the coefficient of determination R², showed the superiority of ε-SVR over ν-SVR. Furthermore, applying SVR after PCA improves the results. The simulation results showed that the best-performing kernel function is the linear kernel; for the real data, the best kernels are the linear and radial basis functions. With both ε-SVR and ν-SVR, the RMSE values for almost all kernel functions decreased with increasing sample size, and the performance of ε-SVR improved after applying PCA. In addition, sample size n = 50 gave good results for the linear and radial kernels.
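The workflow described in the abstract — reduce the predictors with PCA, then fit ε-SVR and ν-SVR with each of the four kernels and compare RMSE and R² — can be sketched as follows. This is an illustrative sketch only, not the paper's R code: it uses scikit-learn rather than R, synthetic data rather than the RF dataset, and assumed hyperparameters (95% retained variance, default C and ε).

```python
# Illustrative sketch (not the paper's R code): PCA followed by
# epsilon-SVR and nu-SVR, comparing four kernels by RMSE and R^2.
import numpy as np
from sklearn.decomposition import PCA
from sklearn.metrics import mean_squared_error, r2_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVR, NuSVR

rng = np.random.default_rng(0)
n, p = 150, 10                      # assumed sample size and predictor count
X = rng.normal(size=(n, p))
# Only two dimensions are relevant; the rest are irrelevant noise directions,
# mimicking the high-dimensionality problem described in the abstract.
y = X[:, 0] + 0.5 * X[:, 1] + rng.normal(scale=0.1, size=n)

X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.3, random_state=0)

for name, Est in [("eps-SVR", SVR), ("nu-SVR", NuSVR)]:
    for kernel in ["linear", "poly", "rbf", "sigmoid"]:
        # Standardize, keep components explaining 95% of variance, then fit SVR.
        model = make_pipeline(StandardScaler(),
                              PCA(n_components=0.95),
                              Est(kernel=kernel))
        model.fit(X_tr, y_tr)
        pred = model.predict(X_te)
        rmse = mean_squared_error(y_te, pred) ** 0.5
        print(f"{name:7s} {kernel:8s} RMSE={rmse:.3f}  R2={r2_score(y_te, pred):.3f}")
```

On data with a dominant linear signal like this, the linear kernel typically comes out best, consistent with the simulation findings reported above; the real-data conclusion (linear and radial basis kernels) would require the RF dataset to reproduce.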
Subject
General Physics and Astronomy, General Engineering
Cited by
1 article.