Abstract
In real life, the influencing factors that lead to an outcome are varied and complex. However, how to organize the complex variable relationship into a concise and effective model for prediction is critical. Therefore, the Stepwise method was used in this study to organize the relationship between malignant and benign detection of breast cancer tumors and changes in cell characteristics into a model. In this paper, the diagnosis data of breast cancer of patients in the hospital were quoted. The benign and malignant characteristics of breast cancer were taken as independent variables, and the characteristic changes of cells were taken as dependent variables to conduct data analysis with R language, so as to obtain the most effective model. The results show that the prediction accuracy of the model obtained by AIC method is the highest. AIC selected seven dependent variables and reached 79.8 percent. The model prediction accuracy of BIC was 77.2 percent. Compared with AIC method, BIC only selected four dependent variables and obtained a more concise model. However, BIC is too concise and loses the accuracy of model prediction in some aspects.
Publisher
Darcy & Roy Press Co. Ltd.
Reference10 articles.
1. George Edward Pelham Box, Norman R. Draper. (1986). “Empirical Model-Building and Response Surfaces”). John Wiley & Sons, Inc.605 Third Ave. New York, NY, United States
2. Stoica, P.; Selen, Y. (2004). "Model-order selection: a review of information criterion rules", IEEE Signal Processing Magazine (July): 36–47, doi:10.1109/MSP.2004.1311138, S2CID 17338979
3. Fawcett, Tom (2006). "An Introduction to ROC Analysis". Pattern Recognition Letters. 27 (8): 861–874. doi:10.1016/j.patrec.2005.10.010.
4. Piryonesi S. Madeh; El-Diraby Tamer E. (2020-03-01). "Data Analytics in Asset Management: Cost-Effective Prediction of the Pavement Condition Index". Journal of Infrastructure Systems. 26 (1): 04019036.
5. Trnecka, M., & Trneckova, M. (2021). Model order selection for approximate Boolean matrix factorization problem. Knowledge-Based Systems, 227, 107184.