Affiliation:
1. School of Information Science and Engineering Yanshan University Qinhuangdao People's Republic of China
2. The Key Laboratory for Computer Virtual Technology and System Integration of Hebei Province Yanshan University Qinhuangdao People's Republic of China
3. College of Computer Science Sichuan University Chengdu People's Republic of China
Abstract
SummaryHospital readmission prediction is defined as an evaluation task to model the historical medical data to predict whether patients will be readmitted after discharge. In the past few years, many feasible and effective prediction methods have been proposed, however, most of them neglect the imbalanced distribution of medical data, which causes great difficulties in modeling. Thus, we proposed a new hospital readmission prediction method, which utilizes hybrid‐sampling and self‐paced balance learning strategies to solve the class‐imbalance problem. To be specifically, we first employ an interference negative sample deletion strategy to reduce the probability of important majority class samples being deleted. Then, we design a hard positive sample generation strategy to generate more positive samples. Meanwhile, we also introduce a self‐paced balance factor during the oversampling process to improve the similarity between newly generated minority class samples and hard positive samples. Finally, we perform the experiments on six real‐world readmission datasets to indicate the superiority of the proposed method.