Author:
Zhang Jing,Cao Peng,Gross Douglas P,Zaiane Osmar R
Abstract
Abstract
Recommending optimal rehabilitation intervention for injured workers that would lead to successful return-to-work (RTW) is a challenge for clinicians. Currently, the clinicians are unable to identify with complete confidence which intervention is best for a patient and the referral is often made in trial and error fashion. Only 58% recommendations are successful in our dataset. We aim to develop an interpretable decision support system using machine learning to assist the clinicians. We proposed an alternate ripper (ARIPPER) combined with a hybrid re-sampling technique, and a balanced weighted random forests (BWRF) ensemble method respectively, in order to tackle the multi-class imbalance, class overlap and noise problem in real world application data. The final models have shown promising potential in classification compared to human baseline and has been integrated into a web-based decision-support tool that requires additional validation in a clinical sample.
Publisher
Springer Science and Business Media LLC
Reference29 articles.
1. Chawla NV, Japkowicz N, Kolcz A: Editorial: special issue on learning from imbalanced data sets. SIGKDD Explorations Spec Issue Learn Imbalanced Datasets. 2004, 6: 1-6.
2. He H, Garcia E: Learning from imbalanced data. IEEE Trans Knowl Data Eng. 2009, 21 (9): 1263-1284.
3. Kotsiantis S, Kanellopoulos D, Pintelas P: Handling imbalanced datasets: a review. GESTS Int Trans Comput Sci Eng. 2006, 30: 25-36.
4. Yang Q, Wu X: 10 challenging problems in data mining research. Int J Inf Technol Decis Mak. 2006, 5 (4): 597-604. 10.1142/S0219622006002258.
5. Zhou ZH, Liu XY: Training cost-sensitive neural networks with methods addressing the class imbalance problem. IEEE Trans Knowl Data Eng. 2006, 18 (1): 63-77.
Cited by
8 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献