Author:
Liu Xuanxuan,Guo Li,Wang Hexiang,Guo Jia,Yang Shifeng,Duan Lisha
Abstract
Abstract
Background
Soft tissue sarcoma is a rare and highly heterogeneous tumor in clinical practice. Pathological grading of the soft tissue sarcoma is a key factor in patient prognosis and treatment planning while the clinical data of soft tissue sarcoma are imbalanced. In this paper, we propose an effective solution to find the optimal imbalance machine learning model for predicting the classification of soft tissue sarcoma data.
Methods
In this paper, a large number of features are first obtained based on $$T_1$$
T
1
WI images using the radiomics methods.Then, we explore the methods of feature selection, sampling and classification, get 17 imbalance machine learning models based on the above features and performed extensive experiments to classify imbalanced soft tissue sarcoma data. Meanwhile, we used another dataset splitting method as well, which could improve the classification performance and verify the validity of the models.
Results
The experimental results show that the combination of extremely randomized trees (ERT) classification algorithm using SMOTETomek and the recursive feature elimination technique (RFE) performs best compared to other methods. The accuracy of RFE+STT+ERT is 81.57% , which is close to the accuracy of biopsy, and the accuracy is 95.69% when using another dataset splitting method.
Conclusion
Preoperative predicting pathological grade of soft tissue sarcoma in an accurate and noninvasive manner is essential. Our proposed machine learning method (RFE+STT+ERT) can make a positive contribution to solving the imbalanced data classification problem, which can favorably support the development of personalized treatment plans for soft tissue sarcoma patients.
Funder
National Natural Science Foundation of China
Natural Science Foundation of Shandong Province
Publisher
Springer Science and Business Media LLC
Subject
Radiology, Nuclear Medicine and imaging
Reference49 articles.
1. Hexiang W, Jihua L, Dapeng H, Shaofeng D, Wenjian X. Mrt1wi based radiomics and machine learning model for predicting the histopathological grades of soft tissue sarcomas. Chin J Radiol. 2020;54(4):6.
2. Siegel R, Naishadham D, Jemal A. Cancer statistics, 2012. CA Cancer J Clin. 2012;62(1):10–29.
3. Coindre JM. Grading of soft tissue sarcomas—review and update. Archiv Pathol Lab Med. 2006;130(10):1448–53.
4. Pasquali S, Gronchi A. Neoadjuvant chemotherapy in soft tissue sarcomas: latest evidence and clinical implications. Therap Adv Med Oncol. 2017;9(6):415.
5. Gronchi A, Ferrari S, Quagliuolo V, Broto JM, Pousa AL, Grignani G, Basso U, Blay JY, Tendero O, Beveridge RDa Histotype-tailored neoadjuvant chemotherapy versus standard chemotherapy in patients with high-risk soft-tissue sarcomas (isg-sts 1001): an international, open-label, randomised, controlled, phase 3, multicentre trial. The Lancet Oncology 2017
Cited by
8 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献