Affiliation:
1. Department of Pediatrics, the Affiliated Changsha Central Hospital, Hengyang Medical School, University of South China, Changsha, Hunan, China
2. Department of Joint Surgery, he Hong-he Affiliated Hospital of Kunming Medical University/The Southern Central Hospital of Yun-nan Province (The First People’s Hospital of Honghe State), Changsha, Hunan, China
Abstract
Objective
This study aimed to create a predictive model based on machine learning to identify the risk for tracheobronchial tuberculosis (TBTB) occurring alongside Mycoplasma pneumoniae pneumonia in pediatric patients.
Methods
Clinical data from 212 pediatric patients were examined in this retrospective analysis. This cohort included 42 individuals diagnosed with TBTB and Mycoplasma pneumoniae pneumonia (combined group) and 170 patients diagnosed with lobar pneumonia alone (pneumonia group). Three predictive models, namely XGBoost, decision tree, and logistic regression, were constructed, and their performances were assessed using the receiver’s operating characteristic (ROC) curve, precision-recall curve (PR), and decision curve analysis (DCA). The dataset was divided into a 7:3 ratio to test the first and second groups, utilizing them to validate the XGBoost model and to construct the nomogram model.
Results
The XGBoost highlighted eight significant signatures, while the decision tree and logistic regression models identified six and five signatures, respectively. The ROC analysis revealed an area under the curve (AUC) of 0.996 for XGBoost, significantly outperforming the other models (p < 0.05). Similarly, the PR curve demonstrated the superior predictive capability of XGBoost. DCA further confirmed that XGBoost offered the highest AIC (43.226), the highest average net benefit (0.764), and the best model fit. Validation efforts confirmed the robustness of the findings, with the validation groups 1 and 2 showing ROC and PR curves with AUC of 0.997, indicating a high net benefit. The nomogram model was shown to possess significant clinical value.
Conclusion
Compared to machine learning approaches, the XGBoost model demonstrated superior predictive efficacy in identifying pediatric patients at risk of concurrent TBTB and Mycoplasma pneumoniae pneumonia. The model’s identification of critical signatures provides valuable insights into the pathogenesis of these conditions.