Abstract
AbstractClostridioides difficileis the main cause of antibiotic related diarrhea and some ribotypes (RT), such as RT027, RT181 or RT078, are considered high risk clones. A fast and reliable approach forC. difficileribotyping is needed for a correct clinical approach. This study analyses high-molecular-weight proteins forC. difficileribotyping with MALDI-TOF MS. Sixty-nine isolates representative of the most common ribotypes in Europe were analyzed in the 17,000-65,000m/zregion and classified into 4 categories (RT027, RT181, RT078 and ‘Other RTs’). Five supervised Machine Learning algorithms were tested for this purpose: K-Nearest Neighbors, Support Vector Machine, Partial Least Squares-Discriminant Analysis, Random Forest and Light-Gradient Boosting Machine. All algorithms yielded cross-validation results >70%, being RF and Light-GBM the best performing, with 88% of agreement. Area under the ROC curve of these two algorithms was >0.9. RT078 was correctly classified with 100% accuracy and isolates from the RT181 category could not be differentiated from RT027.
Publisher
Cold Spring Harbor Laboratory