Affiliation:
1. Department of Mathematics, King's College London, London, UK
2. Institute of Structural and Molecular Biology, University College London, London, UK
Abstract
Statistical and machine learning methods have proved useful in many areas of immunology. In this paper, we address for the first time the problem of predicting the occurrence of class switch recombination (CSR) in B‐cells, a problem of interest for understanding the antibody response under immunological challenges. We propose a framework for analyzing antibody repertoire data based on a clonal group (CG) representation, which allows us to predict CSR events using CG-level features as input. We assess and compare the performance of several predictive models (logistic regression, LASSO logistic regression, random forest, and support vector machine) on this task. The proposed approach can obtain an unweighted average recall of with models based on variable region descriptors and measures of CG diversity during an immune challenge and, most notably, before an immune challenge.
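For readers unfamiliar with the metric, unweighted average recall (UAR) is the mean of the per-class recalls, so each class counts equally regardless of how many samples it has. The following is a minimal sketch of that definition only, not the authors' evaluation code; the function name and the toy labels (1 for a clonal group with CSR, 0 for one without) are hypothetical.

```python
from collections import defaultdict


def unweighted_average_recall(y_true, y_pred):
    """Return the mean of per-class recalls, weighting every class equally."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("at least one sample is required")

    total = defaultdict(int)    # number of samples per true class
    correct = defaultdict(int)  # correctly predicted samples per true class
    for truth, pred in zip(y_true, y_pred):
        total[truth] += 1
        if truth == pred:
            correct[truth] += 1

    # Recall of class c = correct[c] / total[c]; UAR averages these values.
    recalls = [correct[c] / total[c] for c in total]
    return sum(recalls) / len(recalls)


# Hypothetical toy labels: 1 = CSR observed in the clonal group, 0 = not observed.
y_true = [0, 0, 0, 0, 1, 1]
y_pred = [0, 0, 0, 1, 1, 0]
uar = unweighted_average_recall(y_true, y_pred)
```

In this toy example the recall is 3/4 for class 0 and 1/2 for class 1, giving a UAR of 0.625, whereas plain accuracy would be 4/6. The difference illustrates why UAR is often preferred when one class is much rarer than the other.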
Funder
Biotechnology and Biological Sciences Research Council