Abstract
AbstractInvariant concept classes form the backbone of classification algorithms immune to specific data transformations, ensuring consistent predictions regardless of these alterations. However, this robustness can come at the cost of limited access to the original sample information, potentially impacting generalization performance. This study introduces an addition to these classes—the permutation-invariant linear classifiers. Distinguished by their structural characteristics, permutation-invariant linear classifiers are unaffected by permutations on feature vectors, a property not guaranteed by other non-constant linear classifiers. The study characterizes this new concept class, highlighting its constant capacity, independent of input dimensionality. In practical assessments using linear support vector machines, the permutation-invariant classifiers exhibit superior performance in permutation experiments on artificial datasets and real mutation profiles. Interestingly, they outperform general linear classifiers not only in permutation experiments but also in permutation-free settings, surpassing unconstrained counterparts. Additionally, findings from real mutation profiles support the significance of tumor mutational burden as a biomarker.
Funder
Deutsche Forschungsgemeinschaft
Bundesministerium für Bildung und Forschung
Universität Ulm
Publisher
Springer Science and Business Media LLC
Reference39 articles.
1. Abe, S. (2010). Support vector machines for pattern classification. Berlin: Springer.
2. Anthony, M. H. G., & Biggs, N. (1997). Computational learning theory (Vol. 30). Cambridge: Cambridge University Press.
3. Bishop, C. M. (2006). Pattern recognition and machine learning (information science and statistics). Berlin: Springer.
4. Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5–32.
5. Burges, C. J. C. (1998). A tutorial on support vector machines for pattern recognition. Data Mining and Knowledge Discovery, 2(2), 121–167.