Constraining classifiers in molecular analysis: invariance and robustness
-
Published:2020-02
Issue:163
Volume:17
Page:20190612
-
ISSN:1742-5689
-
Container-title:Journal of The Royal Society Interface
-
language:en
-
Short-container-title:J. R. Soc. Interface.
Author:
Lausser Ludwig1,
Szekely Robin1,
Klimmek Attila1,
Schmid Florian1,
Kestler Hans A.12ORCID
Affiliation:
1. Institute of Medical Systems Biology, Ulm University, Ulm, Germany
2. Leibniz Institute on Aging, Jena, Germany
Abstract
Analysing molecular profiles requires the selection of classification models that can cope with the high dimensionality and variability of these data. Also, improper reference point choice and scaling pose additional challenges. Often model selection is somewhat guided by
ad hoc
simulations rather than by sophisticated considerations on the properties of a categorization model. Here, we derive and report four linked linear concept classes/models with distinct invariance properties for high-dimensional molecular classification. We can further show that these concept classes also form a half-order of complexity classes in terms of Vapnik–Chervonenkis dimensions, which also implies increased generalization abilities. We implemented support vector machines with these properties. Surprisingly, we were able to attain comparable or even superior generalization abilities to the standard linear one on the 27 investigated RNA-Seq and microarray datasets. Our results indicate that
a priori
chosen invariant models can replace
ad hoc
robustness analysis by interpretable and theoretically guaranteed properties in molecular categorization.
Funder
Bundesministerium für Bildung und Forschung
Deutsche Forschungsgemeinschaft
Ministerium für Wissenschaft, Forschung und Kunst Baden-Württemberg
Publisher
The Royal Society
Subject
Biomedical Engineering,Biochemistry,Biomaterials,Bioengineering,Biophysics,Biotechnology
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Permutation-invariant linear classifiers;Machine Learning;2024-07-09
2. Detecting Ordinal Subcascades;Neural Processing Letters;2020-10-19