Abstract
AbstractRecent progress in theoretical systems biology, applied mathematics and computational statistics allows us to compare quantitatively the performance of different candidate models at describing a particular biological system. Model selection has been applied with great success to problems where a small number — typically less than 10 — of models are compared, but recently studies have started to consider thousands and even millions of candidate models. Often, however, we are left with sets of models that are compatible with the data, and then we can use ensembles of models to make predictions. These ensembles can have very desirable characteristics, but as I show here are not guaranteed to improve on individual estimators or predictors. I will show in the cases of model selection and network inference when we can trust ensembles, and when we should be cautious. The analyses suggests that the careful construction of an ensemble – choosing good predictors – is of paramount importance, more than had perhaps been realised before: merely adding different methods does not suffice. The success of ensemble network inference methods is also shown to rest on their ability to suppress false-positive results. A Jupyter notebook which allows carrying out an assessment of ensemble estimators is provided.
Publisher
Cold Spring Harbor Laboratory
Reference60 articles.
1. Biophysically Motivated Regulatory Network Inference: Progress and Prospects
2. Akaike H (1974). A new look at the statistical model identification. In Selected Papers of Hirotugu Akaike, pp. 215–222. Springer.
3. Arfken G , HJ W , & Harris F (2013). Mathematical Methods for Physicists. Academic Press.
4. Babtie A C & Stumpf M P H (2017). How to deal with parameters for whole-cell modelling. Journal of the Royal Society, Interface / the Royal Society, 14:20170,237.
5. Topological sensitivity analysis for systems biology
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献