Abstract
In this work, we study the problem of inferring a discrete probability distribution using both expert knowledge and empirical data. This is an important issue for many applications where the scarcity of data prevents a purely empirical approach. In this context, it is common to rely first on an a priori from initial domain knowledge before proceeding to an online data acquisition. We are particularly interested in the intermediate regime, where we do not have enough data to do without the initial a priori of the experts, but enough to correct it if necessary. We present here a novel way to tackle this issue, with a method providing an objective way to choose the weight to be given to experts compared to data. We show, both empirically and theoretically, that our proposed estimator is always more efficient than the best of the two models (expert or data) within a constant.
Funder
Agence Nationale de la Recherche
Subject
General Physics and Astronomy
Reference32 articles.
1. A Model-Based Reinforcement Learning Approach for a Rare Disease Diagnostic Task;Besson;arXiv,2018
2. An introduction to the maximum entropy approach and its application to inference problems in biology
3. Bayesian Data Analysis;Gelman,2004
4. Information Theory and Statistical Mechanics
5. Mémoire sur la Probabilité des Causes par les évènementshttps://gallica.bnf.fr/ark:/12148/bpt6k77596b/f32
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献