Information theory for data-driven model reduction in physics and biology

Author:

Schmitt Matthew S.ORCID,Koch-Janusz Maciej,Fruchart Michel,Seara Daniel S.ORCID,Rust Michael,Vitelli VincenzoORCID

Abstract

Model reduction is the construction of simple yet predictive descriptions of the dynamics of many-body systems in terms of a few relevant variables. A prerequisite to model reduction is the identification of these relevant variables, a task for which no general method exists. Here, we develop a systematic approach based on the information bottleneck to identify the relevant variables, defined as those most predictive of the future. We elucidate analytically the relation between these relevant variables and the eigenfunctions of the transfer operator describing the dynamics. Further, we show that in the limit of high compression, the relevant variables are directly determined by the slowest-decaying eigenfunctions. Our information-based approach indicates when to optimally stop increasing the complexity of the reduced model. Furthermore, it provides a firm foundation to construct interpretable deep learning tools that perform model reduction. We illustrate how these tools work in practice by considering uncurated videos of atmospheric flows from which our algorithms automatically extract the dominant slow collective variables, as well as experimental videos of cyanobacteria colonies in which we discover an emergent synchronization order parameter.Significance StatementThe first step to understand natural phenomena is to intuit which variables best describe them. An ambitious goal of artificial intelligence is to automate this process. Here, we develop a framework to identify these relevant variables directly from complex datasets. Very much like MP3 compression is about retaining information that matters most to the human ear, our approach is about keeping information that matters most to predict the future. We formalize this insight mathematically and systematically answer the question of when to stop increasing the complexity of minimal models. We illustrate how interpretable deep learning tools built on these ideas reveal emergent collective variables in settings ranging from satellite recordings of atmospheric fluid flows to experimental videos of cyanobacteria colonies.

Publisher

Cold Spring Harbor Laboratory

Reference78 articles.

1. Yoshiki Kuramoto . Chemical Oscillations, Waves, And Turbulence. Springer, 1984. ISBN 9780486428819..

2. H. Haken . Synergetics: An Introduction. Springer Series in Synergetics. Springer Berlin Heidelberg, 1983. ISBN 9783540123569.

3. G.A. Pavliotis and A. Stuart . Multiscale Methods: Averaging and Homogenization. Texts in Applied Mathematics. Springer New York, 2010. ISBN 9781441925329.

4. Extracting macroscopic dynamics: model problems and algorithms

5. On the concept of dynamical reduction: the case of coupled oscillators;Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences,2019

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3