Lightning Pose: improved animal pose estimation via semi-supervised learning, Bayesian ensembling, and cloud-native open-source tools-Reference-Cited by-同舟云学术

Lightning Pose: improved animal pose estimation via semi-supervised learning, Bayesian ensembling, and cloud-native open-source tools

Published:2023-04-28 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Biderman Dan^ORCID,Whiteway Matthew R^ORCID,Hurwitz Cole^ORCID,Greenspan Nicholas,Lee Robert S,Vishnubhotla Ankit^ORCID,Warren Richard^ORCID,Pedraja Federico^ORCID,Noone Dillon^ORCID,Schartner Michael^ORCID,Huntenburg Julia M^ORCID,Khanal Anup^ORCID,Meijer Guido T^ORCID,Noel Jean-Paul^ORCID,Pan-Vazquez Alejandro^ORCID,Socha Karolina Z^ORCID,Urai Anne E^ORCID,Cunningham John P^ORCID,Sawtell Nathaniel^ORCID,Paninski Liam^ORCID,

Abstract

AbstractPose estimation algorithms are shedding new light on animal behavior and intelligence. Most existing models are only trained with labeled frames (supervised learning). Although effective in many cases, the fully supervised approach requires extensive image labeling, struggles to generalize to new videos, and produces noisy outputs that hinder downstream analyses. We address each of these limitations with a semi-supervised approach that leverages the spatiotemporal statistics of unlabeled videos in two different ways. First, we introduce unsupervised training objectives that penalize the network whenever its predictions violate smoothness of physical motion, multiple-view geometry, or depart from a low-dimensional subspace of plausible body configurations. Second, we design a new network architecture that predicts pose for a given frame using temporal context from surrounding unlabeled frames. These context frames help resolve brief occlusions or ambiguities between nearby and similar-looking body parts. The resulting pose estimation networks achieve better performance with fewer labels, generalize better to unseen videos, and provide smoother and more reliable pose trajectories for downstream analysis; for example, these improved pose trajectories exhibit stronger correlations with neural activity. We also propose a Bayesian post-processing approach based on deep ensembling and Kalman smoothing that further improves tracking accuracy and robustness. We release a deep learning package that adheres to industry best practices, supporting easy model development and accelerated training and prediction. Our package is accompanied by a cloud application that allows users to annotate data, train networks, and predict new videos at scale, directly from the browser.

Publisher

Cold Spring Harbor Laboratory

Reference93 articles.

1. Deep ensembles work, but are they necessary?;arXiv,2022

2. Neuroscience Cloud Analysis As a Service: An open-source platform for scalable, reproducible data analysis;Neuron,2022

3. Striatal dopamine explains novelty-induced behavioral dynamics and individual variability in threat prediction;Neuron,2022

4. William H Beluch et al. “The power of ensembles for active learning in image classification.” Proceedings of the IEEE conference on computer vision and pattern recognition. 2018, pp. 9368–9377 (page 16).

5. Mapping the stereotyped behaviour of freely moving fruit flies;Journal of The Royal Society Interface,2014

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Unbiased preclinical phenotyping reveals neuroprotective properties of pioglitazone;2024-09-01

2. Keypoint-MoSeq: parsing behavior by linking point tracking to pose dynamics;Nature Methods;2024-07

3. A 3D whole-face movement analysis system to uncover underlying physiology in mice;2024-05-08

4. Application of a novel deep learning–based 3D videography workflow to bat flight;Annals of the New York Academy of Sciences;2024-04-23

5. Brain-wide representations of prior information in mouse decision-making;2023-07-04