Author:
Liu XiaoLe,Yu Si-yang,Flierman Nico A.,Loyola Sebastián,Kamermans Maarten,Hoogland Tycho M.,De Zeeuw Chris I.
Abstract
Animal pose estimation tools based on deep learning have greatly improved animal behaviour quantification. These tools perform pose estimation on individual video frames, but do not account for variability of animal body shape in their prediction and evaluation. Here, we introduce a novel multi-frame animal pose estimation framework, referred to as OptiFlex. This framework integrates a flexible base model (i.e., FlexibleBaseline), which accounts for variability in animal body shape, with an OpticalFlow model that incorporates temporal context from nearby video frames. Pose estimation can be optimised using multi-view information to leverage all four dimensions (3D space and time). We evaluate FlexibleBaseline using datasets of four different lab animal species (mouse, fruit fly, zebrafish, and monkey) and introduce an intuitive evaluation metric—adjusted percentage of correct key points (aPCK). Our analyses show that OptiFlex provides prediction accuracy that outperforms current deep learning based tools, highlighting its potential for studying a wide range of behaviours across different animal species.
Funder
ZonMw
Nederlandse Organisatie voor Wetenschappelijk Onderzoek
European Research Council
Subject
Cellular and Molecular Neuroscience
Reference43 articles.
1. BehaveNet: nonlinear embedding and Bayesian neural decoding of behavioral videos;Batty,2019
2. Creatures great and SMAL: recovering the shape and motion of animals from video;Biggs,2018
3. The OpenCV library;Bradski;Dr. Dobb's J. Software Tools,2000
4. Adversarial posenet: a structure-aware convolutional network for human pose estimation;Chen;IEEE International Conference on Computer Vision,2017
5. CholletF.
Keras2015
Cited by
24 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献