Automatic diagnosis of depression based on attention mechanism and feature pyramid model
Author:
Xu Ningya,Huo Hua,Xu Jiaxin,Ma Lan,Wang Jinxuan
Abstract
Currently, most diagnoses of depression are evaluated by medical professionals, with the results of these evaluations influenced by the subjective judgment of physicians. Physiological studies have shown that depressed patients display facial movements, head posture, and gaze direction disorders. To accurately diagnose the degree of depression of patients, this paper proposes a comprehensive framework, Cross-Channel Attentional Depression Detection Network, which can automatically diagnose the degree of depression of patients by inputting information from the facial images of depressed patients. Specifically, the comprehensive framework is composed of three main modules: (1) Face key point detection and cropping for video images based on Multi-Task Convolutional Neural Network. (2) The improved Feature Pyramid Networks model can fuse shallow features and deep features in video images and reduce the loss of miniscule features. (3) A proposed Cross-Channel Attention Convolutional Neural Network can enhance the interaction between tensor channel layers. Compared to other methods for automatic depression identification, a superior method was obtained by conducting extensive experiments on the depression dataset AVEC 2014, where the Root Mean Square Error and the Mean Absolute Error were 8.65 and 6.66, respectively.
Funder
National Natural Science Foundation of China
Central Government Guiding Local Science and Technology Development Fund Program
Major Science and Technology Program of Henan Province
Publisher
Public Library of Science (PLoS)
Reference54 articles.
1. WHO. Depression and other common mental disorders: global health estimates: Technical report. World Health Organization. 2017.
2. Major depressive disorder;C Otte;Nat Rev Dis Primers,2016
3. Dinkel H, Wu M, Yu K. Text-based depression detection on sparse data. arXiv preprint arXiv:1904.05154, 2019 Apr. https://doi.org/10.48550/arXiv.1904.05154
4. Semi-structural interview-based Chinese multimodal depression corpus towards automatic preliminary screening of depressive disorders;B Zou;IEEE Transactions on Affective Computing,2022
5. Automatic detection of depression symptoms in twitter using multimodal analysis;R Safa;Supercomput,2022