Author:
Li Zongwei,Song Jia,Qiao Kai,Li Chenghai,Zhang Yanhui,Li Zhenyu
Abstract
Facial expressions, whether simple or complex, convey pheromones that can affect others. Plentiful sensory input delivered by marketing anchors' facial expressions to audiences can stimulate consumers' identification and influence decision-making, especially in live streaming media marketing. This paper proposes an efficient feature extraction network based on the YOLOv5 model for detecting anchors' facial expressions. First, a two-step cascade classifier and recycler is established to filter invalid video frames to generate a facial expression dataset of anchors. Second, GhostNet and coordinate attention are fused in YOLOv5 to eliminate latency and improve accuracy. YOLOv5 modified with the proposed efficient feature extraction structure outperforms the original YOLOv5 on our self-built dataset in both speed and accuracy.
Funder
National Natural Science Foundation of China
National Social Science Fund of China
Subject
Cellular and Molecular Neuroscience,Neuroscience (miscellaneous)
Reference35 articles.
1. Benchmark analysis of representative deep neural network architectures;Bianco;IEEE Access,2018
2. YOLOv4: optimal speed and accuracy of object detection
BochkovskiyA.
WangC.-Y.
LiaoH-Y. M.
34300543arXiv [Preprint]2020
3. Rosetta: Large scale system for text detection and recognition in images;Borisyuk,2018
4. Parasocial interaction with YouTubers: does sensory appeal in the YouTubers' video influences purchase intention?;Chen,2021
5. Deep learning approaches for facial emotion recognition: a case study on FER-2013;Giannopoulos,2018
Cited by
11 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献