Abstract
The shot-type decision is a very important pre-task in movie analysis due to the vast information, such as the emotion, psychology of the characters, and space information, from the shot type chosen. In order to analyze a variety of movies, a technique that automatically classifies shot types is required. Previous shot type classification studies have classified shot types by the proportion of the face on-screen or using a convolutional neural network (CNN). Studies that have classified shot types by the proportion of the face on-screen have not classified the shot if a person is not on the screen. A CNN classifies shot types even in the absence of a person on the screen, but there are certain shots that cannot be classified because instead of semantically analyzing the image, the method classifies them only by the characteristics and patterns of the image. Therefore, additional information is needed to access the image semantically, which can be done through semantic segmentation. Consequently, in the present study, the performance of shot type classification was improved by preprocessing the semantic segmentation of the frame extracted from the movie. Semantic segmentation approaches the images semantically and distinguishes the boundary relationships among objects. The representative technologies of semantic segmentation include Mask R-CNN and Yolact. A study was conducted to compare and evaluate performance using these as pretreatments for shot type classification. As a result, the average accuracy of shot type classification using a frame preprocessed with semantic segmentation increased by 1.9%, from 93% to 94.9%, when compared with shot type classification using the frame without such preprocessing. In particular, when using ResNet-50 and Yolact, the classification of shot type showed a 3% performance improvement (to 96% accuracy from 93%).
Subject
Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science
Reference27 articles.
1. Film Directing: Shot by Shot;Katz,1991
2. Grammar of the Edit;Thompson,2009
Cited by
12 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Aesthetic Assessment of Movie Still Frame for Various Field of Views;2024 IEEE International Conference on Multimedia and Expo Workshops (ICMEW);2024-07-15
2. Mask-VGG: A Shot Scale Classification Model Based on Mask Generation;2023 International Conference on Culture-Oriented Science and Technology (CoST);2023-10-11
3. Toward Unified and Quantitative Cinematic Shot Attribute Analysis;Electronics;2023-10-08
4. LEMMS: Label Estimation of Multi-feature Movie Segments;2023 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW);2023-10-02
5. Recognition of Camera Angle and Camera Level in Movies from Single Frames;Proceedings of the 2023 ACM International Conference on Interactive Media Experiences Workshops;2023-06-12