Lester: Rotoscope Animation through Video Object Segmentation and Tracking
-
Published:2024-07-30
Issue:8
Volume:17
Page:330
-
ISSN:1999-4893
-
Container-title:Algorithms
-
language:en
-
Short-container-title:Algorithms
Affiliation:
1. Department of Computer Architecture, Universitat Politecnica de Catalunya, 08034 Barcelona, Spain
Abstract
This article introduces Lester, a novel method to automatically synthesize retro-style 2D animations from videos. The method approaches the challenge mainly as an object segmentation and tracking problem. Video frames are processed with the Segment Anything Model (SAM) and the resulting masks are tracked through subsequent frames with DeAOT, a method of hierarchical propagation for semi-supervised video object segmentation. The geometry of the masks’ contours is simplified with the Douglas–Peucker algorithm. Finally, facial traits, pixelation and a basic rim light effect can be optionally added. The results show that the method exhibits an excellent temporal consistency and can correctly process videos with different poses and appearances, dynamic shots, partial shots and diverse backgrounds. The proposed method provides a more simple and deterministic approach than diffusion models based video-to-video translation pipelines, which suffer from temporal consistency problems and do not cope well with pixelated and schematic outputs. The method is also more feasible than techniques based on 3D human pose estimation, which require custom handcrafted 3D models and are very limited with respect to the type of scenes they can process.
Funder
Spanish Ministry of Science and Innovation Catalan Government
Reference39 articles.
1. Kirillov, A., Mintun, E., Ravi, N., Mao, H., Rolland, C., Gustafson, L., Xiao, T., Whitehead, S., Berg, A.C., and Lo, W.Y. (2023). Segment anything. arXiv. 2. Cheng, Y., Li, L., Xu, Y., Li, X., Yang, Z., Wang, W., and Yang, Y. (2023). Segment and Track Anything. arXiv. 3. Douglas, D.H., and Peucker, T.K. (2011). Algorithms for the Reduction of the Number of Points Required to Represent a Digitized Line or its Caricature. Classics in Cartography: Reflections on Influential Articles from Cartographica, John Wiley & Sons. 4. Chen, Y., Lai, Y.K., and Liu, Y.J. (2018, January 18–23). CartoonGAN: Generative Adversarial Networks for Photo Cartoonization. Proceedings of the CVPR. IEEE Computer Society, Salt Lake City, UT, USA. 5. Chen, J., Liu, G., and Chen, X. (2020). AnimeGAN: A Novel Lightweight GAN for Photo Animation. Artificial Intelligence Algorithms and Applications, Proceedings of the 11th International Symposium, ISICA 2019, Guangzhou, China, 16–17 November 2019, Springer.
|
|