Lester: Rotoscope Animation through Video Object Segmentation and Tracking-Reference-Cited by-同舟云学术

Lester: Rotoscope Animation through Video Object Segmentation and Tracking

Published:2024-07-30 Issue:8 Volume:17 Page:330
ISSN:1999-4893
Container-title:Algorithms
language:en
Short-container-title:Algorithms

Author:

Tous Ruben¹^ORCID

Affiliation:

1. Department of Computer Architecture, Universitat Politecnica de Catalunya, 08034 Barcelona, Spain

Abstract

This article introduces Lester, a novel method to automatically synthesize retro-style 2D animations from videos. The method approaches the challenge mainly as an object segmentation and tracking problem. Video frames are processed with the Segment Anything Model (SAM) and the resulting masks are tracked through subsequent frames with DeAOT, a method of hierarchical propagation for semi-supervised video object segmentation. The geometry of the masks’ contours is simplified with the Douglas–Peucker algorithm. Finally, facial traits, pixelation and a basic rim light effect can be optionally added. The results show that the method exhibits an excellent temporal consistency and can correctly process videos with different poses and appearances, dynamic shots, partial shots and diverse backgrounds. The proposed method provides a more simple and deterministic approach than diffusion models based video-to-video translation pipelines, which suffer from temporal consistency problems and do not cope well with pixelated and schematic outputs. The method is also more feasible than techniques based on 3D human pose estimation, which require custom handcrafted 3D models and are very limited with respect to the type of scenes they can process.

Funder

Spanish Ministry of Science and Innovation

Catalan Government

Publisher

MDPI AG

Link

https://www.mdpi.com/1999-4893/17/8/330/pdf

Reference39 articles.

1. Kirillov, A., Mintun, E., Ravi, N., Mao, H., Rolland, C., Gustafson, L., Xiao, T., Whitehead, S., Berg, A.C., and Lo, W.Y. (2023). Segment anything. arXiv.

2. Cheng, Y., Li, L., Xu, Y., Li, X., Yang, Z., Wang, W., and Yang, Y. (2023). Segment and Track Anything. arXiv.

3. Douglas, D.H., and Peucker, T.K. (2011). Algorithms for the Reduction of the Number of Points Required to Represent a Digitized Line or its Caricature. Classics in Cartography: Reflections on Influential Articles from Cartographica, John Wiley & Sons.

4. Chen, Y., Lai, Y.K., and Liu, Y.J. (2018, January 18–23). CartoonGAN: Generative Adversarial Networks for Photo Cartoonization. Proceedings of the CVPR. IEEE Computer Society, Salt Lake City, UT, USA.

5. Chen, J., Liu, G., and Chen, X. (2020). AnimeGAN: A Novel Lightweight GAN for Photo Animation. Artificial Intelligence Algorithms and Applications, Proceedings of the 11th International Symposium, ISICA 2019, Guangzhou, China, 16–17 November 2019, Springer.