Automatic Comic Generation with Stylistic Multi-page Layouts and Emotion-driven Text Balloon Generation

Author:

Yang Xin1,Ma Zongliang1,Yu Letian1,Cao Ying2,Yin Baocai1,Wei Xiaopeng1,Zhang Qiang1,Lau Rynson W. H.2

Affiliation:

1. Dalian University of Technology, Liaoning, China

2. City University of Hong Kong, Hong Kong, China

Abstract

In this article, we propose a fully automatic system for generating comic books from videos without any human intervention. Given an input video along with its subtitles, our approach first extracts informative keyframes by analyzing the subtitles and stylizes keyframes into comic-style images. Then, we propose a novel automatic multi-page layout framework that can allocate the images across multiple pages and synthesize visually interesting layouts based on the rich semantics of the images (e.g., importance and inter-image relation). Finally, as opposed to using the same type of balloon as in previous works, we propose an emotion-aware balloon generation method to create different types of word balloons by analyzing the emotion of subtitles and audio. Our method is able to vary balloon shapes and word sizes in balloons in response to different emotions, leading to more enriched reading experience. Once the balloons are generated, they are placed adjacent to their corresponding speakers via speaker detection. Our results show that our method, without requiring any user inputs, can generate high-quality comic pages with visually rich layouts and balloons. Our user studies also demonstrate that users prefer our generated results over those by state-of-the-art comic generation systems.

Funder

National Natural Science Foundation of China

Innovation Technology Funding of Dalian

National Key Research and Development Program of China

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Networks and Communications,Hardware and Architecture

Cited by 9 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Comic exploration and Insights: Recent trends in LDA-Based recognition studies;Expert Systems with Applications;2024-12

2. EmoComicNet: A multi-task model for comic emotion recognition;Pattern Recognition;2024-06

3. Image segmentation, classification and recognition methods for comics: A decade systematic literature review;Engineering Applications of Artificial Intelligence;2024-05

4. Investigating Neural Networks and Transformer Models for Enhanced Comic Decoding;Lecture Notes in Computer Science;2024

5. Fusing Deep Learning and Ensemble Techniques for Comic Character Emotion Recognition;2023 4th International Conference on Data Analytics for Business and Industry (ICDABI);2023-10-25

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3