MobileVidFactory: Automatic Diffusion-Based Social Media Video Generation for Mobile Devices from Text-Reference-Cited by-同舟云学术

MobileVidFactory: Automatic Diffusion-Based Social Media Video Generation for Mobile Devices from Text

Published:2023-10-26 Issue: Volume: Page:
ISSN:
Container-title:Proceedings of the 31st ACM International Conference on Multimedia
language:
Short-container-title:

Author:

Zhu Junchen¹^ORCID,Yang Huan²^ORCID,Wang Wenjing²^ORCID,He Huiguo²^ORCID,Tuo Zixi³^ORCID,Yu Yongsheng⁴^ORCID,Cheng Wen-Huang⁵^ORCID,Gao Lianli¹^ORCID,Song Jingkuan¹^ORCID,Fu Jianlong²^ORCID,Luo Jiebo⁴^ORCID

Affiliation:

1. University of Electronic Science and Technology of China, Chengdu, China

2. Microsoft Research, Beijing, China

3. Microsoft Research, Xi'an, China

4. University of Rochester, Rochester, NY, USA

5. National Taiwan University, Taipei, Taiwan Roc

Publisher

ACM

Link

https://dl.acm.org/doi/pdf/10.1145/3581783.3612667

Reference16 articles.

1. Max Bain Arsha Nagrani Gül Varol and Andrew Zisserman. 2021. Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval. In ICCV. Max Bain Arsha Nagrani Gül Varol and Andrew Zisserman. 2021. Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval. In ICCV.

2. Andreas Blattmann , Robin Rombach , Huan Ling , Tim Dockhorn , Seung Wook Kim , Sanja Fidler, and Karsten Kreis. 2023 . Align your Latents : High-Resolution Video Synthesis with Latent Diffusion Models. In CVPR. Andreas Blattmann, Robin Rombach, Huan Ling, Tim Dockhorn, Seung Wook Kim, Sanja Fidler, and Karsten Kreis. 2023. Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models. In CVPR.

3. A. Sophia Koepke , Andreea-Maria Oncescu , JoHenriques, Zeynep Akata , and Samuel Albanie . 2022. Audio Retrieval with Natural Language Queries: A Benchmark Study . IEEE TMM ( 2022 ). A. Sophia Koepke, Andreea-Maria Oncescu, JoHenriques, Zeynep Akata, and Samuel Albanie. 2022. Audio Retrieval with Natural Language Queries: A Benchmark Study. IEEE TMM (2022).

4. Zhen Li , Zuo-Liang Zhu , Linghao Han , Qibin Hou , Chun-Le Guo , and Ming-Ming Cheng . 2023 . AMT: All-Pairs Multi-Field Transforms for Efficient Frame Interpolation. In CVPR. Zhen Li, Zuo-Liang Zhu, Linghao Han, Qibin Hou, Chun-Le Guo, and Ming-Ming Cheng. 2023. AMT: All-Pairs Multi-Field Transforms for Efficient Frame Interpolation. In CVPR.

5. Chengxu Liu , Huan Yang , Jianlong Fu , and Xueming Qian . 2022 . TTVFI: Learning Trajectory-Aware Transformer for Video Frame Interpolation. arXiv (2022). Chengxu Liu, Huan Yang, Jianlong Fu, and Xueming Qian. 2022. TTVFI: Learning Trajectory-Aware Transformer for Video Frame Interpolation. arXiv (2022).

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. TA2V: Text-Audio Guided Video Generation;IEEE Transactions on Multimedia;2024