Author:
Liu Wu-Qin,Lin Min-Xuan,Huang Hai-Bin,Ma Chong-Yang,Song Yu,Dong Wei-Ming,Xu Chang-Sheng
Publisher
Springer Science and Business Media LLC
Subject
Computational Theory and Mathematics,Computer Science Applications,Hardware and Architecture,Theoretical Computer Science,Software
Reference39 articles.
1. Narasimhan M, Rohrbach A, Darrell T. Clip-it! Language-guided video summarization. In Proc. the 35th International Conference on Neural Information Processing Systems, Dec. 2021, pp.13988–14000.
2. Lin J C, Wei W L, Wang H M. EMV-matchmaker: Emotional temporal course modeling and matching for automatic music video generation. In Proc. the 23rd ACM International Conference on Multimedia, Oct. 2015, pp.899–902. https://doi.org/10.1145/2733373.2806359.
3. Lin J C, Wei W L, Wang H M. DEMV-matchmaker: Emotional temporal course representation and deep similarity matching for automatic music video generation. In Proc. the 2016 IEEE International Conference on Acoustics, Speech and Signal Processing, Mar. 2016, pp.2772– 2776. https://doi.org/10.1109/ICASSP.2016.7472182.
4. Murch W. In the Blink of an Eye. Silman-James Press, 2001.
5. Radford A, Kim J W, Hallacy C, Ramesh A, Goh G, Agarwal S, Sastry G, Askell A, Mishkin P, Clark J, Krueger G, Sutskever I. Learning transferable visual models from natural language supervision. In Proc. the 38th International Conference on Machine Learning, Jul. 2021, pp.8748–8763.