Affiliation:
1. Toronto Metropolitan University, Toronto, Canada
Abstract
In this article, we present an optimal edge-weighted graph semantic correlation (EWGSC) framework for multi-view feature representation learning. Different from most existing multi-view representation methods, local structural information and global correlation in multi-view feature spaces are exploited jointly in the EWGSC framework, leading to a new and high-quality multi-view feature representation. Specifically, a novel edge-weighted graph model is first conceptualized and developed to preserve local structural information in each of the multi-view feature spaces. Then, the explored structural information is integrated with a semantic correlation algorithm, labeled multiple canonical correlation analysis (LMCCA), to form a powerful platform for effectively exploiting local and global relations across multi-view feature spaces jointly. We then theoretically verified the relation between the upper limit on the number of projected dimensions and the optimal solution to the multi-view feature representation problem. To validate the effectiveness and generality of the proposed framework, we conducted experiments on five datasets of different scales, including visual-based (University of California Irvine (UCI) iris database, Olivetti Research Lab (ORL) face database, and Caltech 256 database), text-image-based (Wiki database), and video-based (Ryerson Multimedia Lab (RML) audio-visual emotion database) examples. The experimental results show the superiority of the proposed framework on multi-view feature representation over state-of-the-art algorithms.
Publisher
Association for Computing Machinery (ACM)
Reference89 articles.
1. A survey of multi-view representation learning;Li Yingming;IEEE Trans. Knowl. Data Eng.,2018
2. Multi-modal deep analysis for multimedia;Zhu Wenwu;IEEE Trans. Circ. Syst. Vid. Technol.,2019
3. Multi-view learning overview: Recent progress and new challenges;Zhao Jing;Inf. Fusion,2017
4. Jan Rupnik and John Shawe-Taylor. 2010. Multi-view canonical correlation analysis. In Conference on Data Mining and Data Warehouses (SiKDD’10). 1–4.
5. An overview of cross-media retrieval: Concepts, methodologies, benchmarks, and challenges;Peng Yuxin;IEEE Trans. Circ. Syst. Vid. Technol.,2017