Con&Net: A Cross-Network Anchor Link Discovery Method Based on Embedding Representation

Author:

Wang Xueyuan1,Zhang Hongpo1ORCID,Wang Zongmin1,Qiao Yaqiong2,Ma Jiangtao3,Dai Honghua4

Affiliation:

1. Cooperative Innovation Center of Internet Healthcare, Zhengzhou University, Zhengzhou, Henan, China

2. College of Information Engineering, North China University of Water Resources and Electric Power, Zhengzhou, Henan, China

3. College of Computer and Communication Engineering, Zhengzhou University of Light Industry, Zhengzhou, Henan, China

4. Institute of Intelligent System, Deakin University, Burwood, VIC, Australia

Abstract

Cross-network anchor link discovery is an important research problem and has many applications in heterogeneous social network. Existing schemes of cross-network anchor link discovery can provide reasonable link discovery results, but the quality of these results depends on the features of the platform. Therefore, there is no theoretical guarantee to the stability. This article employs user embedding feature to model the relationship between cross-platform accounts, that is, the more similar the user embedding features are, the more similar the two accounts are. The similarity of user embedding features is determined by the distance of the user features in the latent space. Based on the user embedding features, this article proposes an embedding representation-based method Con&Net(Content and Network) to solve cross-network anchor link discovery problem. Con&Net combines the user’s profile features, user-generated content (UGC) features, and user’s social structure features to measure the similarity of two user accounts. Con&Net first trains the user’s profile features to get profile embedding. Then it trains the network structure of the nodes to get structure embedding. It connects the two features through vector concatenating, and calculates the cosine similarity of the vector based on the embedding vector. This cosine similarity is used to measure the similarity of the user accounts. Finally, Con&Net predicts the link based on similarity for account pairs across the two networks. A large number of experiments in Sina Weibo and Twitter networks show that the proposed method Con&Net is better than state-of-the-art method. The area under the curve (AUC) value of the receiver operating characteristic (ROC) curve predicted by the anchor link is 11% higher than the baseline method, and Precision@30 is 25% higher than the baseline method.

Funder

Integration of Cloud Computing and Big Integration of Cloud Computing and Big Data, Innovation of Science and Education

Key Research, Development, and Dissemination Program of Henan Province

Key Science and Technology Project of Xinjiang Production and Construction Corps

Henan Province Science and Technology Department Foundation

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Reference41 articles.

Cited by 1 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3