Affiliation:
1. Chengdu Institute of Computer Applications, Chinese Academy of Sciences, Chengdu 610299, China
2. School of Computer Science and Technology, University of Chinese Academy of Sciences, Beijing 100049, China
Abstract
As a structural indicator of dense subgraphs, k-core has been widely used in community search due to its concise and efficient calculation. Many community search algorithms have been expanded on the basis of k-core. However, relevant algorithms often set k values based on empirical analysis of datasets or require users to input manually. Once users are not familiar with the graph network structure, they may miss the optimal solution due to an improper k setting. Especially in attribute social networks, characterizing communities with only k-cores may lead to a lack of semantic interpretability of communities. Consequently, this article proposes a method for identifying the optimal k-core with the greatest attribute score in the attribute social network as the target community. The difficulty of the problem is that the query needs to integrate both structural and textual indicators of the community while fully considering the diversity of attribute scoring functions. To effectively reduce computational costs, we incorporate the topological characteristics of the k-core and the attribute characteristics of entities to construct a hierarchical forest. It is worth noting that we name tree nodes in a way similar to pre-order traversal and can maintain the order of all tree nodes during the forest creation process. In such an attribute forest, it is possible to quickly locate the initial solution containing all query vertices and reuse intermediate results during the process of expanding queries. We conducted effectiveness and performance experiments on multiple real datasets. As the results show, attribute scoring functions are not monotonic, and the algorithm proposed in this paper can avoid scores falling into local optima. With the help of the attribute k-core forest, the actual query time of the Advanced algorithm has improved by two orders of magnitude compared to the BaseLine algorithm. In addition, the average F1 score of our target community has increased by 2.04 times and 26.57% compared to ACQ and SFEG, respectively.
Funder
Sichuan Sciences and Technology Program
Reference30 articles.
1. Influential scholar recommendation model in academic social network;Li;J. Comput. Appl.,2020
2. Sozio, M., and Gionis, A. (2010, January 25–28). The community-search problem and how to plan a successful cocktail party. Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, DC, USA.
3. Learning to discover social circles in ego networks;Leskovec;Adv. Neural Inf. Process. Syst.,2012
4. Uncovering the overlapping community structure of complex networks in nature and society;Palla;Nature,2005
5. Cui, W., Xiao, Y., Wang, H., Lu, Y., and Wang, W. (2013, January 22–27). Online search of overlapping communities. Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data, New York, NY, USA.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献