Improved Hybrid Collaborative Fitering Algorithm Based on Spark Platform-Reference-Cited by-同舟云学术

Improved Hybrid Collaborative Fitering Algorithm Based on Spark Platform

Published:2023-10 Issue:5 Volume:28 Page:451-460
ISSN:1007-1202
Container-title:Wuhan University Journal of Natural Sciences
language:
Short-container-title:Wuhan Univ. J. Nat. Sci.

Author:

YOU Zhen,HU Hongwen,WANG Yutao,XUE Jinyun,YI Xinwu

Abstract

An improved Hybrid Collaborative Filtering algorithm (H-CF) is proposed, addressing the issues of data sparsity, low recommendation accuracy, and poor scalability present in traditional collaborative filtering algorithms. The core of H-CF is a linear weighted hybrid algorithm based on the Latent Factor Model (LFM) and the Improved Item Clustering and Similarity Calculation Collaborative Filtering Algorithm (ITCSCF). To begin with, the items are clustered based on their attribute dimension, which accelerates the computation of the nearest neighbor set. Subsequently, H-CF enhances the formula for scoring similarity by penalizing popular items and optimizing unpopular items. This improvement enhances the rationality of scoring similarity and reduces the impact of data sparseness. Furthermore, a weighting function is employed to combine the various improved algorithms. The balance factor of the weighting function is dynamically adjusted to attain the optimal recommendation list. To address the real-time and scalability concerns, the algorithm leverages the Spark big data distributed cluster computing framework. Experiments were conducted using the public dataset MovieLens, where the improved algorithm's performance was compared against the algorithm before enhancement and the algorithm running on a single machine. The experimental results demonstrate that the improved algorithm outperforms in terms of data sparsity, recommendation personalization, accuracy, recall, and efficiency.

Publisher

EDP Sciences

Subject

Multidisciplinary

Link

https://wujns.edpsciences.org/10.1051/wujns/2023285451/pdf

Reference17 articles.

1. Hyper-parameter-evolutionary latent factor analysis for high-dimensional and sparse data from recommender systems

2. Yan J, Zeng Q T, Zhang F Q. Summary of recommendation algorithm research[J]. Journal of Physics: Conference Series, 2021, 1754(1): 012224.

3. A collaborative filtering recommendation system with dynamic time decay

4. Deep Item-based Collaborative Filtering for Top-N Recommendation

5. Collaborative filtering recommendation algorithm based on user fuzzy similarity

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Improve Code Summarization via Prompt-Tuning CodeT5;Wuhan University Journal of Natural Sciences;2023-12