Affiliation:
1. Bishop Heber College, India
Abstract
The advancement of technology has led to an exponential increase in the volume, velocity, and variety of data generated, necessitating the development of effective methods for analyzing and extracting valuable insights from large datasets. This research focuses on enhancing big data analytics and recommendation systems using Python, specifically employing hierarchical clustering and a filtering approach with the slicing technique. This study proposes a novel approach to leverage Python's capabilities in processing and analyzing big data. Hierarchical clustering algorithms organize and structure data into hierarchical groups, enabling efficient exploration and extraction of relevant information. Additionally, a filtering mechanism is integrated with the slicing technique, allowing for identifying and extracting specific subsets of data based on predefined criteria. Experiments are conducted using real-world datasets in the context of recommendation systems to evaluate the approach's effectiveness.
Reference29 articles.
1. Akbar, I., Hashem, T., Yaqoob, I., Anuar, N.B., Mokhtar, S., Gani, A., & Khan, S.A. (2015). The rise of “big data” on cloud computing: review and open research issues. Inf Syst., 47, 98–115.
2. An empirical comparison of social, collaborative filtering, and hybrid recommenders
3. Beyer & Laney. (2012). The importance of “big data”: A definition. Gartner, Tech. Rep.
4. VDMR-DBSCAN: Varied Density MapReduce DBSCAN
5. DDSC : A Density Differentiated Spatial Clustering Technique