WS-BD-Based Two-Level Match: Interesting Sequential Patterns and Bayesian Fuzzy Clustering for Predicting the Web Pages from Weblogs

Author:

Prakash Pg Om1,Jaya A2

Affiliation:

1. Research Scholar, Department of Computer Science and Engineering, B.S. Abdur Rahman Crescent Institute of Science & Technology, Chennai 600048, India

2. Professor & Head, Department of Computer Applications, B.S. Abdur Rahman Crescent Institute of Science & Technology, Chennai 600048

Abstract

Abstract The rapid increase in information and technology has led to the increased amount of web pages, which raises the complexity in sticking to relevant web pages, and the visitor suffers due to wastage of time resulting in lack of satisfaction. This paper proposes a web page prediction method using a weighed support and Bhattacharya distance-based (WS-BD) two-level match. The major aim of the proposed method is to attain customer satisfaction. Initially, interesting sequential patterns are obtained using the weighed support that filters the sequential patterns obtained using a PrefixSpan algorithm based on the frequency, duration and recurrence of the web pages. Interesting sequential patterns are clustered using the proposed dice similarity-based Bayesian fuzzy clustering, and the web page is predicted using the two-level match based on Bhattacharya distance. The experimentation is performed using the CTI and MSNBC data which proves the effectiveness of the proposed method. The proposed method shows 9.59, 21.22 and 10.17% improvement than the existing FCM-KNN in terms of precision, recall and F measure for the CTI dataset. Also, the proposed method shows 2.58, 22.17 and 7.83% improvement than the existing FCM-KNN in terms of precision, recall and F measure for the MSNBC dataset.

Publisher

Oxford University Press (OUP)

Subject

General Computer Science

Reference28 articles.

1. A web personalization system based on web usage mining techniques;Albanese,2004

2. An enhanced CBAR algorithm for improving recommendation systems accuracy;Duwairi;Simul. Model. Pract. Th.,2016

3. Non invasive estimation of blood pressure using a linear regression model from the photoplethysmogram (PPG) signal;Valsalan;Perspectivas em Ciencia da Informacao,2017

4. Support vector regression and extended nearest neighbor for video object retrieval;Ghuge;Evol. Intell.,2018

5. Author identification in text mining for used in forensics, international journal of research in advent;Ranjan;Technology,2013

Cited by 3 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Deep Fuzzy Clustering and Deep Residual Network for Prediction of Web Pages from Weblog Data with Fractional Order Based Ranking;International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems;2023-06

2. A Novel Approach for Mining Closed Clickstream Patterns;Cybernetics and Systems;2021-01-11

3. Hybrid Group Anomaly Detection for Sequence Data: Application to Trajectory Data Analytics;IEEE Transactions on Intelligent Transportation Systems;2021

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3