Affiliation:
1. Jaypee Institute of Information Technology
Abstract
In this paper, we put forward a technique for keeping web pages up to date so that a search engine can later use them to serve end-user queries. A major part of the Web is dynamic, so the changed web documents in a search engine's repository must be updated constantly. We use a client-server architecture for crawling the Web and propose a technique for detecting changes in a web page based on the content of the images, if any, present in the web document. Once an image embedded in the web document is identified as changed, the previous copy of the web document in the search engine's database/repository is replaced with the changed one.
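The change-detection idea described in the abstract can be illustrated with a minimal sketch. The abstract does not say how image content is compared, so the SHA-256 digest comparison used here is an assumption, not the authors' method. The names `ImageSrcParser`, `image_fingerprint`, `refresh_page`, and the `fetch_image` callback are hypothetical, and the repository is modelled as a plain dictionary.

```python
import hashlib
from html.parser import HTMLParser


class ImageSrcParser(HTMLParser):
    """Collects the src attribute of every <img> tag in a page."""

    def __init__(self):
        super().__init__()
        self.sources = []

    def handle_starttag(self, tag, attrs):
        if tag == "img":
            src = dict(attrs).get("src")
            if src:
                self.sources.append(src)


def image_fingerprint(html, fetch_image):
    """Map each image URL in the page to a digest of the image bytes.

    fetch_image is a caller-supplied function (hypothetical) that
    returns the raw bytes for an image URL.
    """
    parser = ImageSrcParser()
    parser.feed(html)
    return {
        src: hashlib.sha256(fetch_image(src)).hexdigest()
        for src in parser.sources
    }


def refresh_page(repository, url, html, fetch_image):
    """Replace the stored copy of url if its images changed.

    Returns True when the repository was updated, False otherwise.
    """
    new_fingerprint = image_fingerprint(html, fetch_image)
    stored = repository.get(url)
    if stored is not None and stored["images"] == new_fingerprint:
        return False  # same images as the stored copy: keep it
    # First crawl, or at least one image was added, removed, or altered.
    repository[url] = {"html": html, "images": new_fingerprint}
    return True
```

In this sketch a crawler would call `refresh_page` on each revisit and overwrite the stored copy only when the call returns `True`. A byte-level digest flags any change to the image file, including re-encoding; a comparison that tolerates such changes would need a different fingerprint. A page with no images always yields an empty fingerprint, so it would need some other change signal.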
Publisher
Trans Tech Publications, Ltd.
Cited by
1 article.
1. Image to Image Search Engine; International Journal of Advanced Research in Science, Communication and Technology; 2023-04-28