Authors:
Yu Hsiang‐Fu, Chen Yi‐Ming, Tseng Li‐Ming
Abstract
An archive is a file containing several related files. Many Internet resources, such as freeware, shareware and trial software, are often packaged into archives for easy installation and distribution. Additionally, thousands of users search for archives and download them from different sources every day. This paper extends previous research on archive downloading via proxy cache to support archive searching. Internet proxy cache servers are used to gather a significant number of Web pages, detect those that contain archive links, and then use the obtained data to search archives by description or filename. Two schemes, iterative and backtracking, are proposed to obtain Web pages with archive links. The experimental results indicate that both schemes achieve about the same precision; however, the backtracking scheme reduces the number of checked pages by a factor of 26. Finally, a real system was implemented to demonstrate the proposed approaches.
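The detection and search steps described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the extension list, the use of anchor text as the archive description, and the names `ArchiveLinkParser`, `find_archive_links` and `search` are all assumptions made for illustration. The iterative and backtracking schemes are not sketched, because the abstract does not specify them.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse
import posixpath

# Assumed set of archive extensions; the paper's actual list is not stated here.
ARCHIVE_EXTENSIONS = (".zip", ".tar", ".gz", ".tgz", ".bz2", ".rar")


class ArchiveLinkParser(HTMLParser):
    """Collects (url, filename, description) for each archive link in a page."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []
        self._href = None   # absolute URL of the archive link being read
        self._text = []     # anchor text fragments for that link

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            # Compare only the URL path, so query strings do not hide the extension.
            if href and urlparse(href).path.lower().endswith(ARCHIVE_EXTENSIONS):
                self._href = urljoin(self.base_url, href)
                self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            filename = posixpath.basename(urlparse(self._href).path)
            # Assumption: the anchor text serves as the archive's description.
            description = " ".join("".join(self._text).split())
            self.links.append((self._href, filename, description))
            self._href = None


def find_archive_links(html, base_url):
    """Return the archive links found in one cached Web page."""
    parser = ArchiveLinkParser(base_url)
    parser.feed(html)
    parser.close()
    return parser.links


def search(index, query):
    """Case-insensitive substring search over filename and description."""
    q = query.lower()
    return [entry for entry in index
            if q in entry[1].lower() or q in entry[2].lower()]
```

A page yields a non-empty list from `find_archive_links` only if it contains at least one archive link, so the same function can double as the page-level detector; the collected tuples then form the index that `search` queries by filename or description.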
Subject
Economics and Econometrics,Sociology and Political Science,Communication
References: 19 articles.
Cited by
1 article.