Abstract
Under the influence of rapid development in the sphere of information technologies, rises the challenge related to detection of malicious information sources on the Internet. To solve this we can use machine learning methods as one of the most popular and powerful tools designed to identify dependencies between input (observed) data and output (desired) results. This article presents a methodology which is aimed at multi-level processing of input data about malicious information objects on the Internet and providing their multi-aspect assessment and categorization using machine learning methods. The purpose of the investigation is to improve the efficiency of the detecting process of malicious information on the Internet using the examples of Web-pages classification.
Publisher
Bonch-Bruevich State University of Telecommunications
Reference11 articles.
1. Hayes P.J., Andersen P.M., Nirenburg I.B., Schmandt L.M. TCS: a shell for content-based text categorization. Proceedings of the Sixth Conference on Artificial Intelligence Applications, 5‒9 May 1990, Santa Barbara, USA. Piscataway, NJ: IEEE; 1990. vol.1. p.320–326. Available from: https://doi.org/10.1109/CAIA.1990.89206
2. Apté C., Damerau F., Weiss S.M. Automated learning of decision rules for text categorization. ACM Transactions on Information Systems (TOIS). 1994;12(3):233–251. Available from: https://doi.org/10.1145/183422.183423
3. Salton G., Buckley C. Term-weighting approaches in automatic text retrieval. Information Processing & Management. 1988;24(5):513–523. Available from: https://doi.org/10.1016/0306-4573(88)90021-0
4. Fattah M.A. A Novel Statistical Feature Selection Approach for Text Categorization. Journal of Information Processing Systems. 2017;13(5):1397–1409.
5. Lewis D.D., Ringuette M. A Comparison of Two Learning Algorithms for Text Categorization. In: Third Annual Symposium on Document Analysis and Information Retrieval. 1994. p.81–93.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献