Abstract
With the rapid development of the Internet, the continuous increase of malware and its variants have brought greatly challenges for cyber security. Due to the imbalance of the data distribution, the research on malware detection focuses on the accuracy of the whole data sample, while ignoring the detection rate of the minority categories’ malware. In the dataset sample, the normal data samples account for the majority, while the attacks’ malware accounts for the minority. However, the minority categories’ attacks will bring great losses to countries, enterprises, or individuals. For solving the problem, this study proposed the GNGS algorithm to construct a new balance dataset for the model algorithm to pay more attention to the feature learning of the minority attacks’ malware to improve the detection rate of attacks’ malware. The traditional malware detection method is highly dependent on professional knowledge and static analysis, so we used the Self-Attention with Gate mechanism (SAG) based on the Transformer to carry out feature extraction between the local and global features and filter irrelevant noise information, then extracted the long-distance dependency temporal sequence features by the BiGRU network, and obtained the classification results through the SoftMax classifier. In the study, we used the Alibaba Cloud dataset for malware multi-classification. Compared the GSB deep learning network model with other current studies, the experimental results showed that the Gaussian noise generation strategy (GNGS) could solve the unbalanced distribution of minority categories’ malware and the SAG-BiGRU algorithm obtained the accuracy rate of 88.7% on the eight-classification, which has better performance than other existing algorithms, and the GSB model also has a good effect on the NSL-KDD dataset, which showed the GSB model is effective for other network intrusion detection.
Publisher
Public Library of Science (PLoS)
Reference53 articles.
1. L. Onwuzurike, M. Almeida, E. Mariconti, “A Family of Droids-Android Malware Detection via Behavioral Modeling: Static vs Dynamic Analysis,” 2018 16th Annual Conference on Privacy, Security and Trust, 2018.
2. A comparison of static, dynamic, and hybrid analysis for malware detection;A. Damodaran;Journal of Computer Virology and Hacking Techniques,2015
3. An effective framework of behavior detection-advanced static analysis for malware detection;M. Louk;2014 14th International Symposium on Communications and Information Technologies (ISCIT),,2014
4. APT malware static trace analysis through bigrams and graph edit distance;A. D. Bolton;Statistical Analysis and Data Mining: The ASA Data Science Journal,2017
5. J. Gajrani, J. Sarswat, M. Tripathi, et al., “A robust dynamic analysis system preventing SandBox detection by Android malware,” Proceedings of the 8th International Conference on Security of Information and Networks, 2017.