A Context-Aware Neural Embedding for Function-Level Vulnerability Detection

Published: 2021-11-17 Issue: 11 Volume: 14 Page: 335
ISSN: 1999-4893
Container-title: Algorithms
Language: en
Short-container-title: Algorithms

Author:

Wei Hongwei, Lin Guanjun, Li Lin, Jia Heming

Abstract

Exploitable vulnerabilities in software systems are a major security concern. To date, machine learning (ML) based solutions have been proposed to automate and accelerate vulnerability detection. Most ML techniques aim to isolate a unit of source code, be it a line or a function, as being vulnerable. We argue that a code segment is vulnerable if it appears in certain semantic contexts, such as particular control flow and data flow; detection should therefore be context aware. In this paper, we evaluate the performance of mainstream word embedding techniques for software vulnerability detection. Based on the evaluation, we propose a supervised framework that uses pre-trained, context-aware Embeddings from Language Models (ELMo) to capture deep contextual representations, which are then summarized by a bidirectional long short-term memory (Bi-LSTM) layer to learn long-range code dependencies. The framework takes a source code function directly as input and produces a corresponding function embedding, which can be used as a feature set for conventional ML classifiers. Experimental results showed that the proposed framework yielded the best performance in its downstream detection tasks. Using the feature representations generated by our framework, random forest and support vector machine classifiers outperformed four baseline systems on our data sets, demonstrating that the framework incorporating ELMo can effectively capture vulnerable data flow patterns and facilitate the vulnerability detection task.
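
The data flow described in the abstract (contextual token vectors, a bidirectional recurrent summary, and a fixed-length function embedding) can be illustrated with a toy sketch. This is not the authors' implementation. It assumes a seeded random generator as a stand-in for a pre-trained ELMo model, a gate-free tanh recurrence in place of an LSTM cell, and untrained, hand-picked weights; the names `token_vector`, `run_direction`, `function_embedding`, and `DIM` are hypothetical.

```python
import math
import random

DIM = 8  # hypothetical vector width, chosen only for illustration


def token_vector(token, left, right):
    """Stand-in for a contextual embedding (assumption, not ELMo itself).

    The vector depends on the token and its two neighbours, so the same
    token receives different vectors in different contexts.
    """
    rng = random.Random(f"{left}|{token}|{right}")
    return [rng.uniform(-1.0, 1.0) for _ in range(DIM)]


def embed_tokens(tokens):
    """Map each token of a function to a context-dependent vector."""
    padded = ["<s>"] + list(tokens) + ["</s>"]
    return [
        token_vector(padded[i], padded[i - 1], padded[i + 1])
        for i in range(1, len(padded) - 1)
    ]


def run_direction(vectors, w_in=0.7, w_rec=0.5):
    """Gate-free tanh recurrence; a simplification of an LSTM cell.

    The scalar weights are untrained placeholders.
    """
    state = [0.0] * DIM
    for vec in vectors:
        state = [math.tanh(w_in * x + w_rec * h) for x, h in zip(vec, state)]
    return state


def function_embedding(tokens):
    """Concatenate the final forward and backward states (length 2 * DIM)."""
    vectors = embed_tokens(tokens)
    forward = run_direction(vectors)
    backward = run_direction(list(reversed(vectors)))
    return forward + backward


# Two tokenized functions that differ only in whether a length check exists.
unchecked = ["strcpy", "(", "buf", ",", "src", ")", ";"]
checked = ["if", "(", "len", "<", "n", ")"] + unchecked

features = [function_embedding(unchecked), function_embedding(checked)]
print(len(features[0]), features[0] != features[1])
```

In a trained system, fixed-length vectors such as `features` would serve as the input rows for a conventional classifier such as a random forest or a support vector machine, as the abstract describes.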

Funder

Natural Science Foundation Project of Fujian Province

Publisher

MDPI AG

Subject

Computational Mathematics, Computational Theory and Mathematics, Numerical Analysis, Theoretical Computer Science

Link

https://www.mdpi.com/1999-4893/14/11/335/pdf

References: 62 articles.

1. Data-Driven Cybersecurity Incident Prediction: A Survey

2. Detecting and Preventing Cyber Insider Threats: A Survey

3. Android HIV: A Study of Repackaging Malware for Evading Machine-Learning Detection

4. Data-Driven Cyber Security in Perspective—Intelligent Traffic Analysis

5. VulDeePecker: A Deep Learning-Based System for Vulnerability Detection; Li; arXiv, 2018

Cited by 8 articles.

1. Research on Computer Network Security Vulnerabilities and Encryption Technology in Cloud Computing Environment; Applied Mathematics and Nonlinear Sciences; 2024-01-01

2. Static vulnerability detection based on class separation; Journal of Systems and Software; 2023-12

3. The EarlyBIRD Catches the Bug: On Exploiting Early Layers of Encoder Models for More Efficient Code Classification; Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering; 2023-11-30

4. AI and Blockchain-based source code vulnerability detection and prevention system for multiparty software development; Computers and Electrical Engineering; 2023-03

5. Semantic-based vulnerability detection by functional connectivity of gated graph sequence neural networks; Soft Computing; 2023-01-02