SimCoDe-NET: Similarity Detection in Binary Code using Deep Learning Network
-
Published:2023-03-20
Issue:1
Volume:12
Page:262-267
-
ISSN:2347-470X
-
Container-title:International Journal of Electrical and Electronics Research
-
language:en
-
Short-container-title:IJEER
Author:
Poornima S.1, Mahalakshmi R.2
Affiliation:
1. Research Scholar, Department of Computer Science Engineering, Presidency University, Bangalore, Karnataka, India 2. Professor, Department of Computer Science Engineering, Presidency University, Bangalore, Karnataka, India
Abstract
Binary code similarity detection is a fundamental task in the field of computer binary security. However, code similarity is crucial today because of the prevalence of issues like plagiarism, code cloning, and recycling in software due to the ongoing increase of software scale. To resolve these issues, a novel SIMilarity detection in binary COde using DEep learning NETwork (SimCoDe-NET) has been proposed. Initially, op-code features are extracted from the input data by using reverse engineering process and the opcode embedding is generated using N-skip gram method. The extracted features are fed into Bi-GRU neural network for classifying the similarity of the binary codes. The Bi-GRU neural network compares two data samples in feature space to identify whether they belong to similar data or non-similar data. The SimCoDe-NET framework is evaluated by using generated dataset to assess the efficiency of this method. The efficacy of the proposed SimCoDe-NET framework is assessed in terms of precision, accuracy, sensitivity, recall, similarity detection time and similarity detection rate. The accuracy of the proposed method is 99.10% which is relatively high compared to the existing method. The proposed SimCoDe-NET approach improves the accuracy by 84.9%, 88.58%, and 93.9% better than jTrans, UPPC, and HEBCS respectively.
Publisher
FOREX Publication
Reference25 articles.
1. Shin, E.C.R., Song, D. and Moazzezi, R. 2015. Recognizing functions in binaries with neural networks. In 24th {USENIX} Security Symposium, 611–626. 2. Lou, A., Cheng, S., Huang J. and Jiang, F. 2019. Tfdroid: android malware detection by topics and sensitive data flows using machine learning techniques. in Proceedings of the 2019 IEEE 2nd International Conference on Information and Computer Technologies, ICICT, Hawaii, HI, USA, 30–36. 3. Shalev, N. and Partush, N. 2018. Binary similarity detection using machine learning. In: Proceedings of the 13th workshop on programming languages and analysis for security. ACM, New York, NY, USA, 42–47. 4. Egele, M., Woo, M., Chapman, P. and Brumley, D. 2014. Blanket execution: Dynamic similarity testing for program binaries and components. In Proceedings of the 23rd USENIX Conference on Security Symposium. Berkeley, CA, USA: USENIX Association, 303–317. 5. Eschweiler, S., Yakdan, K. and Gerhards-Padilla, E. 2016. Discovre: Efficient crossarchitecture identification of bugs in binary code. In Proceedings of the 2016 network and distributed systems security symposium (NDSS).
|
|