Combining static analysis with probabilistic models to enable market-scale Android inter-component analysis

Author:

Octeau Damien1,Jha Somesh2,Dering Matthew3,McDaniel Patrick3,Bartel Alexandre4,Li Li5,Klein Jacques5,Le Traon Yves5

Affiliation:

1. University of Wisconsin, USA / Pennsylvania State University, USA

2. University of Wisconsin, USA / IMDEA Software Institute, Spain

3. Pennsylvania State University, USA

4. TU Darmstadt, Germany

5. University of Luxembourg, Luxembourg

Abstract

Static analysis has been successfully used in many areas, from verifying mission-critical software to malware detection. Unfortunately, static analysis often produces false positives, which require significant manual effort to resolve. In this paper, we show how to overlay a probabilistic model, trained using domain knowledge, on top of static analysis results, in order to triage static analysis results. We apply this idea to analyzing mobile applications. Android application components can communicate with each other, both within single applications and between different applications. Unfortunately, techniques to statically infer Inter-Component Communication (ICC) yield many potential inter-component and inter-application links, most of which are false positives. At large scales, scrutinizing all potential links is simply not feasible. We therefore overlay a probabilistic model of ICC on top of static analysis results. Since computing the inter-component links is a prerequisite to inter-component analysis, we introduce a formalism for inferring ICC links based on set constraints. We design an efficient algorithm for performing link resolution. We compute all potential links in a corpus of 11,267 applications in 30 minutes and triage them using our probabilistic approach. We find that over 95.1% of all 636 million potential links are associated with probability values below 0.01 and are thus likely unfeasible links. Thus, it is possible to consider only a small subset of all links without significant loss of information. This work is the first significant step in making static inter-application analysis more tractable, even at large scales.

Funder

National Science Foundation

Deutsche Forschungsgemeinschaft

Fonds National de la Recherche Luxembourg

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Graphics and Computer-Aided Design,Software

Reference51 articles.

1. Solving systems of set constraints

2. Introduction to set constraint-based program analysis

3. Decidability of Systems of Set Constraints with Negative Constraints

4. AppBrain. Number of available android applications. Available from http://www.appbrain.com/stats/number-of-android-apps. AppBrain. Number of available android applications. Available from http://www.appbrain.com/stats/number-of-android-apps.

Cited by 54 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. A comprehensive framework for inter-app ICC security analysis of Android apps;Automated Software Engineering;2024-06-04

2. SIAT: A systematic inter-component communication real-time analysis technique for detecting data leak threats on Android;Journal of Computer Security;2023-11-28

3. A Component-Sensitive Static Analysis Based Approach for Modeling Intents in Android Apps;2023 IEEE International Conference on Software Maintenance and Evolution (ICSME);2023-10-01

4. On the Impact of Sample Duplication in Machine-Learning-Based Android Malware Detection;ACM Transactions on Software Engineering and Methodology;2021-07-31

5. Taming Reflection;ACM Transactions on Software Engineering and Methodology;2021-07-31

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3