Automated patch assessment for program repair at scale

Published:2021-02-23 Issue:2 Volume:26 Page:
ISSN:1382-3256
Container-title:Empirical Software Engineering
language:en
Short-container-title:Empir Software Eng

Author:

Ye He, Martinez Matias, Monperrus Martin

Abstract

In this paper, we perform automatic correctness assessment of patches generated by program repair systems. We take the human-written patch as the ground-truth oracle and randomly generate tests based on it, a technique proposed by Shamshiri et al. and called Random testing with Ground Truth (RGT) in this paper. We build a curated dataset of 638 patches for Defects4J generated by 14 state-of-the-art repair systems and evaluate automated patch assessment on this dataset. The results of this study are novel and significant. First, we improve the state-of-the-art performance of automatic patch assessment with RGT by 190% by improving the oracle. Second, we show that RGT is reliable enough to help scientists perform overfitting analysis when they evaluate program repair systems. Third, we improve the external validity of program repair knowledge with the largest study ever.

Publisher

Springer Science and Business Media LLC

Subject

Software

Link

http://link.springer.com/content/pdf/10.1007/s10664-020-09920-w.pdf

References: 76 articles.

1. Arcuri A, Briand L (2011) A practical guide for using statistical tests to assess randomized algorithms in software engineering. In: Proceedings of the 33rd international conference on software engineering, ICSE ’11

2. Baresi L, Miraz M (2010) Testful: automatic unit-test generation for java classes, vol 2, pp 281–284, 01

3. Barr ET, Harman M, McMinn P, Shahbaz M, Yoo S (2015) The oracle problem in software testing: a survey. IEEE Trans Softw Eng 41(5):507–525

4. Benton S, Ghanbari A, Zhang L (2019) Defexts: a curated dataset of reproducible real-world bugs for modern jvm languages. In: Proceedings of the 41st international conference on software engineering: companion proceedings. ICSE ’19

5. Binkley D (1995) Reducing the cost of regression testing by semantics guided test case selection. In: Proceedings of the international conference on software maintenance, ICSM ’95. ISBN 0818671416. IEEE Computer Society, p 251

Cited by 46 articles.

1. Automated patch correctness predicting to fix software defect;Expert Systems with Applications;2024-12

2. Benchmarking and Categorizing the Performance of Neural Program Repair Systems for Java;ACM Transactions on Software Engineering and Methodology;2024-08-19

3. On the acceptance by code reviewers of candidate security patches suggested by Automated Program Repair tools;Empirical Software Engineering;2024-08-03

4. T5APR: Empowering automated program repair across languages through checkpoint ensemble;Journal of Systems and Software;2024-08

5. Automated program repair for variability bugs in software product line systems;Journal of Systems and Software;2024-08