Towards effective assessment of steady state performance in Java software: are we there yet?-Reference-Cited by-同舟云学术

Towards effective assessment of steady state performance in Java software: are we there yet?

Published:2022-11-28 Issue:1 Volume:28 Page:
ISSN:1382-3256
Container-title:Empirical Software Engineering
language:en
Short-container-title:Empir Software Eng

Author:

Traini Luca^ORCID,Cortellessa Vittorio^ORCID,Di Pompeo Daniele^ORCID,Tucci Michele^ORCID

Abstract

AbstractMicrobenchmarking is a widely used form of performance testing in Java software. A microbenchmark repeatedly executes a small chunk of code while collecting measurements related to its performance. Due to Java Virtual Machine optimizations, microbenchmarks are usually subject to severe performance fluctuations in the first phase of their execution (also known as warmup). For this reason, software developers typically discard measurements of this phase and focus their analysis when benchmarks reach a steady state of performance. Developers estimate the end of the warmup phase based on their expertise, and configure their benchmarks accordingly. Unfortunately, this approach is based on two strong assumptions: (i) benchmarks always reach a steady state of performance and (ii) developers accurately estimate warmup. In this paper, we show that Java microbenchmarks do not always reach a steady state, and often developers fail to accurately estimate the end of the warmup phase. We found that a considerable portion of studied benchmarks do not hit the steady state, and warmup estimates provided by software developers are often inaccurate (with a large error). This has significant implications both in terms of results quality and time-effort. Furthermore, we found that dynamic reconfiguration significantly improves warmup estimation accuracy, but still it induces suboptimal warmup estimates and relevant side-effects. We envision this paper as a starting point for supporting the introduction of more sophisticated automated techniques that can ensure results quality in a timely fashion.

Publisher

Springer Science and Business Media LLC

Subject

Software

Link

https://link.springer.com/content/pdf/10.1007/s10664-022-10247-x.pdf

Reference53 articles.

1. AlGhamdi H M, Bezemer C P, Shang W, Hassan A E, Flora P (2020) Towards reducing the time needed for load testing. J Softw: Evol Process e2276. https://doi.org/10.1002/smr.2276. https://onlinelibrary.wiley.com/doi/abs/10.1002/smr.2276, smr.2276

2. Antoch J, Huškova M, Prášková Z (1997) Effect of dependence on statistics for determination of change. J Stat Plan Inference 60(2):291–310. https://doi.org/10.1016/S0378-3758(96)00138-3. https://www.sciencedirect.com/science/article/pii/S0378375896001383

3. Bagley D, Fulgham B, Gouy I (2004) The computer language benchmarks game. https://benchmarksgame-team.pages.debian.net/benchmarksgame. Accessed: 2021-10-12

4. Barrett E, Bolz-Tereick C F, Killick R, Mount S, Tratt L (2017) Virtual machine warmup blows hot and cold. Proc ACM Program Lang 1(OOPSLA). https://doi.org/10.1145/3133876

5. Beller M, Gousios G, Zaidman A (2017) Oops, my tests broke the build: an explorative analysis of travis ci with github. In: 2017 IEEE/ACM 14th international conference on mining software repositories (MSR). https://doi.org/10.1109/MSR.2017.62, pp 356–367

Cited by 15 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. OptiFeat: Enhancing Feature Selection, A Hybrid Approach Combining Subject Matter Expertise and Recursive Feature Elimination Method;2024-08-13

2. Evaluating Search-Based Software Microbenchmark Prioritization;IEEE Transactions on Software Engineering;2024-07

3. An Empirical Study on Code Coverage of Performance Testing;Proceedings of the 28th International Conference on Evaluation and Assessment in Software Engineering;2024-06-18

4. Time Series Forecasting of Runtime Software Metrics: An Empirical Study;Proceedings of the 15th ACM/SPEC International Conference on Performance Engineering;2024-05-07

5. VAMP: Visual Analytics for Microservices Performance;Proceedings of the 39th ACM/SIGAPP Symposium on Applied Computing;2024-04-08