Authors:
Kunya Musa, Hamada Mohamed, Hassan Mohammed, Yusuf Ilu Saratu
Abstract
Developing spreadsheet functionality frequently requires the ability to predict whether a specific section of a spreadsheet is faulty. Although spreadsheet errors are common and can have serious consequences, today’s spreadsheet creation and management tools offer only weak support for detecting, localizing, and fixing defects. In this thesis, we propose a method for predicting faults in spreadsheet formulas, which can also detect faults in non-formula cells, by combining a catalog of spreadsheet metrics with modern machine learning algorithms. An examination of the individual metrics in the catalog shows that they are suited to detecting data where a formula is expected to have flaws. The proposed framework achieved a recall of 99%, and its performance was compared with that of Melford. The experimental results show that the proposed framework outperforms the Melford framework.
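For readers unfamiliar with the reported metric, recall is the share of truly faulty cells that a classifier flags as faulty: TP / (TP + FN). The following is a minimal sketch of that calculation only. The labels are invented for illustration, and the function name and label encoding are assumptions, not taken from the thesis or from Melford.

```python
def recall_score(y_true, y_pred, positive=1):
    """Recall = TP / (TP + FN) for the class labelled `positive`."""
    # True positives: faulty cells that were predicted faulty.
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p == positive)
    # False negatives: faulty cells that the classifier missed.
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p != positive)
    if tp + fn == 0:
        raise ValueError("no positive examples in y_true")
    return tp / (tp + fn)


# Hypothetical labels (assumption): 1 = faulty cell, 0 = correct cell.
y_true = [1, 1, 1, 1, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 1, 0, 0]

# Three of the four faulty cells are found, so recall is 0.75.
example_recall = recall_score(y_true, y_pred)
```

Recall ignores false alarms (the correct cell flagged as faulty above does not lower it), so a high recall on its own does not say how many flagged cells are actually faulty.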