Affiliation:
1. Department of Computer Science, Stanford University, Stanford, California
Abstract
Detecting and representing changes to data is important for active databases, data warehousing, view maintenance, and version and configuration management. Most previous work in change management has dealt with flat-file and relational data; we focus on hierarchically structured data. Since in many cases changes must be computed from old and new versions of the data, we define the hierarchical change detection problem as the problem of finding a "minimum-cost edit script" that transforms one data tree to another, and we present efficient algorithms for computing such an edit script. Our algorithms make use of some key domain characteristics to achieve substantially better performance than previous, general-purpose algorithms. We study the performance of our algorithms both analytically and empirically, and we describe the application of our techniques to hierarchically structured documents.
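To make the notion of an edit script concrete, here is a minimal Python sketch that applies a given script to a small document tree and totals its cost. The four operation kinds (insert, delete, update, move), the unit cost per operation, and all names (`Node`, `apply_script`, `shape`) are assumptions chosen for illustration, not details taken from this abstract. The sketch does not search for a minimum-cost script; it only shows what "transforms one data tree to another" means for a script that is already given.

```python
from dataclasses import dataclass, field


@dataclass(eq=False)  # compare nodes by identity, not by field values
class Node:
    ident: str                      # hypothetical unique node identifier
    label: str                      # e.g. "doc", "para"
    value: str = ""
    children: list = field(default_factory=list)


def index(root):
    """Map each node id to (node, parent) by walking the tree."""
    table = {}
    stack = [(root, None)]
    while stack:
        node, parent = stack.pop()
        table[node.ident] = (node, parent)
        stack.extend((child, node) for child in node.children)
    return table


def apply_script(root, script):
    """Apply the operations in order, in place; return the total cost.

    Assumed cost model: every operation costs 1.
    """
    cost = 0
    for op in script:
        nodes = index(root)  # rebuilt each step because the tree changes
        kind = op[0]
        if kind == "insert":
            _, ident, label, value, parent_id, pos = op
            nodes[parent_id][0].children.insert(pos, Node(ident, label, value))
        elif kind == "delete":
            node, parent = nodes[op[1]]
            if node.children:
                raise ValueError("this sketch only deletes leaf nodes")
            parent.children.remove(node)
        elif kind == "update":
            _, ident, value = op
            nodes[ident][0].value = value
        elif kind == "move":
            _, ident, parent_id, pos = op
            node, parent = nodes[ident]
            parent.children.remove(node)
            nodes[parent_id][0].children.insert(pos, node)
        else:
            raise ValueError(f"unknown operation: {kind}")
        cost += 1
    return cost


def shape(node):
    """Nested tuple of labels and values, ignoring node ids."""
    return (node.label, node.value, tuple(shape(c) for c in node.children))


# Old and new versions of a tiny document.
old = Node("d", "doc", children=[Node("p1", "para", "a"), Node("p2", "para", "b")])
new = Node("d", "doc", children=[
    Node("x", "para", "b"), Node("y", "para", "a2"), Node("z", "para", "c"),
])

script = [
    ("move", "p2", "d", 0),               # reorder the paragraphs
    ("update", "p1", "a2"),               # change one paragraph's text
    ("insert", "p3", "para", "c", "d", 2) # append a new paragraph
]
cost = apply_script(old, script)
```

After the script runs, `shape(old) == shape(new)` holds and `cost` is 3. A change detection algorithm has the harder job of finding such a script, preferably a cheapest one, given only the two trees.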
Publisher
Association for Computing Machinery (ACM)
Subject
Information Systems, Software
Cited by
134 articles.
1. Confix: Combining node-level fix templates and masked language model for automatic program repair; Journal of Systems and Software; 2024-10
2. Alloy Repair Hint Generation Based on Historical Data; Lecture Notes in Computer Science; 2024-09-13
3. gawd: A Differencing Tool for GitHub Actions Workflows; Proceedings of the 21st International Conference on Mining Software Repositories; 2024-04-15
4. Fine-grained, accurate and scalable source differencing; Proceedings of the IEEE/ACM 46th International Conference on Software Engineering; 2024-04-12
5. 7 Dimensions of software change patterns; Scientific Reports; 2024-03-13