Affiliation:
1. University of Massachusetts at Amherst, USA
2. Microsoft Research, USA
Abstract
With hundreds of millions of users, spreadsheets are one of the most important end-user applications. Spreadsheets are easy to use and allow users great flexibility in storing data. This flexibility comes at a price: users often treat spreadsheets as a poor man's database, leading to creative solutions for storing high-dimensional data. The trouble arises when users need to answer queries with their data. Data manipulation tools make strong assumptions about data layouts and cannot read these ad-hoc databases. Converting data into the appropriate layout requires programming skills or a major investment in manual reformatting. The effect is that a vast amount of real-world data is "locked-in" to a proliferation of one-off formats. We introduce FlashRelate, a synthesis engine that lets ordinary users extract structured relational data from spreadsheets without programming. Instead, users extract data by supplying examples of output relational tuples. FlashRelate uses these examples to synthesize a program in Flare. Flare is a novel extraction language that extends regular expressions with geometric constructs. An interactive user interface on top of FlashRelate lets end users extract data by point-and-click. We demonstrate that correct Flare programs can be synthesized in seconds from a small set of examples for 43 real-world scenarios. Finally, our case study demonstrates FlashRelate's usefulness in addressing the widespread problem of data trapped in corporate and government formats.
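To make the idea of combining regular expressions with geometric constraints concrete, the following is a minimal hand-written sketch in Python. It is not Flare syntax and does not perform synthesis from examples; it only illustrates the kind of extraction such a program might express. The sheet contents, the `GRID` layout, and the `extract` function are hypothetical and chosen purely for illustration: a cell is accepted as a value when it matches a numeric pattern, and its country and year are found by fixed spatial relationships (leftmost cell in the same row, topmost cell in the same column).

```python
import re

# Hypothetical cross-tab sheet (an assumption for illustration):
# years run across the top row, countries down the first column.
GRID = [
    ["",       "2019", "2020"],
    ["France", "10",   "12"],
    ["Peru",   "7",    "n/a"],
]

VALUE_RE = re.compile(r"\d+")        # a data cell holds an integer
YEAR_RE = re.compile(r"\d{4}")       # a column header holds a 4-digit year
NAME_RE = re.compile(r"[A-Za-z ]+")  # a row header holds a name


def extract(grid):
    """Return (country, year, value) tuples from a cross-tab grid.

    Each tuple is anchored on a cell matching VALUE_RE; the other two
    fields are located geometrically relative to that anchor cell.
    """
    tuples = []
    for r, row in enumerate(grid):
        for c, cell in enumerate(row):
            if r == 0 or c == 0:
                continue  # header row and header column are not data cells
            if not VALUE_RE.fullmatch(cell):
                continue  # content constraint on the anchor cell
            name = grid[r][0]  # spatial constraint: same row, first column
            year = grid[0][c]  # spatial constraint: same column, first row
            if NAME_RE.fullmatch(name) and YEAR_RE.fullmatch(year):
                tuples.append((name, year, cell))
    return tuples


if __name__ == "__main__":
    for t in extract(GRID):
        print(t)
```

In this sketch the `n/a` cell is skipped because it fails the numeric pattern, so the Peru row contributes only one tuple. In the approach the abstract describes, the user would instead supply a few such output tuples as examples and the constraints would be inferred automatically.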
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Graphics and Computer-Aided Design, Software
Cited by
26 articles.
1. Anomaly Detection in IoT: Recent Advances, AI and ML Perspectives and Applications;Artificial Intelligence;2024-01-17
2. Programming by Example Made Easy;ACM Transactions on Software Engineering and Methodology;2023-11-24
3. Evolution of Editing Scripts From Examples;Proceedings of the Companion Conference on Genetic and Evolutionary Computation;2023-07-15
4. Auto-Tables: Synthesizing Multi-Step Transformations to Relationalize Tables without Using Examples;Proceedings of the VLDB Endowment;2023-07
5. WebRobot: web robotic process automation using interactive programming-by-demonstration;Proceedings of the 43rd ACM SIGPLAN International Conference on Programming Language Design and Implementation;2022-06-09