Affiliation:
1. Hasselt University and Transnational University of Limburg
Abstract
We study the succinctness of the complement and intersection of regular expressions. In particular, we show that when constructing a regular expression defining the complement of a given regular expression, a double exponential size increase cannot be avoided. Similarly, when constructing a regular expression defining the intersection of a fixed and an arbitrary number of regular expressions, an exponential and double exponential size increase, respectively, cannot be avoided. All mentioned lower bounds improve the existing ones by one exponential and are tight in the sense that the target expression can be constructed in the corresponding time class, that is, exponential or double exponential time. As a by-product, we generalize a theorem by Ehrenfeucht and Zeiger stating that there is a class of DFAs which are exponentially more succinct than regular expressions, to a fixed alphabet. When the given regular expressions are one-unambiguous, as for instance required by the XML Schema specification, the complement can be computed in polynomial time whereas the bounds concerning intersection continue to hold. For the subclass of single-occurrence regular expressions, we prove a tight exponential lower bound for intersection.
Funder
Seventh Framework Programme
Publisher
Association for Computing Machinery (ACM)
Subject
Computational Mathematics,Logic,General Computer Science,Theoretical Computer Science
Reference45 articles.
1. Aho A. Hopcroft J. and Ullman J. 1974. The Design and Analysis of Computer Algorithms. Addison-Wesley. Aho A. Hopcroft J. and Ullman J. 1974. The Design and Analysis of Computer Algorithms . Addison-Wesley.
2. Bray T. Paoli J. Sperberg-McQueen C. Maler E. and Yergeau F. 2004. Extensible Markup Language (XML). Tech. rep. World Wide Web Consortium. http://www.w3.org/TR/REC-xml/. Bray T. Paoli J. Sperberg-McQueen C. Maler E. and Yergeau F. 2004. Extensible Markup Language (XML). Tech. rep. World Wide Web Consortium. http://www.w3.org/TR/REC-xml/.
3. Inference of concise regular expressions and DTDs
4. Ambiguity in Graphs and Expressions
Cited by
26 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Linear Matching of JavaScript Regular Expressions;Proceedings of the ACM on Programming Languages;2024-06-20
2. Lean Formalization of Extended Regular Expression Matching with Lookarounds;Proceedings of the 13th ACM SIGPLAN International Conference on Certified Programs and Proofs;2024-01-09
3. Negation-closure for JSON Schema;Theoretical Computer Science;2023-04
4. Regular expression length via arithmetic formula complexity;Journal of Computer and System Sciences;2022-05
5. Games for Succinctness of Regular Expressions;Electronic Proceedings in Theoretical Computer Science;2021-09-17