Affiliation:
1. University of Copenhagen, Copenhagen, Denmark
Abstract
We present a new sound and complete axiomatization of regular expression containment. It consists of the conventional axiomatization of concatenation, alternation, empty set and (the singleton set containing) the empty string as an idempotent semiring, the fixed- point rule
E
* = 1 +
E
×
E
* for Kleene-star, and a general coinduction rule as the only additional rule.
Our axiomatization gives rise to a natural computational interpretation of regular expressions as simple types that represent parse trees, and of containment proofs as
coercions
. This gives the axiom- atization a Curry-Howard-style constructive interpretation: Containment proofs do not only certify a language-theoretic contain- ment, but, under our computational interpretation, constructively transform a membership proof of a string in one regular expression into a membership proof of the same string in another regular expression.
We show how to encode regular expression equivalence proofs in Salomaa's, Kozen's and Grabmayer's axiomatizations into our containment system, which equips their axiomatizations with a computational interpretation and implies completeness of our axiomatization. To ensure its soundness, we require that the computational interpretation of the coinduction rule be a hereditarily total function. Hereditary totality can be considered the mother of syn- tactic side conditions: it "explains" their soundness, yet cannot be used as a conventional side condition in its own right since it turns out to be undecidable.
We discuss application of
regular expressions as types
to bit coding of strings and hint at other applications to the wide-spread use of regular expressions for substring matching, where classical automata-theoretic techniques are
a priori
inapplicable.
Neither regular expressions as types nor subtyping interpreted coercively are novel
per se
. Somewhat surprisingly, this seems to be the first investigation of a general proof-theoretic framework for the latter in the context of the former, however.
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Graphics and Computer-Aided Design,Software
Cited by
6 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. A Verified Operational Semantics for Regular Expression Parsing;Proceedings of the XXVII Brazilian Symposium on Programming Languages;2023-09-25
2. Kleenex: compiling nondeterministic transducers to deterministic streaming transducers;ACM SIGPLAN Notices;2016-04-08
3. Kleenex: compiling nondeterministic transducers to deterministic streaming transducers;Proceedings of the 43rd Annual ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages;2016-01-11
4. Simple linear string constraints;Formal Aspects of Computing;2013-11
5. Two-Pass Greedy Regular Expression Parsing;Implementation and Application of Automata;2013