Regular Expressions Avoiding Absorbing Patterns and the Significance of Uniform Distribution
-
Published:2024-07-31
Issue:
Volume:
Page:1-18
-
ISSN:0129-0541
-
Container-title:International Journal of Foundations of Computer Science
-
language:en
-
Short-container-title:Int. J. Found. Comput. Sci.
Author:
Broda Sabine1,
Machiavelo António1,
Moreira Nelma1,
Reis Rogério1
Affiliation:
1. CMUP & DM-DCC, Faculdade de Ciências da Universidade do Porto, Rua do Campo Alegre, 4169-007 Porto, Portugal
Abstract
Although regular expressions do not correspond univocally to regular languages, it is still worthwhile to study their properties and algorithms. Average-case analysis often relies on uniform random generation from a specific grammar for regular expressions, which can represent regular languages with more or less redundancy. Generators that are uniform on the set of expressions are not necessarily uniform on the set of regular languages. Nevertheless, it is not obvious that asymptotic estimates obtained by considering the whole set of regular expressions differ from those obtained using a more refined set that avoids some large class of equivalent expressions. In this paper we study a set of expressions that avoid a given absorbing pattern. It is shown that, although this set is significantly smaller than the standard one, the asymptotic average estimates for the size of the Glushkov automaton for these expressions do not differ from the standard case. Furthermore, for this set the asymptotic density of ε-accepting expressions is also the same as for the set of all standard regular expressions.
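As a small illustration of what a "density of ε-accepting expressions" means, the following sketch counts expressions exactly for small sizes. It is not the paper's method. It assumes a common grammar (ε, the letters of a k-letter alphabet, union, concatenation, and Kleene star) and measures size as the number of symbols in the syntax tree; the paper's grammar and size measure may differ. The function names `count_by_nullability` and `nullable_density` are hypothetical.

```python
from fractions import Fraction


def count_by_nullability(max_size, k):
    """Count expressions of each size, split by whether they accept ε.

    Assumed grammar: ε | letter (k choices) | (α + α) | (α · α) | α*.
    Assumed size: number of nodes in the syntax tree.
    Returns two lists (nullable, non_nullable) indexed by size.
    """
    nul = [0] * (max_size + 1)
    non = [0] * (max_size + 1)
    if max_size >= 1:
        nul[1] = 1   # the expression ε
        non[1] = k   # one expression per letter
    for n in range(2, max_size + 1):
        # Star: any expression of size n - 1, and the result accepts ε.
        nul[n] += nul[n - 1] + non[n - 1]
        # Binary operators: the root uses one symbol, the rest is split i + j.
        for i in range(1, n - 1):
            j = n - 1 - i
            pairs = (nul[i] + non[i]) * (nul[j] + non[j])
            both_nul = nul[i] * nul[j]
            none_nul = non[i] * non[j]
            # Union accepts ε if at least one operand does.
            nul[n] += pairs - none_nul
            non[n] += none_nul
            # Concatenation accepts ε only if both operands do.
            nul[n] += both_nul
            non[n] += pairs - both_nul
    return nul, non


def nullable_density(n, k):
    """Exact fraction of size-n expressions that accept ε (n >= 1)."""
    nul, non = count_by_nullability(n, k)
    return Fraction(nul[n], nul[n] + non[n])
```

Calling `nullable_density(n, 2)` for increasing `n` gives the exact finite-size ratios whose limit is the kind of asymptotic density the abstract refers to. Restricting the count to expressions that avoid an absorbing pattern would require a different grammar, which this sketch does not attempt.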
Funder
Fundação para a Ciência e a Tecnologia, I.P.
Publisher
World Scientific Pub Co Pte Ltd