Off-line Serial Exact String Searching

Author:

Crochemore M.

Abstract

String searching or string-matching is the problem of locating all occurrences of a string x of length m, called the pattern, in another string t of length n, called the text. The algorithmic complexity of the problem is analyzed by means of standard measures: running time and amount of memory space required by the computations. This chapter deals with solutions in which the pattern only is preprocessed. There are mainly three kinds of methods to solve the problem: sequential methods simulating a finite automaton, practically fast methods, and time-space optimal methods. Alternative solutions based on a preprocessing of the text are described in Chapter 3. Parallel algorithms for the problem, presented in Chapter 2, sometimes also generate new serial algorithms. Finally, methods that search for approximate occurrences of a pattern are the subject of Chapters 4, 5, and 6. The problem is of main importance for several reasons. From a theoretical point of view, it is a paradigm for the design of efficient algorithms, in the same way as are, for instance, sorting methods. From a practical point of view, the algorithms developed in this chapter often serve as basic components in text facility software. In the whole chapter, x denotes the pattern of length m (m = |x|), and t is the text of length n (n = |t|). To avoid trivial situations, the pattern is assumed to be a non-empty string (x ≠ λ, m ≠ 0). Since the problem becomes a simple counting exercise when the alphabet reduces to only one symbol, the reader may consider in the following that the common alphabet Σ of the text and the pattern contains at least two symbols (|Σ| > 1). We assume that the pattern is given first. The text is given at search time only. This allows us to preprocess the pattern in order to accelerate the future search in the text. An instance of the problem arises when we try to recognize a specific pattern in various texts or streams of symbols arriving through a communication channel.

Publisher

Oxford University Press

Cited by 1 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. String Matching Algorithm Using Multi-Characters Inverted Lists;WSEAS TRANSACTIONS ON COMPUTERS;2023-10-03

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3