Affiliation:
1. Dipartimento di Matematica e Informatica, University of Palermo, Via Archirafi 34, 90123 Palermo, Italy
Abstract
In this paper we are interested in the study of the combinatorial aspects related to the extension of the Burrows-Wheeler transform to a multiset of words. Such study involves the notion of suffixes and conjugates of words and is based on two different order relations, denoted by <lex and ≺ω, that, even if strictly connected, are quite different from the computational point of view. In particular, we introduce a method that only uses the <lex sorting among suffixes of a multiset of words in order to sort their conjugates according to ≺ω-order. In this study an important role is played by Lyndon words. This strategy could be used in applications specially in the field of Bioinformatics, where for instance the advent of “next-generation” DNA sequencing technologies has meant that huge collections of DNA sequences are now commonplace.
Publisher
World Scientific Pub Co Pte Lt
Subject
Computer Science (miscellaneous)
Cited by
11 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Generic Non-recursive Suffix Array Construction;ACM Transactions on Algorithms;2024-04-13
2. Constructing and indexing the bijective and extended Burrows–Wheeler transform;Information and Computation;2024-03
3. Computing the optimal BWT of very large string collections;2023 Data Compression Conference (DCC);2023-03
4. Parallel Lossy Compression for Large FASTQ Files;Biomedical Engineering Systems and Technologies;2023
5. Lossy Compressor Preserving Variant Calling through Extended BWT;Proceedings of the 15th International Joint Conference on Biomedical Engineering Systems and Technologies;2022