Affiliation:
1. Univ. del Piemonte Orientale, Alessandria, Italy
Abstract
The Burrows-Wheeler Transform (also known as Block-Sorting) is at the base of compression algorithms that are the state of the art in lossless data compression. In this paper, we analyze two algorithms that use this technique. The first is the original algorithm described by Burrows and Wheeler, which, despite its simplicity, outperforms the Gzip compressor. The second uses an additional run-length encoding step to improve compression. We prove that the compression ratio of both algorithms can be bounded in terms of the k-th order empirical entropy of the input string for any k ≥ 0. We make no assumptions on the input, and we obtain bounds that hold in the worst case, that is, for every possible input string. All previous results for Block-Sorting algorithms were concerned with the average compression ratio and were established assuming that the input comes from a finite-order Markov source.
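The two quantities the abstract relies on, the Burrows-Wheeler Transform and the k-th order empirical entropy, can be illustrated with a minimal Python sketch. It is not taken from the paper. It assumes the usual textbook definitions: the transform is the last column of the sorted rotations of the input plus an end marker, and the k-th order empirical entropy is the length-weighted order-0 entropy of the symbols that follow each length-k context, divided by the input length. The function names (`bwt`, `h0`, `hk`), the `$` end marker, and the naive rotation sort are illustrative choices. The sketch does not reproduce either of the two compression algorithms analyzed in the paper.

```python
from collections import Counter, defaultdict
from math import log2


def bwt(s: str, sentinel: str = "$") -> str:
    """Naive Burrows-Wheeler Transform (quadratic space, for illustration only).

    Assumption: a sentinel that does not occur in s and sorts before every
    other symbol is appended as an end-of-string marker.
    """
    if sentinel in s:
        raise ValueError("sentinel must not occur in the input")
    t = s + sentinel
    # Sort all cyclic rotations and keep the last character of each one.
    rotations = sorted(t[i:] + t[:i] for i in range(len(t)))
    return "".join(rot[-1] for rot in rotations)


def h0(s: str) -> float:
    """Order-0 empirical entropy of s, in bits per symbol."""
    n = len(s)
    if n == 0:
        return 0.0
    return -sum(c / n * log2(c / n) for c in Counter(s).values())


def hk(s: str, k: int) -> float:
    """k-th order empirical entropy of s, in bits per symbol.

    For each length-k context w, collect the symbols that follow w in s,
    then weight the order-0 entropy of that collection by its length.
    """
    if k == 0:
        return h0(s)
    followers = defaultdict(list)
    for i in range(len(s) - k):
        followers[s[i:i + k]].append(s[i + k])
    return sum(len(f) * h0("".join(f)) for f in followers.values()) / len(s)


if __name__ == "__main__":
    print(bwt("banana"))        # transformed string, end marker included
    print(h0("abababab"))       # order-0 entropy: symbols look random
    print(hk("abababab", 1))    # order-1 entropy: each context predicts the next symbol
```

In the example, `abababab` has an order-0 entropy of 1 bit per symbol but an order-1 entropy of 0, since each symbol is fully determined by its predecessor. This gap is why a bound in terms of the k-th order entropy can be much stronger than an order-0 bound.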
Publisher
Association for Computing Machinery (ACM)
Subject
Artificial Intelligence, Hardware and Architecture, Information Systems, Control and Systems Engineering, Software
Cited by
229 articles.