An Efficient Coding Technique for Stochastic Processes-Reference-Cited by-同舟云学术

An Efficient Coding Technique for Stochastic Processes

Published:2021-12-30 Issue:1 Volume:24 Page:65
ISSN:1099-4300
Container-title:Entropy
language:en
Short-container-title:Entropy

Author:

García Jesús^ORCID,González-López Verónica^ORCID,Tasca Gustavo^ORCID,Yaginuma Karina^ORCID

Abstract

In the framework of coding theory, under the assumption of a Markov process (Xt) on a finite alphabet A, the compressed representation of the data will be composed of a description of the model used to code the data and the encoded data. Given the model, the Huffman’s algorithm is optimal for the number of bits needed to encode the data. On the other hand, modeling (Xt) through a Partition Markov Model (PMM) promotes a reduction in the number of transition probabilities needed to define the model. This paper shows how the use of Huffman code with a PMM reduces the number of bits needed in this process. We prove the estimation of a PMM allows for estimating the entropy of (Xt), providing an estimator of the minimum expected codeword length per symbol. We show the efficiency of the new methodology on a simulation study and, through a real problem of compression of DNA sequences of SARS-CoV-2, obtaining in the real data at least a reduction of 10.4%.

Publisher

MDPI AG

Subject

General Physics and Astronomy

Link

https://www.mdpi.com/1099-4300/24/1/65/pdf

Reference16 articles.

1. A universal data compression system

2. Modeling by shortest data description

3. Partition Markov Model for Covid-19 Virus

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A Metric Based on the Efficient Determination Criterion;Entropy;2024-06-19