Single and Binary Performance Comparison of Data Compression Algorithms for Text Files
Author:
KESKİN Serkan1ORCID, SEVLİ Onur1ORCID, OKATAN Ersan1ORCID
Affiliation:
1. BURDUR MEHMET AKİF ERSOY ÜNİVERSİTESİ
Abstract
Data compression is a technique used to reduce the size of a file. To reduce the size of a file, unnecessary information is removed or parts that repeat the same information are stored once. Thus a lossless compression is achieved. The extracted file has all the features of the compressed original file and can be used in the same way. Data compression can be done using different techniques. Some of these techniques are Huffman coding, Lempel-Ziv-Welch coding and Burrows-Wheeler Transform. Techniques such as Huffman coding, Lempel-Ziv-Welch coding and Burrows-Wheeler Transform are some of them. Which technique to use depends on the type and size of the data to be compressed. Huffman, Lempel-Ziv-Welch, Burrows-Wheeler Transform and Deflate algorithms are the most widely used techniques for text compression. Each algorithm uses different approaches and can produce different results in terms of compression ratios and performance. In this study, different data compression techniques were measured on specific data sets by using them individually and in pairs on top of each other. The most successful result was obtained with the Deflate algorithm when used alone and the achieved compression ratio was 29.08. When considered in the form of stacked pairs, the compression ratio of the Burrows-Wheeler Transform and Deflate gave the best result as 57.36. In addition, when compression is performed in pairs, which algorithm is applied first and which algorithm is applied afterwards can make a significant difference in the compression ratio. In this study, the performance measurements obtained by applying the algorithms in different orders are compared and suggestions are presented to obtain optimum performance
Publisher
Bitlis Eren Universitesi Fen Bilimleri Dergisi
Subject
Earth-Surface Processes
Reference32 articles.
1. [1] M. Ignatoski, J. Lerga, L. Stanković, and M. Daković, ‘Comparison of entropy and dictionary based text compression in English, German, French, Italian, Czech, Hungarian, Finnish, and Croatian’, Mathematics, vol. 8, no. 7, p. 1059, Jul. 2020, doi: 10.3390/MATH8071059. 2. [2] I. B. Ginzburg, S. N. Padalko, and M. N. Terentiev, ‘Short Message Compression Scheme for Wireless Sensor Networks’, Moscow Work. Electron. Netw. Technol. MWENT 2020 - Proc., Mar. 2020, doi: 10.1109/MWENT47943.2020.9067371. 3. [3] M. R. Hasan, ‘Data Compression using Huffman based LZW Encoding Technique’, Int. J. Sci. Eng. Res., vol. Volume 2, no. 11, pp. 1–7, 2011, Accessed: Mar. 20, 2023. [Online]. Available: http://www.ijser.org 4. [4] V. Ratnam Anappindi, ‘Issue 8 www.jetir.org (ISSN-2349-5162)’, JETIREZ06012 J. Emerg. Technol. Innov. Res., vol. 8, 2021, doi: 10.1109/EDSSC.2017.8126506.J. 5. [5] A. Habib, M. J. Islam, and M. S. Rahman, ‘A dictionary-based text compression technique using quaternary code’, Iran J. Comput. Sci., vol. 3, no. 3, pp. 127–136, Sep. 2020, doi: 10.1007/s42044-019-00047-w.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|