Determining Thresholds for Optimal Adaptive Discrete Cosine Transformation

Author:

Khanov Alexander1ORCID,Shulzhenko Anastasija2ORCID,Voroshilova Anzhelika3ORCID,Zubarev Alexander4ORCID,Karimov Timur1ORCID,Fahmi Shakeeb1ORCID

Affiliation:

1. Computer-Aided Design Department, St. Petersburg Electrotechnical University “LETI”, 5 Professora Popova St., 197022 Saint Petersburg, Russia

2. Information Security Department, St. Petersburg Electrotechnical University “LETI”, 5 Professora Popova St., 197022 Saint Petersburg, Russia

3. School of Public Administration and Entrepreneurship, Institute of Economics and Management, Ural Federal University Named after the First President of Russia B.N.Yeltsin, 51 Lenina Ave., 620075 Yekaterinburg, Russia

4. Department of Electrical Engineering, St. Petersburg Electrotechnical University “LETI”, 5 Professora Popova St., 197022 Saint Petersburg, Russia

Abstract

The discrete cosine transform (DCT) is widely used for image and video compression. Lossy algorithms such as JPEG, WebP, BPG and many others are based on it. Multiple modifications of DCT have been developed to improve its performance. One of them is adaptive DCT (ADCT) designed to deal with heterogeneous image structure and it may be found, for example, in the HEVC video codec. Adaptivity means that the image is divided into an uneven grid of squares: smaller ones retain information about details better, while larger squares are efficient for homogeneous backgrounds. The practical use of adaptive DCT algorithms is complicated by the lack of optimal threshold search algorithms for image partitioning procedures. In this paper, we propose a novel method for optimal threshold search in ADCT using a metric based on tonal distribution. We define two thresholds: pm, the threshold defining solid mean coloring, and ps, defining the quadtree fragment splitting. In our algorithm, the values of these thresholds are calculated via polynomial functions of the tonal distribution of a particular image or fragment. The polynomial coefficients are determined using the dedicated optimization procedure on the dataset containing images from the specific domain, urban road scenes in our case. In the experimental part of the study, we show that ADCT allows a higher compression ratio compared to non-adaptive DCT at the same level of quality loss, up to 66% for acceptable quality. The proposed algorithm may be used directly for image compression, or as a core of video compression framework in traffic-demanding applications, such as urban video surveillance systems.

Funder

Russian Science Foundation

Publisher

MDPI AG

Reference26 articles.

1. Bischoff, P. (2024, May 13). WebP, Surveillance Camera Statistics: Which Are the Most Surveilled Cities?. Available online: https://www.comparitech.com/vpn-privacy/the-worlds-most-surveilled-cities/.

2. The JPEG algorithm for image compression standard;Wallace;Commun. ACM,1991

3. JPEG (2024, March 03). WebP, ITU-T T.800 (T.JPEG2000). Available online: https://handle.itu.int/11.1002/1000/15939.

4. Melenchón Maldonado, J. (2024, March 16). WebP, a New Web Oriented Image Format. Available online: http://mosaic.uoc.edu/2010/11/18/webp-a-new-weboriented-image-format-english-version/.

5. Si, Z., and Shen, K. (2016). Research on the WebP image format. Advanced Graphic Communications, Packaging Technology and Materials, Springer.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3