Imperceptible and Reversible Acoustic Watermarking Based on Modified Integer Discrete Cosine Transform Coefficient Expansion-Reference-Cited by-同舟云学术

Imperceptible and Reversible Acoustic Watermarking Based on Modified Integer Discrete Cosine Transform Coefficient Expansion

Published:2024-03-25 Issue:7 Volume:14 Page:2757
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Huang Xuping¹²^ORCID,Ito Akinori¹^ORCID

Affiliation:

1. Department of Communications Engineering, Graduate School of Engineering, Tohoku University, Sendai 980-8577, Japan

2. Interdisciplinary Faculty of Science and Engineering, Shimane University, Matsue 690-8504, Japan

Abstract

This paper aims to explore an alternative reversible digital watermarking solution to guarantee the integrity of and detect tampering with data of probative importance. Since the payload for verification is embedded in the contents, algorithms for reversible embedding and extraction, imperceptibility, payload capacity, and computational time are issues to evaluate. Thus, we propose a reversible and imperceptible audio information-hiding algorithm based on modified integer discrete cosine transform (intDCT) coefficient expansion. In this work, the original signal is segmented into fixed-length frames, and then intDCT is applied to each frame to transform signals from the time domain into integer DCT coefficients. Expansion is applied to DCT coefficients at a higher frequency to reserve hiding capacity. Objective evaluation of speech quality is conducted using listening quality objective mean opinion (MOS-LQO) and the segmental signal-to-noise ratio (segSNR). The audio quality of different frame lengths and capacities is evaluated. Averages of 4.41 for MOS-LQO and 23.314 [dB] for segSNR for 112 ITU-T test signals were obtained with a capacity of 8000 bps, which assured imperceptibility with the sufficient capacity of the proposed method. This shows comparable audio quality to conventional work based on Linear Predictive Coding (LPC) regarding MOS-LQO. However, all segSNR scores of the proposed method have comparable or better performance in the time domain. Additionally, comparing histograms of the normalized maximum absolute value of stego data shows a lower possibility of overflow than the LPC method. A computational cost, including hiding and transforming, is an average of 4.884 s to process a 10 s audio clip. Blind tampering detection without the original data is achieved by the proposed embedding and extraction method.

Funder

Japan Society for the Promotion of Science

Publisher

MDPI AG

Link

https://www.mdpi.com/2076-3417/14/7/2757/pdf

Reference50 articles.

1. Bourouis, S., Alroobaea, R., Alharbi, A.M., Andejany, M., and Rubaiee, S. (2020). Recent advances in digital multimedia tampering detection for forensics analysis. Symmetry, 12.

2. Recent advances in digital image manipulation detection techniques: A brief review;Thakur;Forensic Sci. Int.,2020

3. Digital video tampering detection: An overview of passive techniques;Sitara;Digit. Investig.,2016

4. Echizen, I., Yamada, T., Tezuka, S., Singh, S., and Yoshiura, H. (2006, January 18–20). Improved video verification method using digital watermarking. Proceedings of the International Conference of Intelligent Information Hiding and Multimedia Signal Processing, Pasadena, CA, USA.

5. A semi-fragile watermarking tamper localization method based on QDFT and multi-view fusion;Ouyang;Multimed. Tools Appl.,2023