Digital Audio Tampering Detection Based on Deep Temporal–Spatial Features of Electrical Network Frequency-Reference-Cited by-同舟云学术

Digital Audio Tampering Detection Based on Deep Temporal–Spatial Features of Electrical Network Frequency

Published:2023-04-22 Issue:5 Volume:14 Page:253
ISSN:2078-2489
Container-title:Information
language:en
Short-container-title:Information

Author:

Zeng Chunyan¹^ORCID,Kong Shuai¹,Wang Zhifeng²^ORCID,Li Kun¹,Zhao Yuhao¹

Affiliation:

1. Hubei Key Laboratory for High-Efficiency Utilization of Solar Energy and Operation Control of Energy Storage System, Hubei University of Technology, Wuhan 430068, China

2. Department of Digital Media Technology, Central China Normal University, Wuhan 430079, China

Abstract

In recent years, digital audio tampering detection methods by extracting audio electrical network frequency (ENF) features have been widely applied. However, most digital audio tampering detection methods based on ENF have the problems of focusing on spatial features only, without effective representation of temporal features, and do not fully exploit the effective information in the shallow ENF features, which leads to low accuracy of audio tamper detection. Therefore, this paper proposes a new method for digital audio tampering detection based on the deep temporal–spatial feature of ENF. To extract the temporal and spatial features of the ENF, firstly, a highly accurate ENF phase sequence is extracted using the first-order Discrete Fourier Transform (DFT), and secondly, different frame processing methods are used to extract the ENF shallow temporal and spatial features for the temporal and spatial information contained in the ENF phase. To fully exploit the effective information in the shallow ENF features, we construct a parallel RDTCN-CNN network model to extract the deep temporal and spatial information by using the processing ability of Residual Dense Temporal Convolutional Network (RDTCN) and Convolutional Neural Network (CNN) for temporal and spatial information, and use the branch attention mechanism to adaptively assign weights to the deep temporal and spatial features to obtain the temporal–spatial feature with greater representational capacity, and finally, adjudicate whether the audio is tampered with by the MLP network. The experimental results show that the method in this paper outperforms the four baseline methods in terms of accuracy and F1-score.

Publisher

MDPI AG

Subject

Information Systems

Link

https://www.mdpi.com/2078-2489/14/5/253/pdf

Reference52 articles.

1. Liu, Z., and Lu, W. (2017, January 26–29). Fast Copy-Move Detection of Digital Audio. Proceedings of the 2017 IEEE Second International Conference on Data Science in Cyberspace (DSC), Shenzhen, China.

2. An End-to-End Deep Source Recording Device Identification System for Web Media Forensics;Zeng;Int. J. Web Inf. Syst.,2020

3. Detection of Speech Smoothing on Very Short Clip;Yan;IEEE Trans. Inf. Forensics Secur.,2019

4. Shallow and Deep Feature Fusion for Digital Audio Tampering Detection;Wang;EURASIP J. Adv. Signal Process.,2022

5. Audio Tampering Forensics Based on Representation Learning of ENF Phase Sequence;Zeng;Int. J. Digit. Crime Forensics,2022

Cited by 9 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Squeeze-and-Excitation Self-Attention Mechanism Enhanced Digital Audio Source Recognition Based on Transfer Learning;Circuits, Systems, and Signal Processing;2024-09-13

2. ENFformer: Long-short term representation of electric network frequency for digital audio tampering detection;Knowledge-Based Systems;2024-08

3. Discriminative Component Analysis Enhanced Feature Fusion of Electrical Network Frequency for Digital Audio Tampering Detection;Circuits, Systems, and Signal Processing;2024-07-26

4. 1D-CNN-based audio tampering detection using ENF signals;Scientific Reports;2024-05-16

5. Imperceptible and Reversible Acoustic Watermarking Based on Modified Integer Discrete Cosine Transform Coefficient Expansion;Applied Sciences;2024-03-25