Affiliation:
1. Hubei Key Laboratory for High-Efficiency Utilization of Solar Energy and Operation Control of Energy Storage System, Hubei University of Technology, Wuhan 430068, China
2. Department of Digital Media Technology, Central China Normal University, Wuhan 430079, China
Abstract
In recent years, digital audio tampering detection methods by extracting audio electrical network frequency (ENF) features have been widely applied. However, most digital audio tampering detection methods based on ENF have the problems of focusing on spatial features only, without effective representation of temporal features, and do not fully exploit the effective information in the shallow ENF features, which leads to low accuracy of audio tamper detection. Therefore, this paper proposes a new method for digital audio tampering detection based on the deep temporal–spatial feature of ENF. To extract the temporal and spatial features of the ENF, firstly, a highly accurate ENF phase sequence is extracted using the first-order Discrete Fourier Transform (DFT), and secondly, different frame processing methods are used to extract the ENF shallow temporal and spatial features for the temporal and spatial information contained in the ENF phase. To fully exploit the effective information in the shallow ENF features, we construct a parallel RDTCN-CNN network model to extract the deep temporal and spatial information by using the processing ability of Residual Dense Temporal Convolutional Network (RDTCN) and Convolutional Neural Network (CNN) for temporal and spatial information, and use the branch attention mechanism to adaptively assign weights to the deep temporal and spatial features to obtain the temporal–spatial feature with greater representational capacity, and finally, adjudicate whether the audio is tampered with by the MLP network. The experimental results show that the method in this paper outperforms the four baseline methods in terms of accuracy and F1-score.
Reference52 articles.
1. Liu, Z., and Lu, W. (2017, January 26–29). Fast Copy-Move Detection of Digital Audio. Proceedings of the 2017 IEEE Second International Conference on Data Science in Cyberspace (DSC), Shenzhen, China.
2. An End-to-End Deep Source Recording Device Identification System for Web Media Forensics;Zeng;Int. J. Web Inf. Syst.,2020
3. Detection of Speech Smoothing on Very Short Clip;Yan;IEEE Trans. Inf. Forensics Secur.,2019
4. Shallow and Deep Feature Fusion for Digital Audio Tampering Detection;Wang;EURASIP J. Adv. Signal Process.,2022
5. Audio Tampering Forensics Based on Representation Learning of ENF Phase Sequence;Zeng;Int. J. Digit. Crime Forensics,2022
Cited by
9 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献