Author:
Zeng Chunyan,Zhu Dongliang,Wang Zhifeng,Wang Zhenghui,Zhao Nan,He Lu
Abstract
Purpose
Most source recording device identification models for Web media forensics are based on a single feature to complete the identification task and often have the disadvantages of long time and poor accuracy. The purpose of this paper is to propose a new method for end-to-end network source identification of multi-feature fusion devices.
Design/methodology/approach
This paper proposes an efficient multi-feature fusion source recording device identification method based on end-to-end and attention mechanism, so as to achieve efficient and convenient identification of recording devices of Web media forensics.
Findings
The authors conducted sufficient experiments to prove the effectiveness of the models that they have proposed. The experiments show that the end-to-end system is improved by 7.1% compared to the baseline i-vector system, compared to the authors’ previous system, the accuracy is improved by 0.4%, and the training time is reduced by 50%.
Research limitations/implications
With the development of Web media forensics and internet technology, the use of Web media as evidence is increasing. Among them, it is particularly important to study the authenticity and accuracy of Web media audio.
Originality/value
This paper aims to promote the development of source recording device identification and provide effective technology for Web media forensics and judicial record evidence that need to apply device source identification technology.
Subject
Computer Networks and Communications,Information Systems
Reference28 articles.
1. Regularized nonlinear discriminant analysis – an approach to robust dimensionality reduction for data visualization,2017
2. Support vector machines using GMM supervectors for speaker verification;IEEE Signal Processing Letters,2006
3. Front-end factor analysis for speaker verification;IEEE Transactions on Audio, Speech, and Language Processing,2011
4. Neural turing machines;Computer Science,2014
5. Hanilçi, C. and Kinnunen, T. (2014), “Source cell-phone recognition from recorded speech using non-speech segments”, Digital Signal Processing, 35, pp. 75-85, available at: www.researchgate.net/publication/265338172_Source_cell-phone_recognition_from_recorded_speech_using_non-speech_segments
Cited by
19 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献