EconPapers    
Economics at your fingertips  
 

An End-to-End Transfer Learning Framework of Source Recording Device Identification for Audio Sustainable Security

Zhifeng Wang (), Jian Zhan (), Guozhong Zhang, Daliang Ouyang and Huaiyong Guo
Additional contact information
Zhifeng Wang: Department of Digital Media Technology, Central China Normal University, Wuhan 430079, China
Jian Zhan: Aerospace Science & Industry Shenzhen (Group) Co., Ltd., Shenzhen 518048, China
Guozhong Zhang: Aerospace Science & Industry Shenzhen (Group) Co., Ltd., Shenzhen 518048, China
Daliang Ouyang: Aerospace Science & Industry Shenzhen (Group) Co., Ltd., Shenzhen 518048, China
Huaiyong Guo: Aerospace Science & Industry Shenzhen (Group) Co., Ltd., Shenzhen 518048, China

Sustainability, 2023, vol. 15, issue 14, 1-22

Abstract: Source recording device identification poses a significant challenge in the field of Audio Sustainable Security (ASS). Most existing studies on end-to-end identification of digital audio sources follow a two-step process: extracting device-specific features and utilizing them in machine learning or deep learning models for decision-making. However, these approaches often rely on empirically set hyperparameters, limiting their generalization capabilities. To address this limitation, this paper leverages the self-learning ability of deep neural networks and the temporal characteristics of audio data. We propose a novel approach that utilizes the Sinc function for audio preprocessing and combine it with a Deep Neural Network (DNN) to establish a comprehensive end-to-end identification model for digital audio sources. By allowing the parameters of the preprocessing and feature extraction processes to be learned through gradient optimization, we enhance the model’s generalization. To overcome practical challenges such as limited timeliness, small sample sizes, and incremental expression, this paper explores the effectiveness of an end-to-end transfer learning model. Experimental verification demonstrates that the proposed end-to-end transfer learning model achieves both timely and accurate results, even with small sample sizes. Moreover, it avoids the need for retraining the model with a large number of samples due to incremental expression. Our experiments showcase the superiority of our method, achieving an impressive 97.7% accuracy when identifying 141 devices. This outperforms four state-of-the-art methods, demonstrating an absolute accuracy improvement of 4.1%. This research contributes to the field of ASS and provides valuable insights for future studies in audio source identification and related applications of information security, digital forensics, and copyright protection.

Keywords: Audio Sustainable Security; source recording device identification; deep neural network; end-to-end; transfer learning (search for similar items in EconPapers)
JEL-codes: O13 Q Q0 Q2 Q3 Q5 Q56 (search for similar items in EconPapers)
Date: 2023
References: View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www.mdpi.com/2071-1050/15/14/11272/pdf (application/pdf)
https://www.mdpi.com/2071-1050/15/14/11272/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jsusta:v:15:y:2023:i:14:p:11272-:d:1197818

Access Statistics for this article

Sustainability is currently edited by Ms. Alexandra Wu

More articles in Sustainability from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().

 
Page updated 2025-03-19
Handle: RePEc:gam:jsusta:v:15:y:2023:i:14:p:11272-:d:1197818