EconPapers    
Economics at your fingertips  
 

The ASR Post-Processor Performance Challenges of BackTranScription (BTS): Data-Centric and Model-Centric Approaches

Chanjun Park, Jaehyung Seo, Seolhwa Lee, Chanhee Lee and Heuiseok Lim ()
Additional contact information
Chanjun Park: Department of Computer Science and Engineering, Korea University, Seoul 02841, Korea
Jaehyung Seo: Department of Computer Science and Engineering, Korea University, Seoul 02841, Korea
Seolhwa Lee: Department of Computer Science, University of Copenhagen, DK-2100 Copenhagen, Denmark
Chanhee Lee: Naver Corporation, Seongnam 13561, Korea
Heuiseok Lim: Department of Computer Science and Engineering, Korea University, Seoul 02841, Korea

Mathematics, 2022, vol. 10, issue 19, 1-8

Abstract: Training an automatic speech recognition (ASR) post-processor based on sequence-to-sequence (S2S) requires a parallel pair (e.g., speech recognition result and human post-edited sentence) to construct the dataset, which demands a great amount of human labor. BackTransScription (BTS) proposes a data-building method to mitigate the limitations of the existing S2S based ASR post-processors, which can automatically generate vast amounts of training datasets, reducing time and cost in data construction. Despite the emergence of this novel approach, the BTS-based ASR post-processor still has research challenges and is mostly untested in diverse approaches. In this study, we highlight these challenges through detailed experiments by analyzing the data-centric approach (i.e., controlling the amount of data without model alteration) and the model-centric approach (i.e., model modification). In other words, we attempt to point out problems with the current trend of research pursuing a model-centric approach and alert against ignoring the importance of the data. Our experiment results show that the data-centric approach outperformed the model-centric approach by +11.69, +17.64, and +19.02 in the F1-score, BLEU, and GLEU tests.

Keywords: backtranscription; machine translation; data-centric; model-centric; automatic speech recognition; post-processor (search for similar items in EconPapers)
JEL-codes: C (search for similar items in EconPapers)
Date: 2022
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www.mdpi.com/2227-7390/10/19/3618/pdf (application/pdf)
https://www.mdpi.com/2227-7390/10/19/3618/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:10:y:2022:i:19:p:3618-:d:932368

Access Statistics for this article

Mathematics is currently edited by Ms. Emma He

More articles in Mathematics from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().

 
Page updated 2025-03-19
Handle: RePEc:gam:jmathe:v:10:y:2022:i:19:p:3618-:d:932368