The ASR Post-Processor Performance Challenges of BackTranScription (BTS): Data-Centric and Model-Centric Approaches

Park, Chanjun; Seo, Jaehyung; Lee, Seolhwa; Lee, Chanhee; Lim, Heuiseok

Chanjun Park, Jaehyung Seo, Seolhwa Lee, Chanhee Lee and Heuiseok Lim
Additional contact information
Chanjun Park: Department of Computer Science and Engineering, Korea University, Seoul 02841, Korea
Jaehyung Seo: Department of Computer Science and Engineering, Korea University, Seoul 02841, Korea
Seolhwa Lee: Department of Computer Science, University of Copenhagen, DK-2100 Copenhagen, Denmark
Chanhee Lee: Naver Corporation, Seongnam 13561, Korea
Heuiseok Lim: Department of Computer Science and Engineering, Korea University, Seoul 02841, Korea

Mathematics, 2022, vol. 10, issue 19, 1-8

Abstract: Training a sequence-to-sequence (S2S) automatic speech recognition (ASR) post-processor requires a dataset of parallel pairs (e.g., a speech recognition result and its human post-edited sentence), which demands a great amount of human labor. BackTranScription (BTS) is a data-building method proposed to mitigate this limitation of existing S2S-based ASR post-processors: it automatically generates large amounts of training data, reducing the time and cost of data construction. Despite the emergence of this novel approach, the BTS-based ASR post-processor still poses research challenges and remains largely untested across diverse approaches. In this study, we highlight these challenges through detailed experiments that analyze the data-centric approach (i.e., controlling the amount of data without model alteration) and the model-centric approach (i.e., model modification). In other words, we point out problems with the current research trend of pursuing a model-centric approach and caution against ignoring the importance of data. Our experimental results show that the data-centric approach outperformed the model-centric approach by +11.69, +17.64, and +19.02 in F1-score, BLEU, and GLEU, respectively.

Keywords: backtranscription; machine translation; data-centric; model-centric; automatic speech recognition; post-processor
JEL-codes: C
Date: 2022

Downloads: (external link)
https://www.mdpi.com/2227-7390/10/19/3618/pdf (application/pdf)
https://www.mdpi.com/2227-7390/10/19/3618/ (text/html)

Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:10:y:2022:i:19:p:3618-:d:932368

Mathematics is currently edited by Ms. Emma He