Audio-as-Data Tools: Replicating Computational Data Processing

Lukito, Josephine; Greenfield, Jason; Yang, Yunkang; Dahlke, Ross; Brown, Megan A.; Lewis, Rebecca; Chen, Bin

Audio-as-Data Tools: Replicating Computational Data Processing

Josephine Lukito, Jason Greenfield, Yunkang Yang, Ross Dahlke, Megan A. Brown, Rebecca Lewis and Bin Chen
Additional contact information
Josephine Lukito: School of Journalism and Media, University of Texas at Austin, USA
Jason Greenfield: Center for Social Media and Politics, New York University, USA
Yunkang Yang: Department of Communication & Journalism, Texas A&M University, USA
Ross Dahlke: Department of Communication, Stanford University, USA
Megan A. Brown: School of Information, University of Michigan, USA
Rebecca Lewis: Department of Communication, Stanford University, USA
Bin Chen: School of Journalism and Media, University of Texas at Austin, USA / Journalism and Media Studies Centre, University of Hong Kong

Media and Communication, 2024, vol. 12

Abstract: The rise of audio-as-data in social science research accentuates a fundamental challenge: establishing reproducible and reliable methodologies to guide this emerging area of study. In this study, we focus on the reproducibility of audio-as-data preparation methods in computational communication research and evaluate the accuracy of popular audio-as-data tools. We analyze automated transcription and computational phonology tools applied to 200 episodes of conservative talk shows hosted by Rush Limbaugh and Alex Jones. Our findings reveal that the tools we tested are highly accurate. However, despite different transcription and audio signal processing tools yield similar results, subtle yet significant variations could impact the findings’ reproducibility. Specifically, we find that discrepancies in automated transcriptions and auditory features such as pitch and intensity underscore the need for meticulous reproduction of data preparation procedures. These insights into the variability introduced by different tools stress the importance of detailed methodological reporting and consistent processing techniques to ensure the replicability of research outcomes. Our study contributes to the broader discourse on replicability and reproducibility by highlighting the nuances of audio data preparation and advocating for more transparent and standardized practices in this area.

Keywords: audio-as-data; computational methods; conservative talk shows; data processing; reproduction; talk radio (search for similar items in EconPapers)
Date: 2024
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (1)

Downloads: (external link)
https://www.cogitatiopress.com/mediaandcommunication/article/view/7851 (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:cog:meanco:v12:y:2024:a:7851

DOI: 10.17645/mac.7851

Access Statistics for this article

Media and Communication is currently edited by Raquel Silva

More articles in Media and Communication from Cogitatio Press
Bibliographic data for series maintained by António Vieira () and IT Department ().