Ideology and Policy Preferences in Synthetic Data: The Potential of LLMs for Public Opinion Analysis
Keyeun Lee,
Jaehyuk Park,
Suh-hee Choi and
Changkeun Lee
Additional contact information
Keyeun Lee: Department of Communication, Seoul National University, South Korea
Jaehyuk Park: KDI School of Public Policy and Management, South Korea
Suh-hee Choi: Department of Geography, Kyung Hee University, South Korea
Changkeun Lee: KDI School of Public Policy and Management, South Korea
Media and Communication, 2025, vol. 13
Abstract:
This study investigates whether large language models (LLMs) can meaningfully extend or generate synthetic public opinion survey data on labor policy issues in South Korea. Unlike prior work conducted on people’s general sociocultural values or specific political topics such as voting intentions, our research examines policy preferences on tangible social and economic topics, offering deeper insights for news media and data analysts. In two key applications, we first explore whether LLMs can predict public sentiment on emerging or rapidly evolving issues using existing survey data. We then assess how LLMs generate synthetic datasets resembling real-world survey distributions. Our findings reveal that while LLMs capture demographic and ideological traits with reasonable accuracy, they tend to overemphasize ideological orientation for politically charged topics—a bias that is more pronounced in fully synthetic data, raising concerns about perpetuating societal stereotypes. Despite these challenges, LLMs hold promise for enhancing data-driven journalism and policy research, particularly in polarized societies. We call for further study into how LLM-based predictions align with human responses in diverse sociopolitical settings, alongside improved tools and guidelines to mitigate embedded biases.
Keywords: AI-generated text; ChatGPT; large language models; news media; policy preferences; public opinions (search for similar items in EconPapers)
Date: 2025
References: View complete reference list from CitEc
Citations:
Downloads: (external link)
https://www.cogitatiopress.com/mediaandcommunication/article/view/9677 (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:cog:meanco:v13:y:2025:a:9677
DOI: 10.17645/mac.9677
Access Statistics for this article
Media and Communication is currently edited by Raquel Silva
More articles in Media and Communication from Cogitatio Press
Bibliographic data for series maintained by António Vieira () and IT Department ().