EconPapers    
Economics at your fingertips  
 

Automated Topic Analysis with Large Language Models

Andrei Kirilenko () and Svetlana Stepchenkova
Additional contact information
Andrei Kirilenko: University of Florida
Svetlana Stepchenkova: University of Florida

A chapter in Information and Communication Technologies in Tourism 2024, 2024, pp 29-34 from Springer

Abstract: Abstract Topic modeling is a popular method in tourism data analysis. Many authors have applied various approaches to summarize the main themes of travel blogs, reviews, video diaries, and similar media. One common shortcoming of these methods is their severe limitation in working with short documents, such as blog readers’ feedback (reactions). In the past few years, a new crop of large language models (LLMs), such as ChatGPT, has become available for researchers. We investigate LLM capability in extracting the main themes of viewers’ reactions to popular videos of a rural China destination that explores the cultural, technological, and natural heritage of the countryside. We compare the extracted topics and model accuracy with the results of the traditional Latent Dirichlet Allocation approach. Overall, LLM results are more accurate, specific, and better at separating discussion topics.

Keywords: Large Language Model (LLM); GPT-3; topic modeling; social media (search for similar items in EconPapers)
Date: 2024
References: Add references at CitEc
Citations:

There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:prbchp:978-3-031-58839-6_3

Ordering information: This item can be ordered from
http://www.springer.com/9783031588396

DOI: 10.1007/978-3-031-58839-6_3

Access Statistics for this chapter

More chapters in Springer Proceedings in Business and Economics from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2025-04-01
Handle: RePEc:spr:prbchp:978-3-031-58839-6_3