Text Algorithms in Economics
Elliott Ash and
Stephen Hansen
Annual Review of Economics, 2023, vol. 15, issue 1, 659-688
Abstract:
This article provides an overview of the methods used for algorithmic text analysis in economics, with a focus on three key contributions. First, we introduce methods for representing documents as high-dimensional count vectors over vocabulary terms, for representing words as vectors, and for representing word sequences as embedding vectors. Second, we define four core empirical tasks that encompass most text-as-data research in economics and enumerate the various approaches that have been taken so far to accomplish these tasks. Finally, we flag limitations in the current literature, with a focus on the challenge of validating algorithmic output.
Keywords: text as data; topic models; word embeddings; large language models; transformer models (search for similar items in EconPapers)
JEL-codes: C18 C45 C55 (search for similar items in EconPapers)
Date: 2023
References: Add references at CitEc
Citations: View citations in EconPapers (14)
Downloads: (external link)
https://doi.org/10.1146/annurev-economics-082222-074352
Full text downloads are only available to subscribers. Visit the abstract page for more information.
Related works:
Working Paper: Text Algorithms in Economics (2023) 
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:anr:reveco:v:15:y:2023:p:659-688
Ordering information: This journal article can be ordered from
http://www.annualreviews.org/action/ecommerce
DOI: 10.1146/annurev-economics-082222-074352
Access Statistics for this article
More articles in Annual Review of Economics from Annual Reviews Annual Reviews 4139 El Camino Way Palo Alto, CA 94306, USA.
Bibliographic data for series maintained by http://www.annualreviews.org ().