Fad or future? Automated analysis of financial text and its implications for corporate reporting
Craig Lewis and
Steven Young
Accounting and Business Research, 2019, vol. 49, issue 5, 587-615
Abstract:
This paper describes the current state of natural language processing (NLP) as it applies to corporate reporting. We document dramatic increases in the quantity of verbal content that is an integral part of company reporting packages, as well as the evolution of text analytic approaches being employed to analyse this content. We provide intuitive descriptions of the leading analytic approaches applied in the academic accounting and finance literatures. This discussion includes key word searches and counts, attribute dictionaries, naïve Bayesian classification, cosine similarity, and latent Dirichlet allocation. We also discuss how increasing interest in NLP processing of the corporate reporting package could and should influence financial reporting regulation and note that textual analysis is currently more of an afterthought, if it is even considered. Opportunities for improving the usefulness of NLP processing are discussed, as well as possible impediments.
Date: 2019
References: View complete reference list from CitEc
Citations: View citations in EconPapers (17)
Downloads: (external link)
http://hdl.handle.net/10.1080/00014788.2019.1611730 (text/html)
Access to full text is restricted to subscribers.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:taf:acctbr:v:49:y:2019:i:5:p:587-615
Ordering information: This journal article can be ordered from
http://www.tandfonline.com/pricing/journal/RABR20
DOI: 10.1080/00014788.2019.1611730
Access Statistics for this article
Accounting and Business Research is currently edited by Vivien Beattie
More articles in Accounting and Business Research from Taylor & Francis Journals
Bibliographic data for series maintained by Chris Longhurst ().