Analyzing Business Conditions by Quantitative Text Analysis–Time Series Analysis Using Appearance Rate and Principal Component
Nariyasu Yamazawa
ESRI Discussion paper series from Economic and Social Research Institute (ESRI)
Abstract:
We present a procedure for analyzing the current business conditions and forecasting GDP growth rate by quantitative text analysis. We use text data of Economy Watcher Survey conducted by Cabinet Office. We extract words from 190 thousands sentence, and construct time series data by counting appearance rate every month. The analyses consist of four parts: (1) visualizing appearance rate by drawing graphs, (2) correlation analysis, (3) principal component analysis, and (4) forecasting GDP growth rate. First, we draw graphs of the appearance rate of words which are influenced by business conditions. We find that the graphs show the effect of policy on business conditions clearly. Second, we construct word lists which correlate business conditions by computing correlation coefficients. And we also construct lists which reversely correlate business conditions. Third, we extract principal component from 150 frequent words. We find that the 1st principal component move together with business conditions. The last, we forecast quarterly real GDP growth rate by text data. We find that forecast accuracy improved by adding the text data. It shows that text data have useful information about GDP forecasting.
Pages: 32 pages
Date: 2018-03
New Economics Papers: this item is included in nep-big and nep-for
References: Add references at CitEc
Citations:
Downloads: (external link)
http://www.esri.go.jp/jp/archive/e_dis/e_dis345/e_dis345.pdf (application/pdf)
Our link check indicates that this URL is bad, the error code is: 500 Can't connect to www.esri.go.jp:80 (No such host is known. )
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:esj:esridp:345
Access Statistics for this paper
More papers in ESRI Discussion paper series from Economic and Social Research Institute (ESRI) Contact information at EDIRC.
Bibliographic data for series maintained by HORI nobuko ( this e-mail address is bad, please contact ).