Classifying Firms with Text Mining
Giacomo Caterini ()
No 2018/09, DEM Working Papers from Department of Economics and Management
Abstract:
Statistics on the births, deaths and survival rates of firms are crucial pieces of information, as they enter as an input in the computation of GDP, the identification of each sector’s contribution to the economy, and the assessment of gross job creation and destruction rates. Official statistics on firm demography are made available only several months after data collection and storage, however. Furthermore, unprocessed and untimely administrative data can lead to a misrepresentation of the life-cycle stage of a firm. In this paper we implement an automated version of Eurostat’s algorithm aimed at distinguishing true startup endeavors from the resurrection of pre-existing but apparently defunct firms. The potential gains from combining machine learning, natural language processing and econometric tools for pre- processing and analyzing granular data are exposed, and a machine learning method predicting reactivations of deceptively dead firms is proposed.
Keywords: Business Demography; Classification; Text Mining (search for similar items in EconPapers)
JEL-codes: C01 C52 C53 C55 C80 G33 L11 L25 L26 M13 R11 (search for similar items in EconPapers)
Date: 2018
New Economics Papers: this item is included in nep-bec, nep-big, nep-cmp and nep-sbm
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
https://www.economia.unitn.it/alfresco/download/wo ... ecc44/DEM2018_09.pdf (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:trn:utwprg:2018/09
Access Statistics for this paper
More papers in DEM Working Papers from Department of Economics and Management Contact information at EDIRC.
Bibliographic data for series maintained by roberto.gabriele@unitn.it ().