Using Text Data to Improve Industrial Statistics in the UK
Alex Bishop,
Juan Mateos-Garcia () and
George Richardson
Economic Statistics Centre of Excellence (ESCoE) Discussion Papers from Economic Statistics Centre of Excellence (ESCoE)
Abstract:
We use business website data to explore the limitations of the Standard Industrial Classification taxonomy and develop a prototype for a bottom-up industrial taxonomy based on semantic similarities between company descriptions. This prototype makes it possible to decompose uninformative SIC codes into granular industries, build user-driven industry groups which might be of interest to policymakers (e.g. 'green economy') and build indices of local economic composition that are more strongly associated with local economic performance than those based on the SIC taxonomy. We consider potential avenues to combine official and bottom-up taxonomies in order to improve our understanding the economy and inform economic policy.
Keywords: emerging industries; industrial policy; industrial taxonomy; machine learning; web data (search for similar items in EconPapers)
JEL-codes: C81 L52 R12 (search for similar items in EconPapers)
Date: 2022-01
New Economics Papers: this item is included in nep-big, nep-dem and nep-geo
References: View complete reference list from CitEc
Citations: View citations in EconPapers (4)
Downloads: (external link)
https://escoe-website.s3.amazonaws.com/wp-content/ ... 841/DP-2022-01-1.pdf
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:nsr:escoed:escoe-dp-2022-01
Access Statistics for this paper
More papers in Economic Statistics Centre of Excellence (ESCoE) Discussion Papers from Economic Statistics Centre of Excellence (ESCoE) King's College London Strand London WC2R 2LS. Contact information at EDIRC.
Bibliographic data for series maintained by ESCoE Centre Manager ().