EconPapers    
Economics at your fingertips  
 

Predicting innovative firms using web mining and deep learning

Jan Kinne () and David Lenz

No 19-001, ZEW Discussion Papers from ZEW - Leibniz Centre for European Economic Research

Abstract: Innovation is considered as a main driver of economic growth. Promoting the development of innovation through STI (science, technology and innovation) policies requires accurate indicators of innovation. Traditional indicators often lack coverage, granularity as well as timeliness and involve high data collection costs, especially when conducted at a large scale. In this paper, we propose a novel approach on how to create firm-level innovation indicators at the scale of millions of firms. We use traditional firm-level innovation indicators from the questionnaire-based Community Innovation Survey (CIS) survey to train an artificial neural network classification model on labelled (innovative/non-innovative) web texts of surveyed firms. Subsequently, we apply this classification model to the web texts of hundreds of thousands of firms in Germany to predict their innovation status. Our results show that this approach produces credible predictions and has the potential to be a valuable and highly cost-efficient addition to the existing set of innovation indicators, especially due to its coverage and regional granularity. The predicted firm-level probabilities can also directly be interpreted as a continuous measure of innovativeness, opening up additional advantages over traditional binary innovation indicators.

Keywords: Web Mining; Web Scraping; R&D; R&I; STI; Innovation; Indicators; Text Mining; Natural Language Processing; NLP; Deep Learning (search for similar items in EconPapers)
JEL-codes: C81 C83 O30 (search for similar items in EconPapers)
Date: 2019
New Economics Papers: this item is included in nep-bec, nep-big, nep-cmp, nep-ino, nep-knm, nep-sbm and nep-tid
References: Add references at CitEc
Citations: View citations in EconPapers (14)

Downloads: (external link)
https://www.econstor.eu/bitstream/10419/191615/1/1047440679.pdf (application/pdf)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:zbw:zewdip:19001

Access Statistics for this paper

More papers in ZEW Discussion Papers from ZEW - Leibniz Centre for European Economic Research Contact information at EDIRC.
Bibliographic data for series maintained by ZBW - Leibniz Information Centre for Economics ().

 
Page updated 2025-03-20
Handle: RePEc:zbw:zewdip:19001