Integrating big data with KNIME as an alternative without programming code: an application to the PATSTAT patent database
Fernando H. Taques (),
Coro Chasco Yrigoyen and
Flávio H. Taques
Additional contact information
Fernando H. Taques: Universidad Autónoma de Madrid
Flávio H. Taques: Faculdades Metropolitanas Unidas
Journal of Geographical Systems, 2025, vol. 27, issue 1, No 4, 61 pages
Abstract:
Abstract Accessing massive datasets can be challenging for users unfamiliar with programming codes. Combining Konstanz Information Miner (KNIME) and MySQL tools on standard configuration equipment allows for addressing this issue. This research proposal aims to present a methodology that describes the necessary configuration steps in both tools and the required manipulation in KNIME to transmit the information to the MySQL environment for further processing in a database management system (DBMS). In addition, we propose a procedure so that the use of this point-and-click software in research work can gain in reproducibility and, therefore, in credibility in the scientific community. To achieve this, we will use a big database regarding patent applications as a reference, the PATSTAT Global 2023, provided by the European Patent Office (EPO). As well known, patent data can be a valuable source for understanding innovation dynamics and technological trends, whether for studies on companies, sectors, nations or even regions, at aggregated and disaggregated levels.
Keywords: Big data; EPO; KNIME; MySQL; Patent; PATSTAT (search for similar items in EconPapers)
JEL-codes: C80 C88 O30 (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
http://link.springer.com/10.1007/s10109-024-00445-0 Abstract (text/html)
Access to full text is restricted to subscribers.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:kap:jgeosy:v:27:y:2025:i:1:d:10.1007_s10109-024-00445-0
Ordering information: This journal article can be ordered from
http://www.springer. ... ce/journal/10109/PS2
DOI: 10.1007/s10109-024-00445-0
Access Statistics for this article
Journal of Geographical Systems is currently edited by Manfred M. Fischer and Antonio Páez
More articles in Journal of Geographical Systems from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().