How to kill inventors: testing the Massacrator© algorithm for inventor disambiguation

Michele Pezzoni, Francesco Lissoni and Gianluca Tarasconi

Post-Print from HAL

Abstract: Inventor disambiguation is an increasingly important issue for users of patent data. We propose and test a number of refinements to the Massacrator algorithm, originally proposed by Lissoni et al. (The KEINS database on academic inventors: methodology and contents, 2006) and now applied to APE-INV, a free-access database funded by the European Science Foundation. Following Raffo and Lhuillery (Res Policy 38:1617–1627, 2009), we describe disambiguation as a three-step process: cleaning and parsing, matching, and filtering. By means of sensitivity analysis based on Monte Carlo simulations, we show how various filtering criteria can be manipulated to obtain optimal combinations of precision and recall (type I and type II errors). We also show how these different combinations generate different results for applications to studies on inventors' productivity, mobility, and networking, and we discuss data quality problems related to linguistic issues. The filtering criteria based upon information on inventors' addresses are sensitive to data quality, while those based upon information on co-inventorship networks are always effective. Details on data access and data quality improvement via feedback collection are also discussed.
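
The three-step process named in the abstract (cleaning and parsing, matching, filtering) and the precision/recall trade-off it mentions can be illustrated with a toy sketch. This is not the Massacrator algorithm: the records, the two filtering criteria (same city, shared co-inventor), the scoring rule, and all function names below are hypothetical and chosen only to show how raising a filtering threshold may trade recall for precision.

```python
from itertools import combinations

# Hypothetical toy records (not from the paper):
# (record_id, raw_name, city, co_inventors, true_person_id)
RECORDS = [
    (1, "Rossi, Mario", "Milano", {"Bianchi"}, "A"),
    (2, "ROSSI  Mario", "Milano", {"Bianchi", "Verdi"}, "A"),
    (3, "Rossi, Mario", "Milano", {"Neri"}, "B"),
    (4, "Rossi, M.", "Milano", {"Verdi"}, "A"),
]


def clean(raw_name):
    """Step 1 (cleaning and parsing): lowercase, drop punctuation, tokenize."""
    stripped = "".join(ch if ch.isalnum() else " " for ch in raw_name.lower())
    return tuple(stripped.split())


def names_match(a, b):
    """Step 2 (matching): same surname and same first-name initial."""
    return a[0] == b[0] and a[1][0] == b[1][0]


def score(r1, r2):
    """Step 3 (filtering): count how many assumed criteria a pair satisfies."""
    same_city = r1[2] == r2[2]
    shared_coinventor = bool(r1[3] & r2[3])
    return int(same_city) + int(shared_coinventor)


def disambiguate(records, threshold):
    """Return the record-id pairs judged to refer to the same person."""
    accepted = set()
    for r1, r2 in combinations(records, 2):
        if names_match(clean(r1[1]), clean(r2[1])) and score(r1, r2) >= threshold:
            accepted.add((r1[0], r2[0]))
    return accepted


def precision_recall(records, accepted):
    """Compare accepted pairs with the true same-person pairs."""
    truth = {
        (r1[0], r2[0])
        for r1, r2 in combinations(records, 2)
        if r1[4] == r2[4]
    }
    true_positives = len(accepted & truth)
    precision = true_positives / len(accepted) if accepted else 1.0
    recall = true_positives / len(truth) if truth else 1.0
    return precision, recall


if __name__ == "__main__":
    for threshold in (1, 2):
        p, r = precision_recall(RECORDS, disambiguate(RECORDS, threshold))
        print(f"threshold={threshold} precision={p:.2f} recall={r:.2f}")
```

On this toy data, a threshold of 1 accepts every candidate pair (precision 0.50, recall 1.00), while a threshold of 2 accepts only pairs that also share a co-inventor (precision 1.00, recall 0.67). The paper's sensitivity analysis explores this kind of trade-off over many more criteria and real data.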

Keywords: Patent data; Inventors; Name disambiguation
Date: 2014
Citations: 41 (tracked in EconPapers)

Published in Scientometrics, 2014, 101 (1), pp.477-504. ⟨10.1007/s11192-014-1375-7⟩

Related works:
Journal Article: How to kill inventors: testing the Massacrator© algorithm for inventor disambiguation (2014)
Working Paper: How To Kill Inventors: Testing The Massacrator© Algorithm For Inventor Disambiguation (2012)

Persistent link: https://EconPapers.repec.org/RePEc:hal:journl:halshs-01074536

DOI: 10.1007/s11192-014-1375-7

Page updated 2025-03-22
Handle: RePEc:hal:journl:halshs-01074536