A flexible, scaleable approach to the international patent 'name game'
Amma Serwaah and
Georg Zachmann ()
Working Papers from Bruegel
The inventors in PATSTAT are often duplicates- the same person or company may be split into multiple entries in PATSTAT, each associated to different patents. In this paper, we address this problem with an algorithm that efficiently de-duplicates the data. It needs minimal manual input and works well even on consumer-grade computers. Comparisons between entries are not limited to their names, and thus this algorithm is an improvement over earlier...
New Economics Papers: this item is included in nep-gth, nep-ino, nep-ipr and nep-pr~
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (1) Track citations by RSS feed
Downloads: (external link)
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
Persistent link: https://EconPapers.repec.org/RePEc:bre:wpaper:850
Access Statistics for this paper
More papers in Working Papers from Bruegel Contact information at EDIRC.
Bibliographic data for series maintained by Bruegel ().