EconPapers    
Economics at your fingertips  
 

Classifying Patents Based on their Semantic Content

Antonin Bergeaud (), Yoann Potiron and Juste Raimbault

Working papers from Banque de France

Abstract: In this paper, we extend some usual techniques of classification resulting from a largescale data-mining and network approach. This new technology, which in particular is designed to be suitable to big data, is used to construct an open consolidated database from raw data on 4 million patents taken from the US patent office from 1976 onward. To build the pattern network, not only do we look at each patent title, but we also examine their full abstract and extract the relevant keywords accordingly. We refer to this classification as semantic approach in contrast with the more common technological approach which consists in taking the topology when considering US Patent office technological classes. Moreover, we document that both approaches have highly different topological measures and strong statistical evidence that they feature a different model. This suggests that our method is a useful tool to extract endogenous information.

Keywords: Patents; Semantic Analysis; Network; Modularity; Innovation; USPTO (search for similar items in EconPapers)
JEL-codes: O3 O39 (search for similar items in EconPapers)
New Economics Papers: this item is included in nep-big, nep-ino and nep-ipr
Date: 2018
References: View references in EconPapers View complete reference list from CitEc
Citations Track citations by RSS feed

Downloads: (external link)
https://publications.banque-france.fr/sites/defaul ... /documents/wp685.pdf (application/pdf)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:bfr:banfra:685

Access Statistics for this paper

More papers in Working papers from Banque de France Banque de France 31 Rue Croix des Petits Champs LABOLOG - 49-1404 75049 PARIS. Contact information at EDIRC.
Bibliographic data for series maintained by Michael brassart ().

 
Page updated 2018-11-11
Handle: RePEc:bfr:banfra:685