Reliance on Science by Inventors: Hybrid Extraction of In-text Patent-to-Article Citations
Matt Marx and Aaron Fuegi
No 27987, NBER Working Papers from National Bureau of Economic Research, Inc
Abstract:
We curate and characterize a complete set of citations from patents to scientific articles, including nearly 16 million from the full text of USPTO and EPO patents. Combining heuristics and machine learning, we achieve 25% higher performance than machine learning alone. At 99.4% accuracy, coverage of 87.6% is achieved, and coverage above 90% with accuracy above 93%. Performance is evaluated with a set of 5,939 randomly-sampled, cross-verified “known good” citations, which the authors have never seen. We compare these “in-text” citations with the “official” citations on the front page of patents. In-text citations are more diverse temporally, geographically, and topically. They are less self-referential and less likely to be recycled from one patent to the next. That said, in-text citations have been overshadowed by front-page in the past few decades, dropping from 80% of all paper-to-patent citations to less than 40%. In replicating two published articles that use only citations on the front page of patents, we show that failing to capture those in the body text leads to understating the relationship between academic science and commercial invention. All patent-to-article citations, as well as the known-good test set, are available at http://relianceonscience.org.
JEL-codes: O31 O32 O33 O34
Date: 2020-10
New Economics Papers: this item is included in nep-big, nep-ino, nep-ipr, nep-sbm and nep-tid
Note: PR
Published as Matt Marx & Aaron Fuegi, 2022. "Reliance on science by inventors: Hybrid extraction of in-text patent-to-article citations," Journal of Economics & Management Strategy, vol. 31(2), pages 369-392.
Downloads:
http://www.nber.org/papers/w27987.pdf (application/pdf)
Persistent link: https://EconPapers.repec.org/RePEc:nbr:nberwo:27987
Ordering information: This working paper can be ordered from
http://www.nber.org/papers/w27987