Exploiting word embedding for heterogeneous topic model towards patent recommendation
Jie Chen (chenjie200398@163.com),
Jialin Chen (cjialin2019@outlook.com),
Shu Zhao (zhaoshuzs2002@hotmail.com),
Yanping Zhang (zhangyp2@gmail.com) and
Jie Tang (jietang@tsinghua.edu.cn)
Additional contact information
Jie Chen: Ministry of Education
Jialin Chen: Ministry of Education
Shu Zhao: Ministry of Education
Yanping Zhang: Ministry of Education
Jie Tang: Tsinghua University
Scientometrics, 2020, vol. 125, issue 3, No 11, 2108 pages
Abstract:
Abstract Patent recommendation aims to recommend patent documents that have similar content to a given target patent. With the explosive growth in patent applications, how to recommend relevant patents from the massive number of patents has become an extremely challenging problem. The main obstacle in patent recommendation is how to distinguish the meanings of the same word in different contexts or associate multiple words that express the same meaning. In this paper, we propose a Heterogeneous Topic model exploiting Word embedding to enhance word semantics (HTW). First, we model the relationship among text, inventors, and applicants around the topic to build a heterogeneous topic model and learn the patent feature representation to capture contextual word semantics. Second, a word embedding is constructed to extract the deep semantics for associating multiple words that express the same meaning. Finally, with words as connections, the mapping from patent feature representations to patent embedding is established through a matrix operation, which integrates the information between the word embedding and patent feature representation. HTW considers the heterogeneity of patents and enhances the distinction or association among words simultaneously. The experimental results on real-world datasets show that HTW exceeds typical keyword-based methods, topic models, and embedding models on patent recommendations.
Keywords: Patent recommendation; Heterogeneous topic model; Word embedding (search for similar items in EconPapers)
Date: 2020
References: Add references at CitEc
Citations: View citations in EconPapers (7)
Downloads: (external link)
http://link.springer.com/10.1007/s11192-020-03666-4 Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:scient:v:125:y:2020:i:3:d:10.1007_s11192-020-03666-4
Ordering information: This journal article can be ordered from
http://www.springer.com/economics/journal/11192
DOI: 10.1007/s11192-020-03666-4
Access Statistics for this article
Scientometrics is currently edited by Wolfgang Glänzel
More articles in Scientometrics from Springer, Akadémiai Kiadó
Bibliographic data for series maintained by Sonal Shukla (sonal.shukla@springer.com) and Springer Nature Abstracting and Indexing (indexing@springernature.com).