Research on cross-lingual multi-label patent classification based on pre-trained model
Yonghe Lu (),
Lehua Chen (),
Xinyu Tong (),
Yongxin Peng () and
Hou Zhu ()
Additional contact information
Yonghe Lu: Sun Yat-sen University
Lehua Chen: Sun Yat-sen University
Xinyu Tong: Sun Yat-sen University
Yongxin Peng: Sun Yat-sen University
Hou Zhu: Sun Yat-sen University
Scientometrics, 2024, vol. 129, issue 6, No 6, 3067-3087
Abstract:
Abstract Patent classification is an important part of the patent examination and management process. Using efficient and accurate automatic patent classification can significantly improve patent retrieval performance. Current monolingual patent classification models, on the other hand, are insufficient for cross-lingual patent tasks. Therefore, research into cross-lingual patent categorization is crucial. In this paper, we proposed a cross-lingual patent classification model based on the pre-trained model named XLM-R–CNN. Besides, we constructed a large patent dataset called XLPatent including Chinese, English, and German. We conducted experiments to evaluate model performance with several metrics. The experimental results showed that XLM-R–CNN achieved a classification accuracy of 73% and average precision of 94%.
Keywords: Patent classification; Cross-lingual text embedding; Pre-trained model (search for similar items in EconPapers)
Date: 2024
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
http://link.springer.com/10.1007/s11192-024-05024-0 Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:scient:v:129:y:2024:i:6:d:10.1007_s11192-024-05024-0
Ordering information: This journal article can be ordered from
http://www.springer.com/economics/journal/11192
DOI: 10.1007/s11192-024-05024-0
Access Statistics for this article
Scientometrics is currently edited by Wolfgang Glänzel
More articles in Scientometrics from Springer, Akadémiai Kiadó
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().