Multi-task learning model for citation intent classification in scientific publications
Ruihua Qi,
Jia Wei,
Zhen Shao,
Zhengguang Li,
Heng Chen,
Yunhao Sun and
Shaohua Li
Additional contact information
Ruihua Qi: Dalian University of Foreign Languages
Jia Wei: Dalian University of Foreign Languages
Zhen Shao: Dalian University of Foreign Languages
Zhengguang Li: Dalian University of Foreign Languages
Heng Chen: Dalian University of Foreign Languages
Yunhao Sun: Dalian University of Foreign Languages
Shaohua Li: Dalian University of Foreign Languages
Scientometrics, 2023, vol. 128, issue 12, No 4, 6335-6355
Abstract:
Citations play a significant role in the evaluation of scientific literature and researchers, and citation intent analysis is essential for understanding academic literature. The rapid growth of publicly accessible full-text literature makes it possible to enrich the semantic representations used for citation intent classification. However, some useful information that is readily available in the citation context and could facilitate citation intent analysis has not been fully explored. Furthermore, some deep learning models may not learn relevant features effectively because training samples for citation intent analysis are insufficient. Multi-task learning exploits useful information shared across multiple tasks to improve learning performance and has shown promising results on many natural language processing tasks. In this paper, we propose a joint semantic representation model that combines pretrained language models with heterogeneous features of citation intent texts. Considering the correlation among the citation intent, citation section, and citation worthiness classification tasks, we build a multi-task citation classification framework with a soft parameter sharing constraint and construct independent models for the multiple tasks to improve the performance of citation intent classification. The experimental results demonstrate that the heterogeneous features and the multi-task framework with the soft parameter sharing constraint enhance overall citation intent classification performance.
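The soft parameter sharing idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes each task keeps its own flat parameter vector and that the sharing constraint is a squared L2 penalty on pairwise parameter differences, which is one common formulation. The function names, the `strength` weight, and all numeric values are hypothetical.

```python
# Illustrative sketch only: soft parameter sharing between task-specific models.
# Function names, the penalty form, and all numbers are assumptions, not the paper's.

def l2_distance_sq(params_a, params_b):
    """Squared L2 distance between two equally sized flat parameter lists."""
    if len(params_a) != len(params_b):
        raise ValueError("parameter lists must have the same length")
    return sum((a - b) ** 2 for a, b in zip(params_a, params_b))


def soft_sharing_penalty(task_params):
    """Sum of squared distances over every pair of task parameter vectors."""
    names = sorted(task_params)
    penalty = 0.0
    for i, first in enumerate(names):
        for second in names[i + 1:]:
            penalty += l2_distance_sq(task_params[first], task_params[second])
    return penalty


def joint_loss(task_losses, task_params, strength=0.1):
    """Sum of per-task losses plus the weighted sharing penalty."""
    return sum(task_losses.values()) + strength * soft_sharing_penalty(task_params)


# Three independent models, one per task named in the abstract (toy values).
params = {
    "intent": [1.0, 2.0],
    "section": [1.0, 1.0],
    "worthiness": [0.0, 2.0],
}
losses = {"intent": 0.5, "section": 0.25, "worthiness": 0.25}
total = joint_loss(losses, params, strength=0.5)
```

Unlike hard sharing, where tasks reuse the same layers, each task here keeps its own parameters and the penalty only pulls them toward each other, so `strength` controls how tightly the tasks are coupled.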
Keywords: Citation intent classification; Multi-task; Pretrained language model; Heterogeneous features
Date: 2023
Downloads: http://link.springer.com/10.1007/s11192-023-04858-4 Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Persistent link: https://EconPapers.repec.org/RePEc:spr:scient:v:128:y:2023:i:12:d:10.1007_s11192-023-04858-4
Ordering information: This journal article can be ordered from
http://www.springer.com/economics/journal/11192
DOI: 10.1007/s11192-023-04858-4
Scientometrics is currently edited by Wolfgang Glänzel
More articles in Scientometrics from Springer, Akadémiai Kiadó
Bibliographic data for series maintained by Sonal Shukla and Springer Nature Abstracting and Indexing.