Research on the Automatic Subject-Indexing Method of Academic Papers Based on Climate Change Domain Ontology
Heng Yang,
Nan Wang,
Lina Yang,
Wei Liu and
Sili Wang ()
Additional contact information
Heng Yang: Chinese Academy of Sciences, Northwest Institute of Eco-Environment and Resources, Lanzhou 730000, China
Nan Wang: Chinese Academy of Sciences, Northwest Institute of Eco-Environment and Resources, Lanzhou 730000, China
Lina Yang: Chinese Academy of Sciences, Northwest Institute of Eco-Environment and Resources, Lanzhou 730000, China
Wei Liu: Chinese Academy of Sciences, Northwest Institute of Eco-Environment and Resources, Lanzhou 730000, China
Sili Wang: Chinese Academy of Sciences, Northwest Institute of Eco-Environment and Resources, Lanzhou 730000, China
Sustainability, 2023, vol. 15, issue 5, 1-13
Abstract:
It is important to classify academic papers in a fine-grained manner to uncover deeper implicit themes and semantics in papers for better semantic retrieval, paper recommendation, research trend prediction, topic analysis, and a series of other functions. Based on the ontology of the climate change domain, this study used an unsupervised approach to combine two methods, syntactic structure and semantic modeling, to build a framework of subject-indexing techniques for academic papers in the climate change domain. The framework automatically indexes a set of conceptual terms as research topics from the domain ontology by inputting the titles, abstracts and keywords of the papers using natural language processing techniques such as syntactic dependencies, text similarity calculation, pre-trained language models, semantic similarity calculation, and weighting factors such as word frequency statistics and graph path calculation. Finally, we evaluated the proposed method using the gold standard of manually annotated articles and demonstrated significant improvements over the other five alternative methods in terms of precision, recall and F1-score. Overall, the method proposed in this study is able to identify the research topics of academic papers more accurately, and also provides useful references for the application of domain ontologies and unsupervised data annotation.
Keywords: ontology; automatic subject indexing; climate change; semantic; deep mining (search for similar items in EconPapers)
JEL-codes: O13 Q Q0 Q2 Q3 Q5 Q56 (search for similar items in EconPapers)
Date: 2023
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
https://www.mdpi.com/2071-1050/15/5/3919/pdf (application/pdf)
https://www.mdpi.com/2071-1050/15/5/3919/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jsusta:v:15:y:2023:i:5:p:3919-:d:1075774
Access Statistics for this article
Sustainability is currently edited by Ms. Alexandra Wu
More articles in Sustainability from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().