A Web Semantic-Based Text Analysis Approach for Enhancing Named Entity Recognition Using PU-Learning and Negative Sampling
Shunqin Zhang, Sanguo Zhang, Wenduo He and Xuan Zhang
Additional contact information
Shunqin Zhang: School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing, China
Sanguo Zhang: School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing, China
Wenduo He: Institute for Network Sciences and Cyberspace (INSC), Tsinghua University, Beijing, China
Xuan Zhang: Tsinghua University, China
International Journal on Semantic Web and Information Systems (IJSWIS), 2024, vol. 20, issue 1, 1-23
Abstract:
Named entity recognition (NER) models are typically developed on well-annotated data. In many scenarios, however, entities are not fully annotated, which leads to serious performance degradation. To address this issue, the authors propose a robust NER approach that combines a novel PU-learning (positive-unlabeled learning) algorithm with negative sampling. Unlike many existing studies, the proposed method handles unlabeled entities in a two-step procedure, which strengthens its ability to mitigate their impact. The algorithm is also versatile and can be integrated into any token-level NER model with ease. Its effectiveness is verified on several classic NER models and datasets, and the authors report competitive performance on both synthetic and real-world datasets.
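As a rough illustration of the negative-sampling idea mentioned in the abstract, the sketch below builds a loss mask for one token-level tagged sentence: every annotated entity token is kept, while only a random fraction of the tokens tagged `O` contribute to the loss, since some of them may be unlabeled entities. This is a generic sketch and not the authors' algorithm. The function name `sample_loss_mask`, the `neg_rate` parameter, and the BIO tag scheme are assumptions made for this example, and the PU-learning step is not shown because the abstract does not specify it.

```python
import math
import random
from typing import List


def sample_loss_mask(tags: List[str], neg_rate: float, rng: random.Random) -> List[bool]:
    """Return a mask that is True for tokens that should contribute to the loss.

    Hypothetical helper for illustration only: annotated entity tokens are
    always kept; tokens tagged "O" are kept with a fixed sampling rate.
    """
    # Positions tagged "O" are treated as candidates that may hide unlabeled entities.
    negatives = [i for i, tag in enumerate(tags) if tag == "O"]
    # Number of "O" tokens to keep, rounded up so a non-empty pool yields at least one.
    k = min(len(negatives), math.ceil(neg_rate * len(negatives)))
    kept = set(rng.sample(negatives, k))
    return [tag != "O" or i in kept for i, tag in enumerate(tags)]


tags = ["B-PER", "I-PER", "O", "O", "O", "O"]
mask = sample_loss_mask(tags, neg_rate=0.5, rng=random.Random(0))
```

In a training loop, such a mask would typically multiply the per-token loss before averaging, so that unsampled `O` tokens neither reward nor penalize the model.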
Date: 2024
Downloads: (external link)
https://services.igi-global.com/resolvedoi/resolve ... 0.4018/IJSWIS.335113 (application/pdf)
Persistent link: https://EconPapers.repec.org/RePEc:igg:jswis0:v:20:y:2024:i:1:p:1-23
International Journal on Semantic Web and Information Systems (IJSWIS) is currently edited by Brij Gupta