EconPapers    
Economics at your fingertips  
 

Semantic Annotation Model and Method Based on Internet Open Dataset

Xin Gao, Yansong Wang, Fang Wang, Baoqun Zhang, Caie Hu, Jian Wang and Longfei Ma
Additional contact information
Xin Gao: State Grid Beijing Electric Power Company, China
Yansong Wang: State Grid Beijing Electric Power Company, China
Fang Wang: State Grid Beijing Electric Power Company, China
Baoqun Zhang: State Grid Beijing Electric Power Company, China
Caie Hu: State Grid Beijing Electric Power Company, China
Jian Wang: State Grid Beijing Electric Power Company, China
Longfei Ma: State Grid Beijing Electric Power Company, China

International Journal of Intelligent Information Technologies (IJIIT), 2025, vol. 21, issue 1, 1-19

Abstract: Traditional semantic annotation faces the problem of dataset diversity. Different fields and scenarios need to be specially annotated, and annotation work usually requires a lot of manpower and time investment. To meet these challenges, this paper deeply studies the semantic annotation model and method based on internet open datasets, aiming to improve annotation efficiency and accuracy and promote data resource sharing and utilization. This paper selects Common Crawl dataset to provide sufficient training samples; methods such as removing stop words and deduplication are used to preprocess data to improve data quality; a keyword extraction model based on heuristic rules and text context is constructed. In terms of semantic annotation model, this paper constructs a model based on Bidirectional Long Short-Term Memory (BiLSTM), which can make full use of the part-of-speech information of the corpus context, capture the part-of-speech features of the corpus, and generate semantic tags through supervised learning.

Date: 2025
References: View complete reference list from CitEc
Citations:

Downloads: (external link)
http://services.igi-global.com/resolvedoi/resolve.aspx?doi=10.4018/IJIIT.370966 (application/pdf)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:igg:jiit00:v:21:y:2025:i:1:p:1-19

Access Statistics for this article

International Journal of Intelligent Information Technologies (IJIIT) is currently edited by Vijayan Sugumaran

More articles in International Journal of Intelligent Information Technologies (IJIIT) from IGI Global
Bibliographic data for series maintained by Journal Editor ().

 
Page updated 2025-04-05
Handle: RePEc:igg:jiit00:v:21:y:2025:i:1:p:1-19