CHTopoNER model-based method for recognizing Chinese place names from social media information

Mengwei Zhang, Xingui Liu, Zheng Zhang, Yue Qiu, Zhipeng Jiang and Pengyu Zhang
Additional contact information
Mengwei Zhang: Information Engineering University
Xingui Liu: Information Engineering University
Zheng Zhang: Information Engineering University
Yue Qiu: Information Engineering University
Zhipeng Jiang: Information Engineering University
Pengyu Zhang: University of Electronic Science and Technology of China

Journal of Geographical Systems, 2024, vol. 26, issue 1, No 7, 149-179

Abstract: Chinese toponym recognition is a key task in named entity recognition and has significant implications for improving geographic information systems. Because social media is real-time and rich in geographical data, identifying Chinese toponyms in social media content, including compound toponyms, informal toponyms, and other forms, is important for automatic geospatial information extraction. However, the strong word-building ability, diverse features, and ambiguity of Chinese toponyms, combined with the linguistic irregularities of social media, make it difficult to locate toponym boundaries accurately and to resolve ambiguities. Furthermore, existing Chinese toponym recognition methods often ignore the fusion of local and global features during feature extraction, which leads to a loss of semantic information. We therefore used the Chinese-roberta-wwm-ext pre-trained language model to encode the input text and obtain character-level information. An improved SoftLexicon-based statistical method was employed to acquire word-level semantic information, which was then integrated with the character-level semantic information. A two-channel neural network layer, comprising a bi-directional long short-term memory network and an inception-dilated convolutional neural network, was used to extract global and local features from the text. In addition, a conditional random field was applied to enforce label constraints. The proposed deep neural network model, called CHTopoNER, is designed to identify various forms of Chinese toponyms in irregular Chinese social media content. Its effectiveness was validated on four publicly available annotated toponym datasets and a custom social media dataset. CHTopoNER surpasses state-of-the-art Chinese toponym recognition models and achieves promising results in extracting various types of toponyms and spatial location terms.
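
The abstract mentions a SoftLexicon-based step that supplies word-level information for each character. As a rough illustration only, the sketch below shows the basic SoftLexicon matching idea: for every character, collect the lexicon words in which it appears at the Beginning, in the Middle, at the End, or as a Single-character word. It is not the authors' improved method. The function name soft_lexicon_sets, the max_word_len parameter, the toy lexicon, and the example sentence are all assumptions made for this sketch, and the later weighting and fusion with character embeddings is not shown.

```python
def soft_lexicon_sets(sentence, lexicon, max_word_len=4):
    """Return, for each character, the lexicon words matched at that position.

    Each entry is a dict with four sets:
      B: words that begin at this character
      M: words that contain this character in the middle
      E: words that end at this character
      S: the character itself, if it is a single-character lexicon word
    (Hypothetical helper for illustration; not the paper's implementation.)
    """
    n = len(sentence)
    sets = [{"B": set(), "M": set(), "E": set(), "S": set()} for _ in range(n)]
    for start in range(n):
        # Try every substring of up to max_word_len characters.
        for end in range(start + 1, min(n, start + max_word_len) + 1):
            word = sentence[start:end]
            if word not in lexicon:
                continue
            if len(word) == 1:
                sets[start]["S"].add(word)
            else:
                sets[start]["B"].add(word)
                sets[end - 1]["E"].add(word)
                for mid in range(start + 1, end - 1):
                    sets[mid]["M"].add(word)
    return sets


# Toy lexicon and sentence (assumed for illustration only).
toy_lexicon = {"南京", "南京市", "市长", "长江", "长江大桥", "大桥", "江"}
toy_sentence = "南京市长江大桥"

for char, matched in zip(toy_sentence, soft_lexicon_sets(toy_sentence, toy_lexicon)):
    print(char, matched)
```

In the toy sentence, the character 长 ends the word 市长 and also begins 长江 and 长江大桥. Keeping all matches in separate position sets, instead of committing to one segmentation, leaves this kind of boundary ambiguity for the downstream model to resolve.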

Keywords: Named entity recognition; Chinese place name recognition; Deep learning; Geographic information acquisition; Disambiguation of place names
JEL-codes: Y90
Date: 2024

Downloads: (external link)
http://link.springer.com/10.1007/s10109-023-00433-w Abstract (text/html)
Access to full text is restricted to subscribers.

Persistent link: https://EconPapers.repec.org/RePEc:kap:jgeosy:v:26:y:2024:i:1:d:10.1007_s10109-023-00433-w

Ordering information: This journal article can be ordered from
http://www.springer. ... ce/journal/10109/PS2

DOI: 10.1007/s10109-023-00433-w

Journal of Geographical Systems is currently edited by Manfred M. Fischer and Antonio Páez

Bibliographic data for series maintained by Sonal Shukla and Springer Nature Abstracting and Indexing.

 
Page updated 2025-03-19
Handle: RePEc:kap:jgeosy:v:26:y:2024:i:1:d:10.1007_s10109-023-00433-w