Chinese POS Disambiguation and Unknown Word Guessing with Lexicalized HMMs

Guohong Fu and Kang-Kwong Luke
Additional contact information
Guohong Fu: The University of Hong Kong, Hong Kong
Kang-Kwong Luke: The University of Hong Kong, Hong Kong

International Journal of Technology and Human Interaction (IJTHI), 2006, vol. 2, issue 1, 39-50

Abstract: This article presents a lexicalized HMM-based approach to Chinese part-of-speech (POS) disambiguation and unknown word guessing (UWG). To exploit word-internal morphological features for Chinese POS tagging, four types of pattern tags are defined to indicate how lexicon words are used in a segmented sentence. These pattern tags are then combined with POS tags to form hybrid tags, so that Chinese POS disambiguation and UWG can be unified as a single task: assigning a proper hybrid tag to each known word in the input. A uniformly lexicalized HMM-based tagger is also developed to perform this task; it can incorporate both internal word-formation patterns and surrounding contextual information within the HMM framework. Experiments on the Peking University Corpus indicate that the proposed approach can improve tagging precision efficiently.
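
The hybrid-tag idea in the abstract can be illustrated with a minimal sketch. It is not the authors' tagger: it uses a plain first-order HMM with Viterbi decoding, whereas the article describes a uniformly lexicalized HMM whose probabilities also condition on words. The pattern labels (S, B, M, E), the function names `hybrid_tag` and `viterbi`, the placeholder tokens, and all probabilities below are assumptions made for illustration; the abstract does not name the four pattern types.

```python
import math

# Hypothetical pattern labels (not named in the abstract): a lexicon word used
# as a whole word (S), or at the beginning (B), middle (M), or end (E) of a
# longer, possibly unknown, word.
PATTERNS = ("S", "B", "M", "E")


def hybrid_tag(pos, pattern):
    """Combine a POS tag and a pattern tag into one hybrid tag, e.g. 'n-B'."""
    return f"{pos}-{pattern}"


def viterbi(words, tags, log_init, log_trans, log_emit, floor=-20.0):
    """Return the most probable hybrid-tag sequence under a first-order HMM.

    log_init[tag], log_trans[(prev_tag, tag)] and log_emit[(tag, word)] hold
    log-probabilities; missing entries fall back to `floor`.
    """
    if not words:
        return []
    # best[i][tag] = (score of the best path ending in tag at i, backpointer)
    best = [{
        t: (log_init.get(t, floor) + log_emit.get((t, words[0]), floor), None)
        for t in tags
    }]
    for i in range(1, len(words)):
        column = {}
        for t in tags:
            prev, score = max(
                ((p, best[i - 1][p][0] + log_trans.get((p, t), floor))
                 for p in tags),
                key=lambda pair: pair[1],
            )
            column[t] = (score + log_emit.get((t, words[i]), floor), prev)
        best.append(column)
    # Follow backpointers from the best final tag.
    last = max(tags, key=lambda t: best[-1][t][0])
    path = [last]
    for i in range(len(words) - 1, 0, -1):
        last = best[i][last][1]
        path.append(last)
    return path[::-1]


# Toy model with made-up probabilities: a verb followed by a two-part noun.
tags = [hybrid_tag("v", "S"), hybrid_tag("n", "B"), hybrid_tag("n", "E")]
log_init = {"v-S": math.log(0.6), "n-B": math.log(0.4)}
log_trans = {
    ("v-S", "n-B"): math.log(0.7),
    ("v-S", "v-S"): math.log(0.3),
    ("n-B", "n-E"): math.log(0.9),
}
log_emit = {
    ("v-S", "w1"): math.log(0.5),
    ("n-B", "w2"): math.log(0.4),
    ("n-E", "w3"): math.log(0.4),
}
result = viterbi(["w1", "w2", "w3"], tags, log_init, log_trans, log_emit)
```

In this toy run, adjacent tokens tagged `n-B` and `n-E` would be read as the two parts of a single unknown noun, which is how one decoding pass can cover both POS disambiguation and unknown word guessing. A lexicalized variant would key the transition and emission tables on the neighbouring words as well as the tags.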

Date: 2006

Downloads: (external link)
http://services.igi-global.com/resolvedoi/resolve. ... 4018/jthi.2006010103 (application/pdf)

Persistent link: https://EconPapers.repec.org/RePEc:igg:jthi00:v:2:y:2006:i:1:p:39-50

International Journal of Technology and Human Interaction (IJTHI) is currently edited by Anabela Mesquita

Page updated 2025-03-19
Handle: RePEc:igg:jthi00:v:2:y:2006:i:1:p:39-50