EconPapers    
Economics at your fingertips  
 

A lexical knowledge base approach for English–Chinese cross‐language information retrieval

Jiangping Chen

Journal of the American Society for Information Science and Technology, 2006, vol. 57, issue 2, 233-243

Abstract: This study proposes and explores a natural language processing‐ (NLP) based strategy to address out‐of‐dictionary and vocabulary mismatch problems in query translation based English–Chinese Cross‐Language Information Retrieval (EC‐CLIR). The strategy, named the LKB approach, is to construct a lexical knowledge base (LKB) and to use it for query translation. In this article, the author describes the LKB construction process, which customizes available translation resources based on the document collection of the EC‐CLIR system. The evaluation shows that the LKB approach is very promising. It consistently increased the percentage of correct translations and decreased the percentage of missing translations in addition to effectively detecting the vocabulary gap between the document collection and the translation resource of the system. The comparative analysis of the top EC‐CLIR results using the LKB and two other translation resources demonstrates that the LKB approach has produced significant improvement in EC‐CLIR performance compared to performance using the original translation resource without customization. It has also achieved the same level of performance as a sophisticated machine translation system. The study concludes that the LKB approach has the potential to be an empirical model for developing real‐world CLIR systems. Linguistic knowledge and NLP techniques, if appropriately used, can improve the effectiveness of English–Chinese cross‐language information retrieval.

Date: 2006
References: Add references at CitEc
Citations:

Downloads: (external link)
https://doi.org/10.1002/asi.20273

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:bla:jamist:v:57:y:2006:i:2:p:233-243

Ordering information: This journal article can be ordered from
https://doi.org/10.1002/(ISSN)1532-2890

Access Statistics for this article

More articles in Journal of the American Society for Information Science and Technology from Association for Information Science & Technology
Bibliographic data for series maintained by Wiley Content Delivery ().

 
Page updated 2025-03-19
Handle: RePEc:bla:jamist:v:57:y:2006:i:2:p:233-243