Exploiting the Web as the multilingual corpus for unknown query translation
Jenq‐Haur Wang, Jei‐Wen Teng, Wen‐Hsiang Lu and Lee‐Feng Chien
Journal of the American Society for Information Science and Technology, 2006, vol. 57, issue 5, 660-670
Abstract:
Users' cross‐lingual queries to a digital library system might be short and the query terms may not be included in a common translation dictionary (unknown terms). In this article, the authors investigate the feasibility of exploiting the Web as the multilingual corpus source to translate unknown query terms for cross‐language information retrieval in digital libraries. They propose a Web‐based term translation approach to determine effective translations for unknown query terms by mining bilingual search‐result pages obtained from a real Web search engine. This approach can enhance the construction of a domain‐specific bilingual lexicon and bring multilingual support to a digital library that only has monolingual document collections. Very promising results have been obtained in generating effective translation equivalents for many unknown terms, including proper nouns, technical terms, and Web query terms, and in assisting bilingual lexicon construction for a real digital library system.
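The idea of mining bilingual search-result pages can be illustrated with a minimal sketch. It is not the authors' method: it assumes the result snippets for a source term have already been fetched, extracts character n-grams from runs of CJK characters as translation candidates, and ranks them by how many snippets they share with the source term. The function names, the n-gram length limits, and the toy snippets are all hypothetical and chosen only for illustration.

```python
import re
from collections import Counter

# Contiguous runs of common CJK ideographs (assumed target language: Chinese).
CJK_RUN = re.compile(r"[\u4e00-\u9fff]+")


def candidate_ngrams(text, min_len=2, max_len=4):
    """Yield character n-grams from each contiguous run of CJK characters."""
    for run in CJK_RUN.findall(text):
        for n in range(min_len, max_len + 1):
            for i in range(len(run) - n + 1):
                yield run[i:i + n]


def rank_translations(source_term, snippets, top_k=3):
    """Rank candidates by the number of snippets that also contain source_term."""
    counts = Counter()
    for snippet in snippets:
        if source_term.lower() not in snippet.lower():
            continue  # only mixed-language snippets mentioning the term count
        # A set ensures each candidate is counted once per snippet.
        counts.update(set(candidate_ngrams(snippet)))
    return counts.most_common(top_k)


# Toy, hand-written snippets standing in for real search-result text.
snippets = [
    "Yahoo 雅虎 奇摩 首頁",
    "雅虎 Yahoo 新聞",
    "Yahoo 搜尋 雅虎",
]
print(rank_translations("Yahoo", snippets, top_k=1))  # [('雅虎', 3)]
```

Raw snippet counts favor frequent words regardless of their relation to the query, so a practical system would need a stronger association measure and a way to fetch and filter real search results; both are left out of this sketch.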
Date: 2006
Downloads: https://doi.org/10.1002/asi.20328
Persistent link: https://EconPapers.repec.org/RePEc:bla:jamist:v:57:y:2006:i:5:p:660-670
Ordering information: This journal article can be ordered from
https://doi.org/10.1002/(ISSN)1532-2890
More articles in Journal of the American Society for Information Science and Technology from Association for Information Science & Technology