Finding a good query‐related topic for boosting pseudo‐relevance feedback

Zheng Ye, Jimmy Xiangji Huang and Hongfei Lin

Journal of the American Society for Information Science and Technology, 2011, vol. 62, issue 4, 748-760

Abstract: Pseudo‐relevance feedback (PRF) via query expansion (QE) assumes that the top‐ranked documents from the first‐pass retrieval are relevant. The most informative terms in these pseudo‐relevant feedback documents are then used to update the original query representation in order to boost retrieval performance. Most current PRF approaches estimate the importance of candidate expansion terms from their document‐level statistics. However, a feedback document may cover several topics, not all of which are related to the query, even if the document is judged relevant. The main argument of this article is that PRF should be conducted at a granularity finer than the document level. We propose a topic‐based feedback model with three different strategies for finding a good query‐related topic based on the Latent Dirichlet Allocation model. Experimental results on four representative TREC collections show that QE based on the derived topic achieves statistically significant improvements over a strong feedback model in the language modeling framework, which updates the query representation based on the top‐ranked documents.
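
Illustration: the document-level PRF step that the abstract takes as its starting point can be sketched in a few lines of Python. This is a generic, simplified sketch and not the authors' topic-based model: the function name `expand_query`, the parameters `k`, `n_terms` and `alpha`, and the use of pooled raw term counts as the feedback model are all assumptions made for illustration. A topic-based variant in the spirit of the abstract would replace the pooled counts with the word distribution of a single query-related topic.

```python
from collections import Counter


def expand_query(query_terms, ranked_docs, k=2, n_terms=2, alpha=0.5):
    """Hypothetical helper: interpolate a query model with a feedback model.

    query_terms: list of tokens in the original query.
    ranked_docs: list of token lists, ordered by first-pass retrieval score.
    k:           number of top-ranked documents assumed to be relevant.
    n_terms:     number of candidate expansion terms to keep.
    alpha:       weight given to the feedback model in the interpolation.
    """
    # Original query model: maximum-likelihood estimate over the query tokens.
    q_counts = Counter(query_terms)
    q_len = sum(q_counts.values())
    query_model = {t: c / q_len for t, c in q_counts.items()} if q_len else {}

    # Feedback model: pooled term counts over the top-k documents
    # (document-level statistics, the baseline the abstract argues against).
    fb_counts = Counter()
    for doc in ranked_docs[:k]:
        fb_counts.update(doc)
    top = fb_counts.most_common(n_terms)
    total = sum(c for _, c in top)
    if total == 0:
        # No feedback evidence: fall back to the original query model.
        return query_model
    feedback_model = {t: c / total for t, c in top}

    # Linear interpolation of the two models gives the updated query.
    terms = set(query_model) | set(feedback_model)
    return {
        t: (1 - alpha) * query_model.get(t, 0.0)
        + alpha * feedback_model.get(t, 0.0)
        for t in terms
    }


if __name__ == "__main__":
    query = ["topic", "model"]
    docs = [
        ["topic", "model", "lda", "lda"],
        ["lda", "feedback", "topic"],
        ["unrelated"],  # ranked third, so ignored when k=2
    ]
    print(expand_query(query, docs))
```

In this toy run the term "lda" enters the expanded query because it dominates the two top-ranked documents, while the third document contributes nothing. The weights still sum to one because both component models are normalized before interpolation.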

Date: 2011

Downloads: (external link)
https://doi.org/10.1002/asi.21501

Persistent link: https://EconPapers.repec.org/RePEc:bla:jamist:v:62:y:2011:i:4:p:748-760

Ordering information: This journal article can be ordered from
https://doi.org/10.1002/(ISSN)1532-2890

More articles in Journal of the American Society for Information Science and Technology from Association for Information Science & Technology

Handle: RePEc:bla:jamist:v:62:y:2011:i:4:p:748-760