EconPapers    
Economics at your fingertips  
 

Multi Stage Retrieval for Web Search During Crisis

Claudiu Constantin Tcaciuc, Daniele Rege Cambrin and Paolo Garza ()
Additional contact information
Claudiu Constantin Tcaciuc: Dipartimento di Automatica e Informatica, Politecnico di Torino, 10129 Torino, Italy
Daniele Rege Cambrin: Dipartimento di Automatica e Informatica, Politecnico di Torino, 10129 Torino, Italy
Paolo Garza: Dipartimento di Automatica e Informatica, Politecnico di Torino, 10129 Torino, Italy

Future Internet, 2025, vol. 17, issue 6, 1-17

Abstract: During crisis events, digital information volume can increase by over 500% within hours, with social media platforms alone generating millions of crisis-related posts. This volume creates critical challenges for emergency responders who require timely access to the concise subset of accurate information they are interested in. Existing approaches strongly rely on the power of large language models. However, the use of large language models limits the scalability of the retrieval procedure and may introduce hallucinations. This paper introduces a novel multi-stage text retrieval framework to enhance information retrieval during crises. Our framework employs a novel three-stage extractive pipeline where (1) a topic modeling component filters candidates based on thematic relevance, (2) an initial high-recall lexical retriever identifies a broad candidate set, and (3) a dense retriever reranks the remaining documents. This architecture balances computational efficiency with retrieval effectiveness, prioritizing high recall in early stages while refining precision in later stages. The framework avoids the introduction of hallucinations, achieving a 15% improvement in BERT-Score compared to existing solutions without requiring any costly abstractive model. Moreover, our sequential approach accelerates the search process by 5% compared to the use of a single-stage based on a dense retrieval approach, with minimal effect on the performance in terms of BERT-Score.

Keywords: web search; Internet and social media; text retrieval; crisis management (search for similar items in EconPapers)
JEL-codes: O3 (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:

Downloads: (external link)
https://www.mdpi.com/1999-5903/17/6/239/pdf (application/pdf)
https://www.mdpi.com/1999-5903/17/6/239/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jftint:v:17:y:2025:i:6:p:239-:d:1667343

Access Statistics for this article

Future Internet is currently edited by Ms. Grace You

More articles in Future Internet from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().

 
Page updated 2025-05-30
Handle: RePEc:gam:jftint:v:17:y:2025:i:6:p:239-:d:1667343