EconPapers    
Economics at your fingertips  
 

Optimal Query Generation for Hidden Web Extraction through Response Analysis

Sonali Gupta and Komal Kumar Bhatia
Additional contact information
Sonali Gupta: Department of Computer Engineering, YMCA University of Science & Technology, Faridabad, Haryana, India
Komal Kumar Bhatia: Department of Computer Engineering, YMCA University of Science & Technology, Faridabad, Haryana, India

International Journal of Information Retrieval Research (IJIRR), 2014, vol. 4, issue 2, 1-18

Abstract: A huge number of Hidden Web databases exists over the WWW forming a massive source of high quality information. Retrieval of this information for enriching the repository of the search engine is the prime target of a Hidden web crawler. Besides this, the crawler should perform this task at an affordable cost and resource utilization. This paper proposes a Random ranking mechanism whereby the queries to be raised by the hidden web crawler have been ranked. By ranking the queries according to the proposed mechanism, the Hidden Web crawler is able to make an optimal choice among the candidate queries and efficiently retrieve the Hidden web databases. The Hidden Web crawler proposed here also possesses an extensible and scalable framework to improve the efficiency of crawling. The proposed approach has also been compared with other methods of Hidden Web crawling existing in the literature.

Date: 2014
References: Add references at CitEc
Citations:

Downloads: (external link)
http://services.igi-global.com/resolvedoi/resolve. ... 018/ijirr.2014040101 (application/pdf)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:igg:jirr00:v:4:y:2014:i:2:p:1-18

Access Statistics for this article

International Journal of Information Retrieval Research (IJIRR) is currently edited by Zhongyu Lu

More articles in International Journal of Information Retrieval Research (IJIRR) from IGI Global
Bibliographic data for series maintained by Journal Editor ().

 
Page updated 2025-03-19
Handle: RePEc:igg:jirr00:v:4:y:2014:i:2:p:1-18