Building the Multidimensional Semantic Index of Webpages for Facet Extraction
Xiao Wei, Chenglei Qin and Zheng Xu
Additional contact information
Xiao Wei: Shanghai Institute of Technology, Shanghai, China & Institute of Automation, Chinese Academy of Sciences, Beijing, China
Chenglei Qin: Shanghai Institute of Technology, Shanghai, China
Zheng Xu: The Third Research Institute of Ministry of Public Security, Shanghai, China & Tsinghua University, Beijing, China
International Journal of Cognitive Informatics and Natural Intelligence (IJCINI), 2015, vol. 9, issue 2, 1-23
Abstract:
Faceted search is an efficient way to exploit big data, and one of its key issues is extracting facets from unstructured webpages automatically. Extracting facets from massive collections of unstructured webpages accurately and automatically remains an open problem. To address it, this paper first proposes a novel index structure for webpages, the Multidimensional Semantic Index (MDSI), which holds rich semantics and is helpful for facet extraction. In MDSI, the semantic indexes of different dimensions are bridged by mining the semantic mappings between them. An automatic facet extraction method is then proposed, based on analysing the semantic mapping relations in MDSI. Finally, to validate the proposed method, two datasets are constructed; the experimental results show that the method is feasible and comparatively precise.
Date: 2015
Downloads: (external link)
http://services.igi-global.com/resolvedoi/resolve. ... 18/IJCINI.2015040101 (application/pdf)
Persistent link: https://EconPapers.repec.org/RePEc:igg:jcini0:v:9:y:2015:i:2:p:1-23
International Journal of Cognitive Informatics and Natural Intelligence (IJCINI) is currently edited by Kangshun Li