EconPapers    
Economics at your fingertips  
 

Contextualized Text OLAP Based on Information Retrieval

Lamia Oukid, Nadjia Benblidia, Fadila Bentayeb, Ounas Asfari and Omar Boussaid
Additional contact information
Lamia Oukid: LRDSI Laboratory, University of Blida 1, Blida, Algeria
Nadjia Benblidia: LRDSI Laboratory, University of Blida 1, Blida, Algeria
Fadila Bentayeb: ERIC Laboratory, University of Lyon 2, Lyon, France
Ounas Asfari: ERIC Laboratory, University of Lyon 2, Lyon, France
Omar Boussaid: ERIC Laboratory, University of Lyon 2, Lyon, France

International Journal of Data Warehousing and Mining (IJDWM), 2015, vol. 11, issue 2, 1-21

Abstract: Current data warehousing and On-Line Analytical Processing (OLAP) systems are not yet particularly appropriate for textual data analysis. It is therefore crucial to develop a new data model and an OLAP system to provide the necessary analyses for textual data. To achieve this objective, this paper proposes a new approach based on information retrieval (IR) techniques. Moreover, several contextual factors may significantly affect the information relevant to a decision-maker. Thus, the paper proposes to consider contextual factors in an OLAP system to provide relevant results. It provides a generalized approach for Text OLAP analysis which consists of two parts: The first one is a context-based text cube model, denoted CXT-Cube. It is characterized by several contextual dimensions. Hence, during the OLAP analysis process, CXT-Cube exploits the contextual information in order to better consider the semantics of textual data. Besides, the work associates to CXT-Cube a new text analysis measure based on an OLAP-adapted vector space model and a relevance propagation technique. The second part is an OLAP aggregation operator called ORank (OLAP-Rank) which allows to aggregate textual data in an OLAP environment while considering relevant contextual factors. To consider the user context, this paper proposes a query expansion method based on a decision-maker profile. Based on IR metrics, it evaluates the proposed aggregation operator in different cases using several data analysis queries. The evaluation shows that the precision of the system is significantly better than that of a Text OLAP system based on classical IR. This is due to the consideration of the contextual factors.

Date: 2015
References: Add references at CitEc
Citations: View citations in EconPapers (1)

Downloads: (external link)
http://services.igi-global.com/resolvedoi/resolve. ... 018/ijdwm.2015040101 (application/pdf)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:igg:jdwm00:v:11:y:2015:i:2:p:1-21

Access Statistics for this article

International Journal of Data Warehousing and Mining (IJDWM) is currently edited by Eric Pardede

More articles in International Journal of Data Warehousing and Mining (IJDWM) from IGI Global
Bibliographic data for series maintained by Journal Editor ().

 
Page updated 2025-03-19
Handle: RePEc:igg:jdwm00:v:11:y:2015:i:2:p:1-21