EconPapers    
Economics at your fingertips  
 

A feature-based selection technique for reduction of large scale data

Ritu Chauhan and Harleen Kaur ()

International Journal of Data Analysis Techniques and Strategies, 2017, vol. 9, issue 3, 207-221

Abstract: The inflated development in public healthcare domain has forced numerous organisations to construct and maintain large scale databases or data warehouses. However, the prediction of knowledge should be an automated process to discover hidden information from large scale databases. The elaborated studies in the past suggest that minimum interesting variables can determine qualified information while preserving information among the data. In addition, it is determined that large scale databases usually comprise of redundant and irrelevant features which have proven to be a major setback for efficient and effective analysis of data. This paper intends to provide an integrated approach by utilising machine learning technique and other convention statistical techniques for extraction of information from large scale databases. In the formulated approach, we have potentially exploited two approaches where the first approach emphasises on retrieval of feature subsets using MODTree filtering technique from discretised datasets with relative application domain on real datasets of Substance Abuse and Mental Health Data Archive (SAMHDA) collected from different states of USA. The second phase of study exploits statistical techniques on potential targets for discovery of interesting information from reduced datasets. We present a novel perspective using feature selection and statistical techniques for determination of knowledge from large scale databases.

Keywords: data mining; feature selection; abusive substance; statistical analysis; alcohol consumption. (search for similar items in EconPapers)
Date: 2017
References: Add references at CitEc
Citations:

Downloads: (external link)
http://www.inderscience.com/link.php?id=86630 (text/html)
Access to full text is restricted to subscribers.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:ids:injdan:v:9:y:2017:i:3:p:207-221

Access Statistics for this article

More articles in International Journal of Data Analysis Techniques and Strategies from Inderscience Enterprises Ltd
Bibliographic data for series maintained by Sarah Parker ().

 
Page updated 2025-03-19
Handle: RePEc:ids:injdan:v:9:y:2017:i:3:p:207-221