EconPapers    
Economics at your fingertips  
 

Cost-Restricted Feature Selection for Data Acquisition

Xiaoping Liu (), Xiao-Bai Li () and Sumit Sarkar ()
Additional contact information
Xiaoping Liu: D’Amore-McKim School of Business, Northeastern University, Boston, Massachusetts 02115
Xiao-Bai Li: Department of Operations and Information Systems, Manning School of Business, University of Massachusetts Lowell, Lowell, Massachusetts 01854
Sumit Sarkar: Naveen Jindal School of Management, University of Texas at Dallas, Richardson, Texas 75080

Management Science, 2023, vol. 69, issue 7, 3976-3992

Abstract: When acquiring consumer data for marketing or new business initiatives, it is important to decide what attributes or features of potential customers should be acquired. We study a new feature selection problem in the context of customer data acquisition in which different features have different acquisition costs. This feature selection problem is studied for linear regression and logistic regression. We formulate the feature selection and acquisition problems as nonlinear discrete optimization problems that minimize prediction errors subject to a budget constraint. We derive the analytical properties of the solutions for the problems, develop a computational procedure for solving the problems, provide an intuitive interpretation for the feature selection criteria, and discuss managerial implications of the solution approach. The results of the experimental study demonstrate the effectiveness of our approach.

Keywords: data acquisition; feature selection; Lasso; linear regression; logistic regression (search for similar items in EconPapers)
Date: 2023
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (2)

Downloads: (external link)
http://dx.doi.org/10.1287/mnsc.2022.4551 (application/pdf)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:inm:ormnsc:v:69:y:2023:i:7:p:3976-3992

Access Statistics for this article

More articles in Management Science from INFORMS Contact information at EDIRC.
Bibliographic data for series maintained by Chris Asher ().

 
Page updated 2025-03-19
Handle: RePEc:inm:ormnsc:v:69:y:2023:i:7:p:3976-3992