Cost-Restricted Feature Selection for Data Acquisition
Xiaoping Liu,
Xiao-Bai Li and
Sumit Sarkar
Additional contact information
Xiaoping Liu: D’Amore-McKim School of Business, Northeastern University, Boston, Massachusetts 02115
Xiao-Bai Li: Department of Operations and Information Systems, Manning School of Business, University of Massachusetts Lowell, Lowell, Massachusetts 01854
Sumit Sarkar: Naveen Jindal School of Management, University of Texas at Dallas, Richardson, Texas 75080
Management Science, 2023, vol. 69, issue 7, 3976-3992
Abstract:
When acquiring consumer data for marketing or new business initiatives, it is important to decide what attributes or features of potential customers should be acquired. We study a new feature selection problem in the context of customer data acquisition in which different features have different acquisition costs. This feature selection problem is studied for linear regression and logistic regression. We formulate the feature selection and acquisition problems as nonlinear discrete optimization problems that minimize prediction errors subject to a budget constraint. We derive the analytical properties of the solutions for the problems, develop a computational procedure for solving the problems, provide an intuitive interpretation for the feature selection criteria, and discuss managerial implications of the solution approach. The results of the experimental study demonstrate the effectiveness of our approach.
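As a rough illustration of the kind of problem the abstract describes (minimizing prediction error over a discrete choice of features subject to a budget), the sketch below enumerates every feature subset that fits the budget and keeps the one with the lowest least-squares error. It is not the authors' procedure: the function name `select_features`, the brute-force search, and the use of in-sample squared error are all assumptions made for illustration, and enumeration is only practical for a handful of features.

```python
from itertools import combinations

import numpy as np


def select_features(X, y, costs, budget):
    """Hypothetical helper: exhaustive cost-restricted feature selection.

    Returns (subset, sse), where subset is a tuple of column indices whose
    total acquisition cost does not exceed the budget and sse is the
    in-sample sum of squared errors of a linear regression with intercept.
    """
    n, p = X.shape
    # Baseline: intercept-only model, which acquires no features.
    best_subset = ()
    best_sse = float(np.sum((y - y.mean()) ** 2))

    for k in range(1, p + 1):
        for subset in combinations(range(p), k):
            if sum(costs[j] for j in subset) > budget:
                continue  # violates the budget constraint
            # Design matrix: intercept column plus the chosen features.
            A = np.column_stack([np.ones(n), X[:, list(subset)]])
            coef, *_ = np.linalg.lstsq(A, y, rcond=None)
            sse = float(np.sum((y - A @ coef) ** 2))
            if sse < best_sse:
                best_subset, best_sse = subset, sse
    return best_subset, best_sse


# Synthetic example (made-up data): feature 0 is informative but expensive.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3))
y = 3.0 * X[:, 0] + 0.1 * X[:, 2]
costs = [5.0, 1.0, 1.0]

subset, sse = select_features(X, y, costs, budget=5.0)
print(subset, round(sse, 3))
```

With a budget of 5, the search can afford either the expensive feature alone or the two cheap ones together, so the trade-off between cost and predictive value is visible directly. A practical version would score subsets on held-out data rather than in-sample error.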
Keywords: data acquisition; feature selection; Lasso; linear regression; logistic regression
Date: 2023
Downloads: http://dx.doi.org/10.1287/mnsc.2022.4551
Persistent link: https://EconPapers.repec.org/RePEc:inm:ormnsc:v:69:y:2023:i:7:p:3976-3992