Loss-Based Cross-Validated Deletion/Substitution/Addition Algorithms in Estimation

Sandra Sinisi and Mark van der Laan
Additional contact information
Sandra Sinisi: Division of Biostatistics, School of Public Health, University of California, Berkeley
Mark van der Laan: Division of Biostatistics, School of Public Health, University of California, Berkeley

No 1142, U.C. Berkeley Division of Biostatistics Working Paper Series from Berkeley Electronic Press

Abstract: In van der Laan and Dudoit (2003) we propose and theoretically study a unified loss-function-based statistical methodology, which provides a road map for estimation and performance assessment. Given a parameter of interest that can be described as the minimizer of the population mean of a loss function, the important ingredients of the road map include cross-validation for estimator selection and minimization, over subsets of basis functions, of the empirical risk of the subset-specific estimator of the parameter of interest. Here the basis functions correspond to a parameterization of a specified subspace of the complete parameter space. In this article we first review this approach. Then we propose a general deletion/substitution/addition algorithm for minimizing, over subsets of variables (e.g., basis functions), the empirical risk of subset-specific estimators of the parameter of interest. In particular, in the regression context, this algorithm corresponds to minimizing over subsets of variables the sum of squared residuals of the subset-specific linear regression estimator. This algorithm provides us with a new class of loss-based cross-validated algorithms for prediction of univariate and multivariate outcomes and for conditional density and hazard estimation, and we generalize it to censored outcomes such as survival. In the context of regression, using polynomial basis functions, we study the properties of the deletion/substitution/addition algorithm in simulations and apply the method to detect binding sites in yeast gene expression experiments.
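
The regression case described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (fit, risk, dsa_search, cv_select_size), the acceptance rule for moves, the fold count, and the simulated data are all assumptions made for illustration. The search tries deletion, then substitution, then addition moves over subsets of columns, scoring each subset by the empirical squared-error risk of its least-squares fit, and cross-validation then chooses the subset size.

```python
import numpy as np


def fit(X, y, subset):
    """Least-squares coefficients (intercept first) using only the columns in `subset`."""
    A = np.column_stack([np.ones(len(y))] + [X[:, j] for j in sorted(subset)])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return coef


def risk(X, y, subset, coef):
    """Mean squared-error loss of a fitted subset-specific regression on (X, y)."""
    A = np.column_stack([np.ones(len(y))] + [X[:, j] for j in sorted(subset)])
    return float(np.mean((y - A @ coef) ** 2))


def dsa_search(X, y, max_size):
    """Local search over column subsets; returns {size: (empirical risk, subset)}.

    Assumed acceptance rule: take the best deletion (or, failing that, the best
    substitution) only if it strictly beats the best risk recorded so far for
    the resulting size; otherwise take the best addition. Assumes max_size <= p.
    """
    p = X.shape[1]
    score = lambda s: risk(X, y, s, fit(X, y, s))
    current = frozenset()
    best = {0: (score(current), current)}
    while True:
        outside = [k for k in range(p) if k not in current]
        deletions = [current - {j} for j in current]
        substitutions = [(current - {j}) | {k} for j in current for k in outside]
        additions = [current | {k} for k in outside] if len(current) < max_size else []
        moved = False
        for moves in (deletions, substitutions):
            if not moves:
                continue
            r, s = min(((score(m), m) for m in moves), key=lambda t: t[0])
            if r < best[len(s)][0]:
                best[len(s)] = (r, s)
                current, moved = s, True
                break
        if moved:
            continue
        if not additions:
            return best
        r, s = min(((score(m), m) for m in additions), key=lambda t: t[0])
        if len(s) not in best or r < best[len(s)][0]:
            best[len(s)] = (r, s)
        current = s


def cv_select_size(X, y, max_size, n_folds=5, seed=0):
    """Choose the subset size by V-fold cross-validated risk, then refit the search on all data."""
    rng = np.random.default_rng(seed)
    folds = np.array_split(rng.permutation(len(y)), n_folds)
    cv_risk = np.zeros(max_size + 1)
    for val in folds:
        train = np.setdiff1d(np.arange(len(y)), val)
        best = dsa_search(X[train], y[train], max_size)
        for size, (_, s) in best.items():
            coef = fit(X[train], y[train], s)
            cv_risk[size] += risk(X[val], y[val], s, coef) / n_folds
    size = int(np.argmin(cv_risk))
    return dsa_search(X, y, max_size)[size][1], cv_risk


# Simulated example (hypothetical data): only columns 0 and 3 carry signal.
rng = np.random.default_rng(1)
X = rng.normal(size=(200, 6))
y = 2.0 * X[:, 0] - 3.0 * X[:, 3] + rng.normal(scale=0.5, size=200)
subset, cv_risk = cv_select_size(X, y, max_size=4)
print(sorted(subset), cv_risk.round(3))
```

In this sketch the columns of X stand in for the basis functions; supplying polynomial terms as columns would mimic the polynomial regression setting mentioned in the abstract. The search is kept separate from the cross-validation step so that the loss and the subset-specific estimator could in principle be swapped for another pair.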

Keywords: Censored data; cross-validation; estimation; loss function; model selection; polynomial regression; prediction; risk; variable selection
Date: 2004-07-11
Note: oai:bepress.com:ucbbiostat-1142

Downloads: (external link)
http://www.bepress.com/cgi/viewcontent.cgi?article=1142&context=ucbbiostat (application/pdf)

Persistent link: https://EconPapers.repec.org/RePEc:bep:ucbbio:1142

Handle: RePEc:bep:ucbbio:1142