Avoiding biases from data-dependent specification search: an application to a tillage choice model
Sanchita Sengupta,
Lyubov Kurkalova () and
Catherine Kling
No 21399, 2006 Annual meeting, July 23-26, Long Beach, CA from American Agricultural Economics Association (New Name 2008: Agricultural and Applied Economics Association)
Abstract:
The study evaluates the gains of avoiding data-dependent specification search on an estimation sample in an application to discrete choice models. We incorporate data splitting, the process by which the total available sample is randomly split in two or more sub-samples with the first (specification) sub-sample used for specification search, and the second (estimation) sub-sample used for obtaining clean estimates using the model chosen on the specification sub-sample according to a set criterion. We estimate 14 binary Logit models of the adoption of conservation tillage corresponding to the major sub-watersheds of the Upper Mississippi River Basin. For each of the sub-watershed models, we use the specification sub-sample to choose the explanatory variables that lead to the highest number of correct predictions provided that estimated coefficients are in conformity with economic theory. To evaluate the gains of avoiding specification search on the estimation sub-sample, we follow Gong (1986)[8] and calculate the expected excess error, which is a measure of excess optimism concerning model fit on the specification sample. We find that the excess optimism varies with the sub-watersheds and has a tendency to be larger for the sub-watersheds with smaller samples.
Keywords: Research; Methods/; Statistical; Methods (search for similar items in EconPapers)
Pages: 23
Date: 2006
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
https://ageconsearch.umn.edu/record/21399/files/sp06sa05.pdf (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:ags:aaea06:21399
DOI: 10.22004/ag.econ.21399
Access Statistics for this paper
More papers in 2006 Annual meeting, July 23-26, Long Beach, CA from American Agricultural Economics Association (New Name 2008: Agricultural and Applied Economics Association) Contact information at EDIRC.
Bibliographic data for series maintained by AgEcon Search ().