Sample Selection
Steven Finlay
Chapter 3 in Credit Scoring, Response Modeling, and Insurance Rating, 2012, pp 66-88 from Palgrave Macmillan
Abstract:
Abstract The databases available for model construction can be vast. Some consumer databases contain hundreds of millions of records (observations) and thousands of predictor variables. Even with modern computing facilities it may not be practical to use all of the data available. There will also be cases that are not suitable for model construction, and these need to be identified and dealt with. The data used for model construction should also be as similar as possible to the data that will exist when the completed model is put into service — which usually means that the sample used to construct the model should be as recent as possible to mitigate against changes in the patterns of behavior that accumulate over time. For these reasons it is common for models to be constructed using a sub-set (a sample) of the available data, rather than the full population.
Keywords: Model Construction; Insurance Rate; Forecast Horizon; Adaptive Sampling; Balance Sample (search for similar items in EconPapers)
Date: 2012
References: Add references at CitEc
Citations:
There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:pal:palchp:978-1-137-03169-3_3
Ordering information: This item can be ordered from
http://www.palgrave.com/9781137031693
DOI: 10.1057/9781137031693_3
Access Statistics for this chapter
More chapters in Palgrave Macmillan Books from Palgrave Macmillan
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().