Sample Selection

Finlay, Steven

Sample Selection

Steven Finlay

Chapter 3 in Credit Scoring, Response Modeling, and Insurance Rating, 2012, pp 66-88 from Palgrave Macmillan

Abstract: Abstract The databases available for model construction can be vast. Some consumer databases contain hundreds of millions of records (observations) and thousands of predictor variables. Even with modern computing facilities it may not be practical to use all of the data available. There will also be cases that are not suitable for model construction, and these need to be identified and dealt with. The data used for model construction should also be as similar as possible to the data that will exist when the completed model is put into service — which usually means that the sample used to construct the model should be as recent as possible to mitigate against changes in the patterns of behavior that accumulate over time. For these reasons it is common for models to be constructed using a sub-set (a sample) of the available data, rather than the full population.

Keywords: Model Construction; Insurance Rate; Forecast Horizon; Adaptive Sampling; Balance Sample (search for similar items in EconPapers)
Date: 2012
References: Add references at CitEc
Citations:

There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:pal:palchp:978-1-137-03169-3_3

Ordering information: This item can be ordered from
http://www.palgrave.com/9781137031693

DOI: 10.1057/9781137031693_3

Access Statistics for this chapter

More chapters in Palgrave Macmillan Books from Palgrave Macmillan
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().