Understanding Relationships in Data
Steven Finlay
Chapter 5 in Credit Scoring, Response Modeling, and Insurance Rating, 2012, pp 114-143 from Palgrave Macmillan
Abstract:
Abstract Once data has been gathered and prepared there should be a single, clean and well formatted data set. The data set should contain all predictor variables that are going to be considered for inclusion within the model, as well as the dependent variable (the modeling objective). Predictor variables that were suggested during the project planning phase, but which have subsequently been shown to contain meaningless or incorrect data for the majority of observations, should also have been excluded. For individual observations where data is missing or spurious, then a suitable imputation process should have been applied to infer what the true value is likely to have been, or a default value assigned to represent them.
Keywords: Predictor Variable; Credit Card; Insurance Rate; Interaction Variable; Residential Status (search for similar items in EconPapers)
Date: 2012
References: Add references at CitEc
Citations:
There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:pal:palchp:978-1-137-03169-3_5
Ordering information: This item can be ordered from
http://www.palgrave.com/9781137031693
DOI: 10.1057/9781137031693_5
Access Statistics for this chapter
More chapters in Palgrave Macmillan Books from Palgrave Macmillan
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().