Data Quality Control Based on Metric Data Models
Veit Köppen () and
Hans-J. Lenz ()
Additional contact information
Veit Köppen: Freie Universität Berlin, Institute of Production, Information Systems and Operations Research
Hans-J. Lenz: Freie Universität Berlin, Institute of Statistics and Econometrics
A chapter in Frontiers in Statistical Quality Control 9, 2010, pp 263-276 from Springer
Abstract:
Summary We consider statistical edits defined on a metric data space spanned by the nonkey attributes (variables) of a given database. Integrity constraints are defined on this data space based on definitions, behavioral equations or a balance equation system. As an example think of a set of business or economic indicators. The variables are linked by the four basic arithmetic operations only. Assuming a multivariate Gaussian distribution and an error in the variables model estimation of the unknown (latent) variables can be carried out by a generalized least-squares (GLS) procedure. The drawback of this approach is that the equations form a non-linear equation system due to multiplication and division of variables, and that generally one assumes independence between all variables due to a lack of information in real applications. As there exists no finite parameter density family which is closed under all four arithmetic operations we use MCMC-simulation techniques, cf. Smith and Gelfand (1992) and Chib (2004) to derive the “exact” distributions in the non-normal case and under cross-correlation. The research can be viewed as an extension of Köppen and Lenz (2005) in the sense of studying the robustness of the GLS approach with respect to non-normality and correlation.
Keywords: Multivariate Gaussian Distribution; Data Quality Control; Statistical Quality Control; Validation Rule; Right Hand Side Variable (search for similar items in EconPapers)
Date: 2010
References: Add references at CitEc
Citations:
There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:sprchp:978-3-7908-2380-6_17
Ordering information: This item can be ordered from
http://www.springer.com/9783790823806
DOI: 10.1007/978-3-7908-2380-6_17
Access Statistics for this chapter
More chapters in Springer Books from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().