Identifying excessively rounded or truncated data
Kevin H. Knuth (),
J. Patrick Castle () and
Kevin R. Wheeler ()
Additional contact information
Kevin H. Knuth: University at Albany, Department of Physics
J. Patrick Castle: NASA Ames Research Center, Intelligent Systems Division
Kevin R. Wheeler: Monterey Bay Aquarium Research Institute
A chapter in Compstat 2006 - Proceedings in Computational Statistics, 2006, pp 313-323 from Springer
Abstract:
Abstract All data are digitized, and hence are essentially integers rather than true real numbers. Ordinarily this causes no difficulties since the truncation or rounding usually occurs below the noise level. However, in some instances, when the instruments or data delivery and storage systems are designed with less than optimal regard for the data or the subsequent data analysis, the effects of digitization may be comparable to important features contained within the data. In these cases, information has been irrevocably lost in the truncation process. While there exist techniques for dealing with truncated data, we propose a straightforward method that will allow us to detect this problem before the data analysis stage. It is based on an optimal histogram binning algorithm that can identify when the statistical structure of the digitization is on the order of the statistical structure of the data set itself.
Keywords: Data; information; truncated; rounded; Bayesian; histogram (search for similar items in EconPapers)
Date: 2006
References: Add references at CitEc
Citations:
There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:sprchp:978-3-7908-1709-6_24
Ordering information: This item can be ordered from
http://www.springer.com/9783790817096
DOI: 10.1007/978-3-7908-1709-6_24
Access Statistics for this chapter
More chapters in Springer Books from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().