Data Editing and Imputation in Business Surveys Using “R”

Elena Romascanu
Elena Romascanu: National Institute of Statistics, Romania

Romanian Statistical Review, 2014, vol. 62, issue 2, 129-146

Abstract: Purpose – Missing data are a recurring problem that can cause bias or lead to inefficient analyses. This paper directly compares the features of two statistical software packages, R and SPSS, in order to take full advantage of existing automated methods for data editing and imputation in business surveys (given a proper design of consistency rules) as a partial alternative to manual data editing. Approach – The paper compares different methods for editing survey data in R with the "editrules" and "survey" packages, which contain transformations commonly used in official statistics; visualizes missing-value patterns with the "Amelia" and "VIM" packages; applies imputation approaches for longitudinal data with "VIMGUI"; and compares the performance of another statistical software package, SPSS, on the same features. Findings – Business statistics data received by the National Institute of Statistics (NIS) are not ready for direct analysis because the collected data sets contain in-record inconsistencies, errors and missing values. The appropriate automatic methods from R packages make it possible to identify the erroneous fields in edit-violating records and to verify the results after imputing missing values, giving users a flexible, less time-consuming approach. Automation is easier to perform in R than with SPSS macro syntax, even in situations where macros are very handy.
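
To make the idea of consistency rules and missing-value patterns concrete, here is a minimal sketch in plain Python. It is not the paper's R code and does not use the "editrules" or "VIM" APIs. The records, field names and both rules are hypothetical and chosen only for illustration.

```python
from collections import Counter

# Hypothetical business-survey records; None marks a missing value.
records = [
    {"turnover": 1200, "costs": 900, "profit": 300, "employees": 12},
    {"turnover": 800, "costs": 950, "profit": 100, "employees": 5},
    {"turnover": None, "costs": 400, "profit": 50, "employees": -2},
    {"turnover": 500, "costs": None, "profit": None, "employees": 3},
]

# Hypothetical consistency (edit) rules: name -> (fields used, predicate).
rules = {
    "balance": (
        ("turnover", "costs", "profit"),
        lambda r: r["turnover"] - r["costs"] == r["profit"],
    ),
    "nonneg_employees": (
        ("employees",),
        lambda r: r["employees"] >= 0,
    ),
}


def check(record):
    """Evaluate every rule; None means the rule cannot be evaluated
    because at least one of its input fields is missing."""
    result = {}
    for name, (fields, predicate) in rules.items():
        if any(record[f] is None for f in fields):
            result[name] = None
        else:
            result[name] = predicate(record)
    return result


def violated(record):
    """Names of the rules that the record definitely violates."""
    return sorted(name for name, ok in check(record).items() if ok is False)


def missing_patterns(rows):
    """Count how often each combination of missing fields occurs."""
    return Counter(
        tuple(sorted(k for k, v in row.items() if v is None)) for row in rows
    )
```

Calling `violated(record)` on each record flags edit-violating records, and `missing_patterns(records)` tabulates which fields tend to be missing together. That tabulation is the information a missingness plot summarizes before an imputation method is chosen.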

Keywords: Automated Edit Rules; Business Surveys; Missing Values; Multiple Imputation; Non-Response Weights; Pattern of Missing; Random vs. Systematic Errors; SPSS; SQL; Statistical software R
Date: 2014

Downloads: (external link)
http://www.revistadestatistica.ro/wp-content/uploads/2014/07/RRS_2_2014_a11.pdf (application/pdf)

Persistent link: https://EconPapers.repec.org/RePEc:rsr:journl:v:62:y:2014:i:2:p:129-146

Bibliographic data for series maintained by Adrian Visoiu.

Handle: RePEc:rsr:journl:v:62:y:2014:i:2:p:129-146