A Regression-constrained Optimization Approach to Estimating Suppressed Information using Time-series Data
Sumei Zhang and
Jean-Michel Guldmann
International Regional Science Review, 2015, vol. 38, issue 2, 119-150
Abstract:
Most regional economic databases (e.g., US Economic Census and County Business Patterns [CBP]) have some employment records suppressed and then represented as ranges, in order to guarantee the confidentiality of the data. This article incorporates the implicit temporal relationships between annual employment data over several years into an optimization model designed to estimate suppressed records. This model minimizes (1) the sum of the deviations between the estimates and target values within the corresponding ranges and (2) the sum of the deviations between the estimates and an employment trend curve endogenously determined through absolute-value regression. The 1999–2006 CBP data for Arizona are used to test the model. Two decision-theoretic criteria (Pareto frontier and concordance–discordance analysis) are used to analyze the results, pointing to a specific set of parameters yielding the best estimates.
Keywords: data suppression; county business patterns; employment trend curve; absolute-value regression; optimization (search for similar items in EconPapers)
Date: 2015
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
https://journals.sagepub.com/doi/10.1177/0160017613501866 (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:sae:inrsre:v:38:y:2015:i:2:p:119-150
DOI: 10.1177/0160017613501866
Access Statistics for this article
More articles in International Regional Science Review
Bibliographic data for series maintained by SAGE Publications ().