Aggregated Data and Compositional Variables: Methodological Note
Données Agrégées et Variables Compositionnelles: Note Méthodologique
Enora Belz (enora.belz@univ-rennes1.fr) and
Arthur Charpentier (charpentier.arthur@uqam.ca)
Additional contact information
Arthur Charpentier: CREM - Centre de recherche en économie et management - UNICAEN - Université de Caen Normandie - NU - Normandie Université - UR - Université de Rennes - CNRS - Centre National de la Recherche Scientifique
Working Papers from HAL
Abstract:
The reform of personal data law in Europe makes it difficult to access personal data (even often non-nominative), especially when looking for data considered sensitive (and often income falls into this category). One solution often considered is the provision of spatially aggregated data. However, these data pose two technical problems. The first is that categorical data become compositions. The second is related to the ecological paradox that says it is dangerous to infer individual econometric relationships from aggregate data. We will see here how to work with compositional data (possibly just to validate a classical linear regression approach - easier to interpret). And we will discuss the second, but unfortunately it remains too general to be dealt with satisfactorily. Translated with www.DeepL.com/Translator
Date: 2019-04-12
Note: View the original document on HAL open archive server: https://hal.science/hal-02097031
References: Add references at CitEc
Citations:
Downloads: (external link)
https://hal.science/hal-02097031/document (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:hal:wpaper:hal-02097031
Access Statistics for this paper
More papers in Working Papers from HAL
Bibliographic data for series maintained by CCSD (hal@ccsd.cnrs.fr).