EconPapers    
Economics at your fingertips  
 

Towards a pragmatic approach to compositional data analysis

Michael Greenacre

Economics Working Papers from Department of Economics and Business, Universitat Pompeu Fabra

Abstract: Compositional data are nonnegative data with the property of closure: that is, each set of values on their components, or so-called parts, has a fixed sum, usually 1 or 100%. The approach to compositional data analysis originated by John Aitchison uses ratios of parts as the fundamental starting point for description and modeling. I show that a compositional data set can be effectively replaced by a set of ratios, one less than the number of parts, and that these ratios describe an acyclic connected graph of all the parts. Contrary to recent literature, I show that the additive log-ratio transformation can be an excellent substitute for the original data set, as shown in an archaeological data set as well as in three other examples. I propose further that a smaller set of ratios of parts can be determined, either by expert choice or by automatic selection, which explains as much variance as required for all practical purposes. These part ratios can then be validly summarized and analyzed by conventional univariate methods, as well as multivariate methods, where the ratios are preferably log-transformed.

Keywords: compositional data; log-ratio transformation; log-ratio analysis; log-ratio distance; multivariate analysis; ratios; subcompositional coherence; univariate statistics. (search for similar items in EconPapers)
JEL-codes: C19 C38 C55 Z32 (search for similar items in EconPapers)
Date: 2017-01
New Economics Papers: this item is included in nep-ecm
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
https://econ-papers.upf.edu/papers/1554.pdf Whole Paper (application/pdf)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:upf:upfgen:1554

Access Statistics for this paper

More papers in Economics Working Papers from Department of Economics and Business, Universitat Pompeu Fabra
Bibliographic data for series maintained by ( this e-mail address is bad, please contact ).

 
Page updated 2025-03-22
Handle: RePEc:upf:upfgen:1554