Incorporating Clustering Techniques into GAMLSS
Thiago G. Ramires,
Luiz R. Nakamura,
Ana J. Righetto,
Andréa C. Konrath and
Carlos A. B. Pereira
Additional contact information
Thiago G. Ramires: Campus Apucarana, Universidade Tecnológica Federal do Paraná, Apucarana 86812-460, Brazil
Luiz R. Nakamura: Departamento de Informática e Estatística, Universidade Federal de Santa Catarina, Florianópolis 88040-900, Brazil
Ana J. Righetto: Alvaz Agritech, Londrina 86050-268, Brazil
Andréa C. Konrath: Departamento de Informática e Estatística, Universidade Federal de Santa Catarina, Florianópolis 88040-900, Brazil
Carlos A. B. Pereira: Instituto de Matemática e Estatística, Universidade de São Paulo, São Paulo 05508-090, Brazil
Stats, 2021, vol. 4, issue 4, 1-15
Abstract:
A method for statistical analysis of multimodal and/or highly distorted data is presented. The new methodology combines different clustering methods with the GAMLSS (generalized additive models for location, scale, and shape) framework, and is therefore called c-GAMLSS, for “clustering GAMLSS. ” In this new extended structure, a latent variable (cluster) is created to explain the response-variable (target). Any and all parameters of the distribution for the response variable can also be modeled by functions of the new covariate added to other available resources (features). The method of selecting resources to be used is carried out in stages, a step-based method. A simulation study considering multiple scenarios is presented to compare the c-GAMLSS method with existing Gaussian mixture models. We show by means of four different data applications that in cases where other authentic explanatory variables are or are not available, the c-GAMLSS structure outperforms mixture models, some recently developed complex distributions, cluster-weighted models, and a mixture-of-experts model. Even though we use simple distributions in our examples, other more sophisticated distributions can be used to explain the response variable.
Keywords: bimodal distributions; GAMLSS; mixture models; regression models; statistical learning (search for similar items in EconPapers)
JEL-codes: C1 C10 C11 C14 C15 C16 (search for similar items in EconPapers)
Date: 2021
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (2)
Downloads: (external link)
https://www.mdpi.com/2571-905X/4/4/53/pdf (application/pdf)
https://www.mdpi.com/2571-905X/4/4/53/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jstats:v:4:y:2021:i:4:p:53-930:d:677377
Access Statistics for this article
Stats is currently edited by Mrs. Minnie Li
More articles in Stats from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().