The effect of random-effects misspecification on classification accuracy
El Saeiti Riham,
García-Fiñana Marta and
Hughes David M.
Additional contact information
El Saeiti Riham: Health Data Science, University of Liverpool Faculty of Health and Life Sciences, Liverpool, UK
García-Fiñana Marta: Health Data Science, University of Liverpool Faculty of Health and Life Sciences, Liverpool, UK
Hughes David M.: Health Data Science, University of Liverpool Faculty of Health and Life Sciences, Liverpool, UK
The International Journal of Biostatistics, 2022, vol. 18, issue 1, 279-292
Abstract:
Mixed models are a useful way of analysing longitudinal data. Random effects terms allow modelling of patient-specific deviations from the overall trend over time. Correlation between repeated measurements is captured by specifying a joint distribution for all random effects in a model. Typically, this joint distribution is assumed to be multivariate normal. For Gaussian outcomes, misspecification of the random effects distribution usually has little impact. However, when the outcome is discrete (e.g. counts or binary outcomes), generalised linear mixed models (GLMMs) are used to analyse longitudinal trends. Opinion is divided about how robust GLMMs are to misspecification of the random effects. Previous work explored the impact of random effects misspecification on the bias of model parameters in single-outcome GLMMs. Accepting that these model parameters may be biased, we investigate whether this affects our ability to classify patients into clinical groups using a longitudinal discriminant analysis. We also consider multiple outcomes, which can significantly increase the dimension of the random effects distribution when modelled simultaneously. We show that when there is severe departure from normality, more flexible mixture distributions can give better classification accuracy. However, in many cases, wrongly assuming a single multivariate normal distribution has little impact on classification accuracy.
Keywords: classification; generalised linear mixed models; longitudinal discriminant analysis; multivariate longitudinal data; random effects
Date: 2022
Downloads:
https://doi.org/10.1515/ijb-2019-0159 (text/html)
For access to full text, subscription to the journal or payment for the individual article is required.
Persistent link: https://EconPapers.repec.org/RePEc:bpj:ijbist:v:18:y:2022:i:1:p:279-292:n:15
Ordering information: This journal article can be ordered from
https://www.degruyter.com/journal/key/ijb/html
DOI: 10.1515/ijb-2019-0159
The International Journal of Biostatistics is currently edited by Antoine Chambaz, Alan E. Hubbard and Mark J. van der Laan
Bibliographic data for series maintained by Peter Golla.