The effect of random-effects misspecification on classification accuracy

Riham, El Saeiti; Marta, García-Fiñana; M., Hughes David

The effect of random-effects misspecification on classification accuracy

El Saeiti Riham, García-Fiñana Marta and Hughes David M. ()
Additional contact information
El Saeiti Riham: Health Data Science, University of Liverpool Faculty of Health and Life Sciences, Liverpool, UK
García-Fiñana Marta: Health Data Science, University of Liverpool Faculty of Health and Life Sciences, Liverpool, UK
Hughes David M.: Health Data Science, University of Liverpool Faculty of Health and Life Sciences, Liverpool, UK

The International Journal of Biostatistics, 2022, vol. 18, issue 1, 279-292

Abstract: Mixed models are a useful way of analysing longitudinal data. Random effects terms allow modelling of patient specific deviations from the overall trend over time. Correlation between repeated measurements are captured by specifying a joint distribution for all random effects in a model. Typically, this joint distribution is assumed to be a multivariate normal distribution. For Gaussian outcomes misspecification of the random effects distribution usually has little impact. However, when the outcome is discrete (e.g. counts or binary outcomes) generalised linear mixed models (GLMMs) are used to analyse longitudinal trends. Opinion is divided about how robust GLMMs are to misspecification of the random effects. Previous work explored the impact of random effects misspecification on the bias of model parameters in single outcome GLMMs. Accepting that these model parameters may be biased, we investigate whether this affects our ability to classify patients into clinical groups using a longitudinal discriminant analysis. We also consider multiple outcomes, which can significantly increase the dimensions of the random effects distribution when modelled simultaneously. We show that when there is severe departure from normality, more flexible mixture distributions can give better classification accuracy. However, in many cases, wrongly assuming a single multivariate normal distribution has little impact on classification accuracy.

Keywords: classification; generalised linear mixed models; longitudinal discriminant analysis; multivariate longitudinal data; random effects (search for similar items in EconPapers)
Date: 2022
References: Add references at CitEc
Citations:

Downloads: (external link)
https://doi.org/10.1515/ijb-2019-0159 (text/html)
For access to full text, subscription to the journal or payment for the individual article is required.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:bpj:ijbist:v:18:y:2022:i:1:p:279-292:n:15

Ordering information: This journal article can be ordered from
https://www.degruyte ... journal/key/ijb/html

DOI: 10.1515/ijb-2019-0159

Access Statistics for this article

The International Journal of Biostatistics is currently edited by Antoine Chambaz, Alan E. Hubbard and Mark J. van der Laan

More articles in The International Journal of Biostatistics from De Gruyter
Bibliographic data for series maintained by Peter Golla ().