A solution to minimum sample size for regressions
David G Jenkins and
Pedro F Quintana-Ascencio
PLOS ONE, 2020, vol. 15, issue 2, 1-15
Abstract:
Regressions and meta-regressions are widely used to estimate patterns and effect sizes in various disciplines. However, many biological and medical analyses use relatively low sample size (N), contributing to concerns on reproducibility. What is the minimum N to identify the most plausible data pattern using regressions? Statistical power analysis is often used to answer that question, but it has its own problems and logically should follow model selection to first identify the most plausible model. Here we make null, simple linear and quadratic data with different variances and effect sizes. We then sample and use information theoretic model selection to evaluate minimum N for regression models. We also evaluate the use of coefficient of determination (R2) for this purpose; it is widely used but not recommended. With very low variance, both false positives and false negatives occurred at N
Date: 2020
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (20)
Downloads: (external link)
https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0229345 (text/html)
https://journals.plos.org/plosone/article/file?id= ... 29345&type=printable (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:plo:pone00:0229345
DOI: 10.1371/journal.pone.0229345
Access Statistics for this article
More articles in PLOS ONE from Public Library of Science
Bibliographic data for series maintained by plosone ().