EconPapers    
Economics at your fingertips  
 

Sample Size Calculations for Designing Clinical Proteomic Profiling Studies Using Mass Spectrometry

Nyangoma Stephen O., Collins Stuart I., Altman Douglas G., Johnson Philip and Billingham Lucinda J.
Additional contact information
Nyangoma Stephen O.: Centre for Molecular Medicine, Ninewells Hospital, and University of Dundee
Collins Stuart I.: Cancer Research UK and University of Birmingham
Altman Douglas G.: University of Oxford
Johnson Philip: Cancer Research UK and University of Birmingham
Billingham Lucinda J.: Cancer Research UK and University of Birmingham

Statistical Applications in Genetics and Molecular Biology, 2012, vol. 11, issue 3, 42

Abstract: In cancer clinical proteomics, MALDI and SELDI profiling are used to search for biomarkers of potentially curable early-stage disease. A given number of samples must be analysed in order to detect clinically relevant differences between cancers and controls, with adequate statistical power. From clinical proteomic profiling studies, expression data for each peak (protein or peptide) from two or more clinically defined groups of subjects are typically available. Typically, both exposure and confounder information on each subject are also available, and usually the samples are not from randomized subjects. Moreover, the data is usually available in replicate. At the design stage, however, covariates are not typically available and are often ignored in sample size calculations. This leads to the use of insufficient numbers of samples and reduced power when there are imbalances in the numbers of subjects between different phenotypic groups. A method is proposed for accommodating information on covariates, data imbalances and design-characteristics, such as the technical replication and the observational nature of these studies, in sample size calculations. It assumes knowledge of a joint distribution for the protein expression values and the covariates. When discretized covariates are considered, the effect of the covariates enters the calculations as a function of the proportions of subjects with specific attributes. This makes it relatively straightforward (even when pilot data on subject covariates is unavailable) to specify and to adjust for the effect of the expected heterogeneities. The new method suggests certain experimental designs which lead to the use of a smaller number of samples when planning a study. Analysis of data from the proteomic profiling of colorectal cancer reveals that fewer samples are needed when a study is balanced than when it is unbalanced, and when the IMAC30 chip-type is used. The method is implemented in the clippda package and is available in R at: http://www.bioconductor.org/help/bioc-views/release/bioc/html/clippda.html.

Keywords: sample size calculations; data imbalance; heterogeneity; covariates; technical replicates; observational study; expected Fisher information; cancer; clinical proteomics; SELDI; designing a proteomic profiling experiment (search for similar items in EconPapers)
Date: 2012
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
https://doi.org/10.1515/1544-6115.1686 (text/html)
For access to full text, subscription to the journal or payment for the individual article is required.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:bpj:sagmbi:v:11:y:2012:i:3:n:2

Ordering information: This journal article can be ordered from
https://www.degruyte ... urnal/key/sagmb/html

DOI: 10.1515/1544-6115.1686

Access Statistics for this article

Statistical Applications in Genetics and Molecular Biology is currently edited by Michael P. H. Stumpf

More articles in Statistical Applications in Genetics and Molecular Biology from De Gruyter
Bibliographic data for series maintained by Peter Golla ().

 
Page updated 2025-06-11
Handle: RePEc:bpj:sagmbi:v:11:y:2012:i:3:n:2