EconPapers    
Economics at your fingertips  
 

Quantile Regression Approach for Analyzing Similarity of Gene Expressions under Multiple Biological Conditions

Dianliang Deng and Mashfiqul Huq Chowdhury
Additional contact information
Dianliang Deng: Department of Mathematics and Statistics, University of Regina, Regina, SK S4S 0A2, Canada
Mashfiqul Huq Chowdhury: Department of Statistics, Mawlana Bhashani Science and Technology University, Santosh, Tangail 1902, Bangladesh

Stats, 2022, vol. 5, issue 3, 1-23

Abstract: Temporal gene expression data contain ample information to characterize gene function and are now widely used in bio-medical research. A dense temporal gene expression usually shows various patterns in expression levels under different biological conditions. The existing literature investigates the gene trajectory using the mean function. However, temporal gene expression curves usually show a strong degree of heterogeneity under multiple conditions. As a result, rates of change for gene expressions may be different in non-central locations and a mean function model may not capture the non-central location of the gene expression distribution. Further, the mean regression model depends on the normality assumptions of the error terms of the model, which may be impractical when analyzing gene expression data. In this research, a linear quantile mixed model is used to find the trajectory of gene expression data. This method enables the changes in gene expression over time to be studied by estimating a family of quantile functions. A statistical test is proposed to test the similarity between two different gene expressions based on estimated parameters using a quantile model. Then, the performance of the proposed test statistic is examined using extensive simulation studies. Simulation studies demonstrate the good statistical performance of this proposed test statistic and show that this method is robust against normal error assumptions. As an illustration, the proposed method is applied to analyze a dataset of 18 genes in P. aeruginosa , expressed in 24 biological conditions. Furthermore, a minimum Mahalanobis distance is used to find the clustering tree for gene expressions.

Keywords: chi-square test; classification; linear mixed model; Mahalanobis distance; quantile analysis; temporal gene expressions (search for similar items in EconPapers)
JEL-codes: C1 C10 C11 C14 C15 C16 (search for similar items in EconPapers)
Date: 2022
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www.mdpi.com/2571-905X/5/3/36/pdf (application/pdf)
https://www.mdpi.com/2571-905X/5/3/36/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jstats:v:5:y:2022:i:3:p:36-605:d:854374

Access Statistics for this article

Stats is currently edited by Mrs. Minnie Li

More articles in Stats from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().

 
Page updated 2025-03-19
Handle: RePEc:gam:jstats:v:5:y:2022:i:3:p:36-605:d:854374