Statistical issues associated with modeling of synonymous mutation data
Huzurbazar Snehalata (),
Singh Sarabdeep and
Schlueter Jessica A.
Additional contact information
Huzurbazar Snehalata: Statistical and Applied Mathematical Sciences Institute, 19 T.W. Alexander Drive, P.O. Box 14006, Research Triangle Park, NC 27709-4006, USA Department of Statistics, University of Wyoming, Dept. 3332, 1000 E. University Ave, Laramie, WY 82071, USA Department of Statistics, North Carolina State University, 5109 SAS Hall, 2311 Stinson Drive, Raleigh, NC 27695-8203, USA
Singh Sarabdeep: National Center for Biotechnology Information, National Institutes of Health, Bldg. 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
Schlueter Jessica A.: Bioinformatics and Genomics, University of North Carolina at Charlotte, 9201 University City Blvd., Bioinformatics, Room 309, Charlotte, NC 28223, USA
Statistical Applications in Genetics and Molecular Biology, 2013, vol. 12, issue 3, 361-374
Abstract:
The explosion of data in evolutionary bioinformatics has led to sometimes ad hoc, incomplete and even inaccurate data analyses. Taking dS data, namely, data on synonymous substitutions per synonymous sites, we go through a statistical analysis for modeling the time since duplications of genes. We explore the shortcomings of previous analyses, especially with a view towards their effect on inference for the gene duplication process. We present a statistical analysis which respects the assumptions of the models and the integrity of the data, and emphasize that exploratory data analysis, formulation of a data model, its estimation and finally, assessment of the model are important steps in a complete data analysis. Furthermore, for dS data, we develop Bayesian discrete-continuous mixture models and present analyses using two genomes.
Keywords: Bayesian models; truncation; discrete-continuous mixture distribution; distributional fit; Weibull (search for similar items in EconPapers)
Date: 2013
References: Add references at CitEc
Citations:
Downloads: (external link)
https://doi.org/10.1515/sagmb-2012-0033 (text/html)
For access to full text, subscription to the journal or payment for the individual article is required.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:bpj:sagmbi:v:12:y:2013:i:3:p:361-374:n:1005
Ordering information: This journal article can be ordered from
https://www.degruyter.com/journal/key/sagmb/html
DOI: 10.1515/sagmb-2012-0033
Access Statistics for this article
Statistical Applications in Genetics and Molecular Biology is currently edited by Michael P. H. Stumpf
More articles in Statistical Applications in Genetics and Molecular Biology from De Gruyter
Bibliographic data for series maintained by Peter Golla ().