Improved variational Bayes inference for transcript expression estimation
Papastamoulis Panagiotis (),
Hensman James,
Glaus Peter and
Rattray Magnus
Additional contact information
Papastamoulis Panagiotis: University of Manchester, Michael Smith Building, Oxford Road, Manchester, M13 9PT, UK
Hensman James: University of Sheffield, The Sheffield Institute for Translational Neuroscience, 385A Glossop Road, Sheffield, S10 2HQ, UK
Glaus Peter: University of Manchester, Michael Smith Building, Oxford Road, Manchester, M13 9PT, UK
Rattray Magnus: University of Manchester, Michael Smith Building, Oxford Road, Manchester, M13 9PT, UK
Statistical Applications in Genetics and Molecular Biology, 2014, vol. 13, issue 2, 203-216
Abstract:
RNA-seq studies allow for the quantification of transcript expression by aligning millions of short reads to a reference genome. However, transcripts share much of their sequence, so that many reads map to more than one place and their origin remains uncertain. This problem can be dealt using mixtures of distributions and transcript expression reduces to estimating the weights of the mixture. In this paper, variational Bayesian (VB) techniques are used in order to approximate the posterior distribution of transcript expression. VB has previously been shown to be more computationally efficient for this problem than Markov chain Monte Carlo. VB methodology can precisely estimate the posterior means, but leads to variance underestimation. For this reason, a novel approach is introduced which integrates the latent allocation variables out of the VB approximation. It is shown that this modification leads to a better marginal likelihood bound and improved estimate of the posterior variance. A set of simulation studies and application to real RNA-seq datasets highlight the improved performance of the proposed method.
Keywords: Kullback-Leibler divergence; marginal likelihood bound; generalized Dirichlet distribution; BitSeq; mixture model (search for similar items in EconPapers)
Date: 2014
References: View complete reference list from CitEc
Citations:
Downloads: (external link)
https://doi.org/10.1515/sagmb-2013-0054 (text/html)
For access to full text, subscription to the journal or payment for the individual article is required.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:bpj:sagmbi:v:13:y:2014:i:2:p:203-216:n:6
Ordering information: This journal article can be ordered from
https://www.degruyter.com/journal/key/sagmb/html
DOI: 10.1515/sagmb-2013-0054
Access Statistics for this article
Statistical Applications in Genetics and Molecular Biology is currently edited by Michael P. H. Stumpf
More articles in Statistical Applications in Genetics and Molecular Biology from De Gruyter
Bibliographic data for series maintained by Peter Golla ().