Likelihood-based tree reconstruction on a concatenation of aligned sequence data sets can be statistically inconsistent
Sebastien Roch and Mike Steel
Theoretical Population Biology, 2015, vol. 100, issue C, 56-62
Abstract:
The reconstruction of a species tree from genomic data faces a double hurdle. First, the (gene) tree describing the evolution of each gene may differ from the species tree, for instance, due to incomplete lineage sorting. Second, the aligned genetic sequences at the leaves of each gene tree provide merely an imperfect estimate of the topology of the gene tree. In this note, we demonstrate formally that a basic statistical problem arises if one tries to avoid accounting for these two processes and analyses the genetic data directly via a concatenation approach. More precisely, we show that, under the multispecies coalescent with a standard site substitution model, maximum likelihood estimation performed on sequence data concatenated across genes, under the incorrect assumption that all sites have evolved independently and identically on a fixed tree, is a statistically inconsistent estimator of the species tree. Our results provide a formal justification of simulation results described by Kubatko and Degnan (2007) and others, and complement recent theoretical results by DeGiorgio and Degnan (2010) and Chifman and Kubatko (2014).
Keywords: Phylogenetic reconstruction; Incomplete lineage sorting; Maximum likelihood; Consistency
Date: 2015
Downloads:
http://www.sciencedirect.com/science/article/pii/S0040580914001075
Full text for ScienceDirect subscribers only
Persistent link: https://EconPapers.repec.org/RePEc:eee:thpobi:v:100:y:2015:i:c:p:56-62
DOI: 10.1016/j.tpb.2014.12.005
Theoretical Population Biology is currently edited by Jeremy Van Cleve
Bibliographic data for series maintained by Catherine Liu.