EconPapers    
Economics at your fingertips  
 

Synonymous and nonsynonymous distances help untangle convergent evolution and recombination

Chi Peter B., Chattopadhyay Sujay, Lemey Philippe, Sokurenko Evgeni V. and Minin Vladimir N. ()
Additional contact information
Chi Peter B.: Department of Statistics, California Polytechnic State University, San Luis Obispo, CA, 93407, USA
Chattopadhyay Sujay: Department of Microbiology, University of Washington, Seattle, WA, 98195, USA
Lemey Philippe: Rega Institute, Department of Microbiology and Immunology, KU Leuven – University of Leuven, B-3000 Leuven, Belgium
Sokurenko Evgeni V.: Department of Microbiology, University of Washington, Seattle, WA, 98195, USA
Minin Vladimir N.: Departments of Statistics and Biology, University of Washington, Seattle, WA, 98195, USA

Statistical Applications in Genetics and Molecular Biology, 2015, vol. 14, issue 4, 375-389

Abstract: When estimating a phylogeny from a multiple sequence alignment, researchers often assume the absence of recombination. However, if recombination is present, then tree estimation and all downstream analyses will be impacted, because different segments of the sequence alignment support different phylogenies. Similarly, convergent selective pressures at the molecular level can also lead to phylogenetic tree incongruence across the sequence alignment. Current methods for detection of phylogenetic incongruence are not equipped to distinguish between these two different mechanisms and assume that the incongruence is a result of recombination or other horizontal transfer of genetic information. We propose a new recombination detection method that can make this distinction, based on synonymous codon substitution distances. Although some power is lost by discarding the information contained in the nonsynonymous substitutions, our new method has lower false positive probabilities than the comparable recombination detection method when the phylogenetic incongruence signal is due to convergent evolution. We apply our method to three empirical examples, where we analyze: (1) sequences from a transmission network of the human immunodeficiency virus, (2) tlpB gene sequences from a geographically diverse set of 38 Helicobacter pylori strains, and (3) hepatitis C virus sequences sampled longitudinally from one patient.

Keywords: bacterial evolution; evolutionary distances; phylogenetics; viral evolution (search for similar items in EconPapers)
Date: 2015
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
https://doi.org/10.1515/sagmb-2014-0078 (text/html)
For access to full text, subscription to the journal or payment for the individual article is required.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:bpj:sagmbi:v:14:y:2015:i:4:p:375-389:n:5

Ordering information: This journal article can be ordered from
https://www.degruyter.com/journal/key/sagmb/html

DOI: 10.1515/sagmb-2014-0078

Access Statistics for this article

Statistical Applications in Genetics and Molecular Biology is currently edited by Michael P. H. Stumpf

More articles in Statistical Applications in Genetics and Molecular Biology from De Gruyter
Bibliographic data for series maintained by Peter Golla ().

 
Page updated 2025-03-19
Handle: RePEc:bpj:sagmbi:v:14:y:2015:i:4:p:375-389:n:5