Robust Remote Homology Detection by Feature Based Profile Hidden Markov Models
Plötz Thomas and
Fink Gernot A.
Additional contact information
Plötz Thomas: Bielefeld University, Faculty of Technology
Fink Gernot A.: Bielefeld University, Faculty of Technology
Statistical Applications in Genetics and Molecular Biology, 2005, vol. 4, issue 1, 28
Abstract:
The detection of remote homologies is of major importance for molecular biology applications like drug discovery. The problem is still very challenging even for state-of-the-art probabilistic models of protein families, namely Profile HMMs. In order to improve remote homology detection we propose feature based semi-continuous Profile HMMs. Based on a richer sequence representation consisting of features which capture the biochemical properties of residues in their local context, family specific semi-continuous models are estimated completely data-driven. Additionally, for substantially reducing the number of false predictions an explicit rejection model is estimated. Both the family specific semi-continuous Profile HMM and the non-target model are competitively evaluated. In the experimental evaluation of superfamily based screening of the SCOP database we demonstrate that semi-continuous Profile HMMs significantly outperform their discrete counterparts. Using the rejection model the number of false positive predictions could be reduced substantially which is an important prerequisite for target identification applications.
Keywords: Profile Hidden Markov Models (Profile HMMs); remote homology detection; protein sequence analysis; feature representation; target identification (search for similar items in EconPapers)
Date: 2005
References: View complete reference list from CitEc
Citations:
Downloads: (external link)
https://doi.org/10.2202/1544-6115.1159 (text/html)
For access to full text, subscription to the journal or payment for the individual article is required.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:bpj:sagmbi:v:4:y:2005:i:1:n:21
Ordering information: This journal article can be ordered from
https://www.degruyter.com/journal/key/sagmb/html
DOI: 10.2202/1544-6115.1159
Access Statistics for this article
Statistical Applications in Genetics and Molecular Biology is currently edited by Michael P. H. Stumpf
More articles in Statistical Applications in Genetics and Molecular Biology from De Gruyter
Bibliographic data for series maintained by Peter Golla ().