Secondary protein structure prediction combining protein structural class, relative surface accessibility, and contact number
Imad Rahal and Jonathon Walz
International Journal of Data Science, 2018, vol. 3, issue 1, 68-85
Abstract:
With huge amounts of molecular data produced from ever-increasing numbers of genomic and proteomic studies, predicting the secondary structure of proteins from amino acid sequences has become a common expectation among scientists. Several studies in the literature have demonstrated that the accuracy of such predictions can be drastically improved by incorporating additional types of protein data into the prediction process; however, no work has studied the effect of incorporating multiple types of protein data simultaneously. In this work, we report our findings from an extensive experimental study that uses neural networks designed to study the effect of using different combinations of protein data on the accuracy of predicting secondary protein structures. Overall, our experimental results indicate that accuracy improves the most when incorporating contact number, relative surface accessibility or any combination that includes at least one of the two into the prediction process.
Keywords: protein structure prediction; neural networks; machine learning; scientific data mining; data science; bioinformatics; protein structural class; relative surface accessibility; protein contact number
Date: 2018
Downloads: (external link)
http://www.inderscience.com/link.php?id=90624 (text/html)
Access to full text is restricted to subscribers.
Persistent link: https://EconPapers.repec.org/RePEc:ids:ijdsci:v:3:y:2018:i:1:p:68-85
More articles in International Journal of Data Science from Inderscience Enterprises Ltd
Bibliographic data for series maintained by Sarah Parker.