Using persistent homology and dynamical distances to analyze protein binding
Kovacev-Nikolic Violeta,
Bubenik Peter,
Nikolić Dragan and
Heo Giseon ()
Additional contact information
Kovacev-Nikolic Violeta: Department of Mathematical and Statistical Sciences, University of Alberta, Canada
Bubenik Peter: Department of Mathematics, University of Florida, USA
Nikolić Dragan: Department of Mechanical Engineering, University of Alberta and National Institute for Nanotechnology, Canada
Heo Giseon: School of Dentistry; Department of Mathematical and Statistical Sciences, University of Alberta, Edmonton, Canada, T6G 2N8
Statistical Applications in Genetics and Molecular Biology, 2016, vol. 15, issue 1, 19-38
Abstract:
Persistent homology captures the evolution of topological features of a model as a parameter changes. The most commonly used summary statistics of persistent homology are the barcode and the persistence diagram. Another summary statistic, the persistence landscape, was recently introduced by Bubenik. It is a functional summary, so it is easy to calculate sample means and variances, and it is straightforward to construct various test statistics. Implementing a permutation test we detect conformational changes between closed and open forms of the maltose-binding protein, a large biomolecule consisting of 370 amino acid residues. Furthermore, persistence landscapes can be applied to machine learning methods. A hyperplane from a support vector machine shows the clear separation between the closed and open proteins conformations. Moreover, because our approach captures dynamical properties of the protein our results may help in identifying residues susceptible to ligand binding; we show that the majority of active site residues and allosteric pathway residues are located in the vicinity of the most persistent loop in the corresponding filtered Vietoris-Rips complex. This finding was not observed in the classical anisotropic network model.
Keywords: dynamical distance; persistence landscape; persistent homology; simplicial complex; support vector machine (search for similar items in EconPapers)
Date: 2016
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
https://doi.org/10.1515/sagmb-2015-0057 (text/html)
For access to full text, subscription to the journal or payment for the individual article is required.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:bpj:sagmbi:v:15:y:2016:i:1:p:19-38:n:3
Ordering information: This journal article can be ordered from
https://www.degruyter.com/journal/key/sagmb/html
DOI: 10.1515/sagmb-2015-0057
Access Statistics for this article
Statistical Applications in Genetics and Molecular Biology is currently edited by Michael P. H. Stumpf
More articles in Statistical Applications in Genetics and Molecular Biology from De Gruyter
Bibliographic data for series maintained by Peter Golla ().