Protein-protein interaction prediction based on multiple kernels and partial network with linear programming

Author(s)Huang, Lei
Author(s)Liao, Li
Author(s)Wu, Cathy H.
Ordered AuthorLei Huang, Li Liao and Cathy H. Wu
UD AuthorHuang, Leien_US
UD AuthorLiao, Lien_US
UD AuthorWu, Cathy H.en_US
Date Accessioned2016-10-13T14:08:37Z
Date Available2016-10-13T14:08:37Z
Copyright DateCopyright © 2016 The Author(s).en_US
Publication Date2016-08-01
DescriptionPublisher's PDFen_US
AbstractBACKGROUND: Prediction of de novo protein-protein interaction is a critical step toward reconstructing PPI networks, which is a central task in systems biology. Recent computational approaches have shifted from making PPI prediction based on individual pairs and single data source to leveraging complementary information from multiple heterogeneous data sources and partial network structure. However, how to quickly learn weights for heterogeneous data sources remains a challenge. In this work, we developed a method to infer de novo PPIs by combining multiple data sources represented in kernel format and obtaining optimal weights based on random walk over the existing partial networks. RESULTS: Our proposed method utilizes Barker algorithm and the training data to construct a transition matrix which constrains how a random walk would traverse the partial network. Multiple heterogeneous features for the proteins in the network are then combined into the form of weighted kernel fusion, which provides a new "adjacency matrix" for the whole network that may consist of disconnected components but is required to comply with the transition matrix on the training subnetwork. This requirement is met by adjusting the weights to minimize the element-wise difference between the transition matrix and the weighted kernels. The minimization problem is solved by linear programming. The weighted kernel fusion is then transformed to regularized Laplacian (RL) kernel to infer missing or new edges in the PPI network, which can potentially connect the previously disconnected components. CONCLUSIONS: The results on synthetic data demonstrated the soundness and robustness of the proposed algorithms under various conditions. And the results on real data show that the accuracies of PPI prediction for yeast data and human data measured as AUC are increased by up to 19 % and 11 % respectively, as compared to a control method without using optimal weights. Moreover, the weights learned by our method Weight Optimization by Linear Programming (WOLP) are very consistent with that learned by sampling, and can provide insights into the relations between PPIs and various feature kernel, thereby improving PPI prediction even for disconnected PPI networks.en_US
DepartmentUniversity of Delaware. Department of Computer and Information Sciences.en_US
DepartmentUniversity of Delaware. Center for Bioinformatics & Computational Biology.en_US
CitationHuang, Lei, Li Liao, and Cathy H. Wu. "Protein-protein interaction prediction based on multiple kernels and partial network with linear programming." BMC Systems Biology 10.2 (2016): 45.en_US
DOIDOI 10.1186/s12918-016-0296-xen_US
ISSN1752-0509en_US
URLhttp://udspace.udel.edu/handle/19716/19808
Languageen_USen_US
PublisherBioMed Centralen_US
dc.rightsCC BY 4.0 The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.en_US
dc.sourceBMC Systems Biologyen_US
dc.source.urihttps://bmcsystbiol.biomedcentral.com/en_US
TitleProtein-protein interaction prediction based on multiple kernels and partial network with linear programmingen_US
TypeArticleen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Protein-protein interaction prediction_1471018524T1015.pdf
Size:
1.28 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.22 KB
Format:
Item-specific license agreed upon to submission
Description: