Correlated Protein Function Prediction via Maximization of Data-Knowledge Consistency

Hua Wang, Heng Huang, Chris Ding.

JCB - 2015

Conventional computational approaches for protein function prediction usually predict one function at a time, fundamentally. As a result, the protein functions are treated as separate target classes. However, biological processes are highly correlated in reality, which makes multiple functions assigned to a protein not independent. Therefore, it would be beneficial to make use of function category correlations when predicting protein functions. In this article, we propose a novel Maximization of Data-Knowledge Consistency (MDKC) approach to exploit function category correlations for protein function prediction. Our approach banks on the assumption that two proteins are likely to have large overlap in their annotated functions if they are highly similar according to certain experimental data. We first establish a new pairwise protein similarity using protein annotations from knowledge perspective. Then by maximizing the consistency between the established knowledge similarity upon annotations and the data similarity upon biological experiments, putative functions are assigned to unannotated proteins. Most importantly, function category correlations are gracefully incorporated into our learning objective through the knowledge similarity. Comprehensive experimental evaluations on the Saccharomyces cerevisiae species have demonstrated promising results that validate the performance of our methods.

Links

Cite this paper

MLA Copied to clipboard!
Wang, Hua, Heng Huang, and Chris Ding. "Correlated protein function prediction via maximization of data-knowledge consistency." Journal of Computational Biology 22.6 (2015): 546-562.
BibTeX Copied to clipboard!
@article{wang2015correlated,
  title={Correlated protein function prediction via maximization of data-knowledge consistency},
  author={Wang, Hua and Huang, Heng and Ding, Chris},
  journal={Journal of Computational Biology},
  volume={22},
  number={6},
  pages={546--562},
  year={2015},
  publisher={Mary Ann Liebert, Inc. 140 Huguenot Street, 3rd Floor New Rochelle, NY 10801 USA}
}