Using the Multi-instance Learning Method to Predict Protein-Protein Interactions with Domain Information

Zhang, Yan-Ping; Zha, Yongliang; Li, Xinrui; Zhao, Shu; Du, Xiuquan

doi:10.1007/978-3-319-11740-9_24

Yan-Ping Zhang^10,11,
Yongliang Zha^10,11,
Xinrui Li^10,11,
Shu Zhao^10,11 &
…
Xiuquan Du^10,11

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8818))

Included in the following conference series:

International Conference on Rough Sets and Knowledge Technology

3797 Accesses
1 Citations

Abstract

Identifying protein-protein interactions (PPIs) can help us to know the protein function and is critical for understanding the mechanisms of proteome. Recently, lots of computational methods such as the domain-based approach have been developed for predicting the protein-protein interactions. The conventional domain-based methods usually need to infer the interacting domain pairs from already known interacting sets of proteins, and then to predict the PPIs. However, it is difficult to provide the detailed information that which of the domain pairs will actually interact for the PPIs prediction. Therefore, it is of great importance to develop a new computational model which can ignore the information whether a domain pair is interacting or not. In this paper, we propose a novel method using multi-instance learning (MIL) for predicting protein-protein interactions based on the domain information. Firstly, the domain pairs of two proteins were composed. Then, we use the amino acid composition feature encoding method to encode the domain pairs. Finally, two multi-instance learning methods were used for training the data. The experiment results demonstrate that the proposed method is effective.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Shi, M.G., et al.: Predicting protein–protein interactions from sequence using correlation coefficient and high-quality interaction dataset. Amino Acids 38(3), 891–899 (2010)
Article Google Scholar
Guo, Y., Yu, L., Wen, Z., et al.: Using support vector machine combined with auto covariance to predict protein–protein interactions from protein sequences. Nucleic Acids Research 36(9), 3025–3030 (2008)
Article Google Scholar
Skrabanek, L., Saini, H.K., Bader, G.D., et al.: Computational prediction of protein–protein Interactions. Molecular Biotechnology 38(1), 1–17 (2008)
Article Google Scholar
Yu, J., Fotouhi, F.: Computational approaches for predicting protein–protein interactions: A survey. Journal of Medical Systems 30(1), 39–44 (2006)
Article Google Scholar
Zhang, Q.C., Petrey, D., Deng, L., et al.: Structure-based prediction of protein-protein interactions on a genome-wide scale. Nature 490(7421), 556–560 (2012)
Article Google Scholar
You, Z.H., Lei, Y.K., Zhu, L., et al.: Prediction of protein-protein interactions from amino acid sequences with ensemble extreme learning machines and principal component analysis. BMC Bioinformatics 14(suppl. 8), S10 (2013)
Google Scholar
Zahiri, J., Yaghoubi, O., Mohammad-Noori, M., et al.: PPIevo: Protein–protein interaction prediction from PSSM based evolutionary information. Genomics 102(4), 237–242 (2013)
Article Google Scholar
Memi, V., Wallqvist, A., Reifman, J.: Reconstituting protein interaction networks using parameter-dependent domain-domain interactions. BMC Bioinformatics 14(1), 154 (2013)
Article Google Scholar
Wojcik, J., Schächter, V.: Protein-protein interaction map inference using interacting domain profile pairs. Bioinformatics 17(suppl. 1), S296–S305 (2001)
Google Scholar
Roslan, R., Othman, R.M., Shah, Z.A., et al.: Utilizing shared interacting domain patterns and Gene Ontology information to improve protein–protein interaction prediction. Computers in Biology and Medicine 40(6), 555–564 (2010)
Article Google Scholar
Binny, P.S., Saha, S., Anishetty, R., et al.: A matrix based algorithm for protein–protein interaction prediction using domain–domain associations. Journal of Theoretical Biology 326, 36–42 (2013)
Article MathSciNet Google Scholar
Jang, W.H., Jung, S.H., Han, D.S.: A computational model for predicting protein interactions based on multidomain collaboration. IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB) 9(4), 1081–1090 (2012)
Article Google Scholar
Ray, S., Scott, S., Blockeel, H.: Multi-instance learning. In: Encyclopedia of Machine Learning, pp. 701–710 (2010)
Google Scholar
Zhou, Z.H.: Multi-instance learning: A survey. Department of Computer Science and Technology. Nanjing University (2004)
Google Scholar
Gärtner, T., Flach, P.A., et al.: Multi-Instance Kernels. In: Proceedings of the 19th International Conference on Machine Learning, Sydney, Australia, pp. 179–186 (2002)
Google Scholar
Mei, S.Y., Fei, W.: Structural Domain Based Multiple Instance Learning for Predicting Gram-Positive Bacterial Protein Subcellular Localization. In: International Joint Conference, pp. 195–200. IEEE (2009)
Google Scholar
Wang, J., Zucker, J.D.: Solving multiple-instance problem: A lazy learning approach. In: Proceedings of the 17th International Conference on Machine Learning, San Francisco, pp. 1119–1125 (2000)
Google Scholar
Zhou, Z.-H., Zhang, M.-L.: Ensembles of multi-instance learners. In: Lavrač, N., Gamberger, D., Todorovski, L., Blockeel, H. (eds.) ECML 2003. LNCS (LNAI), vol. 2837, pp. 492–502. Springer, Heidelberg (2003)
Google Scholar
Zhang, Y.P., Zhang, H., et al.: Multiple-Instance Learning with Instance Selection via Constructive Covering Algorithm. Tsinghua Science and Technology 19 (2014)
Google Scholar
Zhang, L., Zhang, B.: A geometrical-representationMcCulloch-Neural model and its application. IEEETransactions on Neural Networks 10, 925–929 (1999)
Article Google Scholar
Jang, W.H., Jung, S.H., Han, D.S.: A computational model for predicting protein interactions based on multidomain collaboration. IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB) 9(4), 1081–1090 (2012)
Article Google Scholar
Shen, J., Zhang, J., et al.: Predicting protein–protein interactions based only on sequences information. Proceedings of the National Academy of Sciences 104(11), 4337–4341 (2007)
Article MathSciNet Google Scholar

Download references

Author information

Authors and Affiliations

Key Laboratory of Intelligent Computing and Signal Processing of Ministry of Education, Anhui University, Hefei, 230601, Anhui Province, P.R. China
Yan-Ping Zhang, Yongliang Zha, Xinrui Li, Shu Zhao & Xiuquan Du
School of Computer Science and Technology, Anhui University, Hefei, 230601, P.R. China
Yan-Ping Zhang, Yongliang Zha, Xinrui Li, Shu Zhao & Xiuquan Du

Authors

Yan-Ping Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Yongliang Zha
View author publications
You can also search for this author in PubMed Google Scholar
Xinrui Li
View author publications
You can also search for this author in PubMed Google Scholar
Shu Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Xiuquan Du
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Tongji University, Shanghai, China
Duoqian Miao
Department of Electrical and Computer En, University of Alberta, Edmonton, Alberta, Canada
Witold Pedrycz
University of Warsaw, Warsaw, Poland
Dominik Ślȩzak
University of Applied Sciences, München, Germany
Georg Peters
Tianjin University, Tianjin, China
Qinghua Hu
Tongji University, Shanghai, China
Ruizhi Wang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhang, YP., Zha, Y., Li, X., Zhao, S., Du, X. (2014). Using the Multi-instance Learning Method to Predict Protein-Protein Interactions with Domain Information. In: Miao, D., Pedrycz, W., Ślȩzak, D., Peters, G., Hu, Q., Wang, R. (eds) Rough Sets and Knowledge Technology. RSKT 2014. Lecture Notes in Computer Science(), vol 8818. Springer, Cham. https://doi.org/10.1007/978-3-319-11740-9_24

Download citation

DOI: https://doi.org/10.1007/978-3-319-11740-9_24
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-11739-3
Online ISBN: 978-3-319-11740-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics